ai-language-detection

This code implements a language detection model: it uses a Naive Bayes classifier to assign a given text to one of the 22 languages present in the dataset.

The code imports several libraries: pandas for data manipulation and analysis, numpy for numerical computing and array handling, scikit-learn's CountVectorizer for extracting features from text, scikit-learn's train_test_split for splitting data into training and test sets, and tabulate for printing data as a formatted table.

The datasets used in this code contain 39 languages combined, with more than 1000 sentences per language. The output shows the count of each language in the dataset.
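The per-language count can be sketched with pandas as follows. This is a minimal illustration, not the code in main.py: the column names `Text` and `language` and the small in-memory table are assumptions standing in for the real CSV files.

```python
import pandas as pd

# Hypothetical stand-in for one of the CSV datasets.
# The column names "Text" and "language" are assumptions.
data = pd.DataFrame({
    "Text": [
        "Hello there",
        "Good morning",
        "Bonjour à tous",
        "Hola amigo",
        "Buenos días",
    ],
    "language": ["English", "English", "French", "Spanish", "Spanish"],
})

# Count how many sentences each language contributes to the dataset.
language_counts = data["language"].value_counts()

print(language_counts.to_string())
```

With the real files, the in-memory table would be replaced by something like `pd.read_csv("dataset1.csv")`, assuming the file has a comparable label column.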

The code then splits the data into training and test sets and trains the Naive Bayes model on the training set so that it can predict the language of a given text. Finally, it prints a table showing the predicted language of the input text.
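The split, train, and predict steps can be sketched as below. This is a hedged example under stated assumptions: the toy sentences, the two-language label set, the choice of `MultinomialNB` as the Naive Bayes variant, the split parameters, and the helper `predict_language` are all illustrative and not taken from main.py.

```python
import numpy as np
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB

# Toy data (assumption): the real project reads its sentences from CSV files.
texts = np.array([
    "the cat sleeps on the sofa",
    "the dog runs in the park",
    "the sun is bright today",
    "she reads the book at home",
    "we walk to the market",
    "the bird sings in the garden",
    "el gato duerme en el sillon",
    "el perro corre en el parque",
    "el sol brilla hoy",
    "ella lee el libro en casa",
    "caminamos hacia el mercado",
    "el pajaro canta en el jardin",
])
labels = np.array(["English"] * 6 + ["Spanish"] * 6)

# Hold out part of the data for testing; stratify keeps both languages
# represented in the training and test sets.
X_train, X_test, y_train, y_test = train_test_split(
    texts, labels, test_size=0.25, random_state=42, stratify=labels
)

# Turn each sentence into a vector of word counts.
vectorizer = CountVectorizer()
X_train_counts = vectorizer.fit_transform(X_train)

# Fit a multinomial Naive Bayes classifier on the word counts.
model = MultinomialNB()
model.fit(X_train_counts, y_train)

# Accuracy on the held-out sentences.
accuracy = model.score(vectorizer.transform(X_test), y_test)


def predict_language(text: str) -> str:
    """Return the predicted language label for a single text (hypothetical helper)."""
    return str(model.predict(vectorizer.transform([text]))[0])


print("Test accuracy:", accuracy)
print("Prediction:", predict_language("the dog sleeps in the garden"))
```

The vectorizer is fitted on the training sentences only and then reused for the test set and for new input, so the model never sees vocabulary statistics from held-out data.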

Flowchart


Screenshots: Dataset 1 language count, Dataset 2 language count, Language Prediction



AmirAliuA/ai-language-detection
