Toxic Comment Classification

This is my codes for the toxic comment classification competition hosted in Kaggle. Fully modified to another level from the base code here

To download datasets please run get_data.sh

The Task

The dataset comprises of comments from Wikipedia’s talk page edits. It is a large number of Wikipedia comments which have been labeled by human raters for toxic behavior. The types of toxicity are:

toxic

severe_toxic

obscene

threat

insult

identity_hate

The Approach

Creating an ensemble model which predicts a probability of each type of toxicity for each comment.Full explaination of my approach is documented here

Install Pre-requisites

run install.sh and then run pip install -r requirements.txt

Tips

Make sure embeddings original preprocessing is used to ensure highest percentage of embeddings can be imported

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
EXTRATREES_CLASSIFIER.ipynb		EXTRATREES_CLASSIFIER.ipynb
HillCLIMBENSEMBLE.ipynb		HillCLIMBENSEMBLE.ipynb
LGBM_LOGREG_XGB_STACK_LOGREG.ipynb		LGBM_LOGREG_XGB_STACK_LOGREG.ipynb
README.md		README.md
RIDGE.ipynb		RIDGE.ipynb
Train Toxicity Model.ipynb		Train Toxicity Model.ipynb
Untitled.ipynb		Untitled.ipynb
XGBOOST.ipynb		XGBOOST.ipynb
add_covaai.ipynb		add_covaai.ipynb
badwords.ipynb		badwords.ipynb
bagging.ipynb		bagging.ipynb
conv.ipynb		conv.ipynb
convai_feature.ipynb		convai_feature.ipynb
ensemble.ipynb		ensemble.ipynb
fasttext_direct.ipynb		fasttext_direct.ipynb
feature_engineering.ipynb		feature_engineering.ipynb
get_data.sh		get_data.sh
install.sh		install.sh
model_tool.py		model_tool.py
nbsvm.ipynb		nbsvm.ipynb
nbsvm.py		nbsvm.py
requirements.txt		requirements.txt
sample_submission.csv		sample_submission.csv
super_nbsvm.ipynb		super_nbsvm.ipynb
translate.ipynb		translate.ipynb
visuals.py		visuals.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Toxic Comment Classification

The Task

The Approach

Install Pre-requisites

Tips

About

Releases

Packages

Languages

Dicksonchin93/toxic_comment_classification

Folders and files

Latest commit

History

Repository files navigation

Toxic Comment Classification

The Task

The Approach

Install Pre-requisites

Tips

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages