my-deep-learning-projects/nlp-projects/spam-mail-detection-w-tensorflow-distilbert at main · john-fante/my-deep-learning-projects · GitHub

Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
spam-mail-detection-w-tensorflow-distilbert.ipynb		spam-mail-detection-w-tensorflow-distilbert.ipynb

README.md

Spam Mail Detection w/Tensorflow (DistilBERT)

(kaggle link -> https://www.kaggle.com/code/banddaniel/spam-mail-detection-w-tensorflow-distilbert)

I tried to predict a spam mail with finetuning a DistilBert based Tensorflow model.

I applied several preprocessing operations (cleaning,dropping stop words),
Used tf.data pipeline for efficient training,
I only used only 20 max length for sequence length (bert models support up to 512 input lengths),
Only 18000 samples be used for training (12000 samples for validating and 20000 samples for testing),

Screenshot 2024-03-14 at 8 45 08 PM

My Another Projects

References