CaptchAI

This is a speech recognition project built with the purpose of exploring feature engineering in audio samples and Python best practices.

The challenge is to break a fictitious audio CAPTCHA formed by a sequence of four characters. The CAPTCHAs were built from audio samples recorded by volunteer students of the Universidade Federal do ABC. The samples were recorded with a variety of microphones, so expect a range of background noises. The character sequences were assembled at random, so a single CAPTCHA may contain different voices.

The proposed solution uses the Random Forest algorithm from the Scikit-learn package.

The original audio samples are not publicly available in order to preserve the privacy of the volunteers.

Getting Started

Prerequisites

You must have Python 3.7 or greater and Pip installed.

Installing

Install the dependencies using the requirements.txt file.

pip install -r requirements.txt

Data Prep

If you have a folder of ".wav" samples that you would like to use, place them in a "data" folder structured as follows, then run the data prep script:

./data/training

./data/validation

./data/test

python data_prep.py
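
Before running the script, it can help to confirm that the folder layout matches the one listed above. The following is a minimal sketch and not part of the repository: the helper name `count_wav_files` is hypothetical, and it assumes the samples sit directly inside each split folder rather than in nested subfolders.

```python
from pathlib import Path

# Split folders named in this README.
SPLITS = ("training", "validation", "test")


def count_wav_files(data_dir="data"):
    """Return {split: number of .wav files}; raise if a split folder is missing.

    Hypothetical helper, not part of the repository. Assumes the samples
    sit directly inside each split folder (no nested subfolders).
    """
    root = Path(data_dir)
    counts = {}
    for split in SPLITS:
        folder = root / split
        if not folder.is_dir():
            raise FileNotFoundError(f"Expected folder not found: {folder}")
        # Count regular files whose extension is .wav, ignoring case.
        counts[split] = sum(
            1
            for p in folder.iterdir()
            if p.is_file() and p.suffix.lower() == ".wav"
        )
    return counts
```

Calling `count_wav_files()` from the project root returns a dictionary with one count per split, or raises `FileNotFoundError` naming the first folder that is missing.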

Training

In order to train the model you should run the following command:

python train_model.py
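
To give a rough idea of what training a Random Forest with scikit-learn involves, here is a small self-contained sketch. It does not reproduce train_model.py: the original samples are private, so the features are synthetic, and the character set, feature count, and hyperparameters are all assumptions made for illustration.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)

# Synthetic stand-in for per-character feature vectors (the real samples are
# private). Each class is a Gaussian blob around its own centre.
LABELS = ["a", "b", "c", "d"]  # hypothetical character set
N_FEATURES = 8                 # hypothetical number of features per sample


def make_split(n_per_class):
    """Build a labelled synthetic split with n_per_class rows per character."""
    X, y = [], []
    for i, label in enumerate(LABELS):
        centre = np.full(N_FEATURES, 5.0 * i)
        X.append(rng.normal(loc=centre, scale=1.0, size=(n_per_class, N_FEATURES)))
        y.extend([label] * n_per_class)
    return np.vstack(X), np.array(y)


X_train, y_train = make_split(50)
X_valid, y_valid = make_split(20)

# Hyperparameters here are illustrative, not the project's settings.
model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X_train, y_train)

# Score on the held-out validation split.
valid_accuracy = accuracy_score(y_valid, model.predict(X_valid))
print(f"Validation accuracy: {valid_accuracy:.2f}")
```

The synthetic classes are deliberately well separated, so the score says nothing about how hard the real audio task is; the point is only the fit, predict, and score sequence.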

Predicting

Run the following command in order to make predictions over the test dataset:

python run_model.py
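
Because each CAPTCHA is a sequence of four characters, a prediction only breaks the CAPTCHA when all four characters are right. The sketch below shows one way to measure both per-character and per-CAPTCHA accuracy. The function names and the example sequences are made up for illustration, and this is not necessarily how run_model.py reports its results.

```python
def character_accuracy(truths, predictions):
    """Fraction of individual characters predicted correctly."""
    total = sum(len(t) for t in truths)
    correct = sum(
        t_ch == p_ch
        for t, p in zip(truths, predictions)
        for t_ch, p_ch in zip(t, p)
    )
    return correct / total if total else 0.0


def captcha_accuracy(truths, predictions):
    """Fraction of CAPTCHAs where every character matches."""
    if not truths:
        return 0.0
    solved = sum(t == p for t, p in zip(truths, predictions))
    return solved / len(truths)


# Made-up four-character sequences, for illustration only.
truths = ["a6bd", "hm7n", "xx6c", "dbah"]
predictions = ["a6bd", "hm7m", "xx6c", "dbbh"]

print(character_accuracy(truths, predictions))  # 0.875 (14 of 16 characters)
print(captcha_accuracy(truths, predictions))    # 0.5 (2 of 4 CAPTCHAs)
```

The gap between the two numbers illustrates why a classifier with good per-character accuracy can still fail a noticeable share of full CAPTCHAs.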

Graphs

A mel-spectrogram can be generated by running:

python generate_graphs.py

Built With

Python - The programming language.
Scikit-learn - Used to train the model and make predictions.
Pandas - Used to generate DataFrames.
Librosa - Used to manipulate the audio files and extract some features.

Authors

Lucas Monteiro de Oliveira - Coding - Monolli
João Victor Fontinelle Consonni - Report - Cojonni

License

This project is licensed under the GNU GPLv3 License. See the LICENSE file for details.

Acknowledgments

Many thanks to João Victor Fontinelle Consonni who helped with a full report of the project.

