🧜 Caliban

A very simple Python app that detects the genre of an audio file using a mel spectrogram and a simple CNN built with PyTorch.
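
For readers new to the term: a mel spectrogram pools the frequency bins of a short-time Fourier transform into bands spaced evenly on the mel scale, which roughly tracks perceived pitch. The sketch below uses only the standard library and the common HTK-style formula to compute those band edges. The function names, the 64-band count, and the 11,025 Hz upper limit are illustrative assumptions, not values taken from this project's code.

```python
import math


def hz_to_mel(hz: float) -> float:
    """Convert a frequency in Hz to mels (HTK-style formula)."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel: float) -> float:
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_band_edges(n_mels: int, f_min: float, f_max: float) -> list[float]:
    """Return n_mels + 2 frequencies (Hz) spaced evenly on the mel scale.

    Each triangular mel filter spans three consecutive edges, so n_mels
    filters need n_mels + 2 edge points.
    """
    lo, hi = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (hi - lo) / (n_mels + 1)
    return [mel_to_hz(lo + i * step) for i in range(n_mels + 2)]


if __name__ == "__main__":
    # Assumed values: 64 bands from 0 Hz up to 11,025 Hz.
    edges = mel_band_edges(n_mels=64, f_min=0.0, f_max=11025.0)
    print(f"{len(edges)} edges, first bands (Hz): {[round(e) for e in edges[:4]]}")
```

Because the spacing is even in mels, the bands are narrow at low frequencies and widen toward the top of the range, which is why the resulting image suits a small CNN better than a raw linear-frequency spectrogram.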

Currently the model is trained on just the 10 genres of the GTZAN dataset, but we're working to retrain it on the Million Song Dataset for a wider array of results and genres, as shown here: http://millionsongdataset.com/sites/default/files/AdditionalFiles/unique_terms.txt
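
A 10-genre classifier of this kind typically emits one score per genre, and the prediction is the highest-scoring label. The sketch below shows that last step with the ten GTZAN genre names. The label order and the example scores are assumptions for illustration; the model shipped in this repository may order or post-process its outputs differently.

```python
import math

# The ten GTZAN genres. Alphabetical order is an assumption here;
# the trained model's output order may differ.
GTZAN_GENRES = [
    "blues", "classical", "country", "disco", "hiphop",
    "jazz", "metal", "pop", "reggae", "rock",
]


def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_genre(scores, labels=GTZAN_GENRES):
    """Return (label, probability) for the highest-scoring genre."""
    if len(scores) != len(labels):
        raise ValueError("expected one score per genre label")
    probs = softmax(scores)
    best = max(range(len(probs)), key=probs.__getitem__)
    return labels[best], probs[best]


if __name__ == "__main__":
    # Made-up scores, not real model output.
    example_scores = [0.1, -1.2, 0.3, 0.0, 0.5, -0.7, 2.4, 0.9, -0.3, 3.1]
    print(top_genre(example_scores))
```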

Usage

$ docker-compose up -d application

Then just send it a file. The @ prefix tells curl to upload the file's contents rather than the literal path string, and curl sets the multipart content-type header and boundary itself.

curl --request POST \
  --url http://127.0.0.1:5000/ \
  --form file=@/path/to/file
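
The same upload can be made from Python. This is a minimal sketch using only the standard library: it builds a multipart/form-data body with a single `file` field (the field name and URL come from the curl example) and posts it to the service. The helper names `build_multipart` and `classify` are hypothetical, and because the response format is not documented here, the raw response text is returned unparsed.

```python
import urllib.request
import uuid
from pathlib import Path


def build_multipart(field: str, filename: str, payload: bytes):
    """Encode one file as multipart/form-data; return (body, content_type)."""
    boundary = uuid.uuid4().hex  # random boundary, unlikely to occur in the payload
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n"
        "\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + payload + tail, f"multipart/form-data; boundary={boundary}"


def classify(path: str, url: str = "http://127.0.0.1:5000/") -> str:
    """POST an audio file to the running service and return the raw response text."""
    audio = Path(path)
    body, content_type = build_multipart("file", audio.name, audio.read_bytes())
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": content_type}, method="POST"
    )
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")


# Example (requires the container to be running):
#   print(classify("/path/to/file"))
```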

Datasets: Million Song Dataset, GTZAN Genre Collection

Thanks

Big thanks to cetinsame for the work on the original model.

