This project builds a classification model that predicts a song's genre from its audio features.
- Removed rows with missing `instance_id`, `artist_name`, and `track_name`.
- Replaced missing values in `duration_ms` and `tempo` with medians.
- One-hot encoded the `key` column; label encoded the `mode` and `music_genre` columns.
- Standardized numerical features.
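A minimal preprocessing sketch with pandas and scikit-learn is shown below. The CSV file name is a placeholder, the numeric feature columns are selected programmatically rather than listed explicitly, and the encoded categorical columns are assumed to have no missing values; only the columns named above are documented here.

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder, StandardScaler

# Hypothetical file name; columns follow the dataset described above.
df = pd.read_csv("music_genre.csv")

# 1. Drop rows missing the identifier columns.
df = df.dropna(subset=["instance_id", "artist_name", "track_name"])

# 2. Median-impute duration_ms and tempo (coerce stray strings first).
for col in ["duration_ms", "tempo"]:
    df[col] = pd.to_numeric(df[col], errors="coerce")
    df[col] = df[col].fillna(df[col].median())

# Remember which columns are numeric features before encoding.
numeric_cols = [c for c in df.select_dtypes(include="number").columns
                if c != "instance_id"]

# 3. One-hot encode `key`; label encode `mode` and the target `music_genre`
#    (assumes these columns have no missing values).
df = pd.get_dummies(df, columns=["key"], prefix="key")
for col in ["mode", "music_genre"]:
    df[col] = LabelEncoder().fit_transform(df[col])

# 4. Standardize the numerical feature columns.
df[numeric_cols] = StandardScaler().fit_transform(df[numeric_cols])
```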
- Split dataset: 500 songs per genre for the test set, remaining for training.
- Used t-SNE and UMAP for visualization.
- Applied HDBSCAN for clustering analysis.
- Added cluster labels as features.
- Used XGBoost for classification.
- Evaluated with ROC AUC and accuracy metrics.
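The sketch below strings these steps together, continuing from the preprocessed `df` above. Holding out 500 songs per genre, clustering on a 2-D UMAP embedding, and the specific UMAP, HDBSCAN, and XGBoost parameters are illustrative assumptions rather than the exact configuration; report.pdf documents the actual setup.

```python
import hdbscan
import numpy as np
import umap
import xgboost as xgb
from sklearn.metrics import accuracy_score, roc_auc_score

# --- Split: hold out 500 songs per genre for the test set ---
test_df = df.groupby("music_genre").sample(n=500, random_state=42)
train_df = df.drop(test_df.index)

drop_cols = ["music_genre", "instance_id", "artist_name", "track_name"]
feature_cols = [c for c in df.columns if c not in drop_cols]
X_train = train_df[feature_cols].to_numpy(dtype=float)
X_test = test_df[feature_cols].to_numpy(dtype=float)
y_train = train_df["music_genre"].to_numpy()
y_test = test_df["music_genre"].to_numpy()

# --- Unsupervised structure: UMAP embedding + HDBSCAN clusters ---
reducer = umap.UMAP(n_components=2, random_state=42)
emb_train = reducer.fit_transform(X_train)   # also usable for 2-D visualization
emb_test = reducer.transform(X_test)

clusterer = hdbscan.HDBSCAN(min_cluster_size=50, prediction_data=True).fit(emb_train)
train_clusters = clusterer.labels_
test_clusters, _ = hdbscan.approximate_predict(clusterer, emb_test)

# Append the cluster label (-1 = noise) as an extra feature column.
X_train = np.column_stack([X_train, train_clusters])
X_test = np.column_stack([X_test, test_clusters])

# --- Classification with XGBoost (illustrative hyperparameters) ---
model = xgb.XGBClassifier(
    objective="multi:softprob",
    n_estimators=300,
    max_depth=6,
    learning_rate=0.1,
    eval_metric="mlogloss",
)
model.fit(X_train, y_train)

# --- Evaluation: macro one-vs-rest ROC AUC and accuracy ---
proba_test = model.predict_proba(X_test)
print("Test ROC AUC:", roc_auc_score(y_test, proba_test, multi_class="ovr", average="macro"))
print("Train accuracy:", accuracy_score(y_train, model.predict(X_train)))
print("Test accuracy:", accuracy_score(y_test, model.predict(X_test)))
```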
- Mean ROC AUC: 0.9345 (test set).
- Accuracy: 64.59% (training), 58.82% (test).
- Observed overfitting (training accuracy above test accuracy); suggested cross-validation, stronger regularization, and further feature engineering (see the sketch below).
- Feature engineering and data preprocessing were crucial.
- Visualizing the audio features highlighted the complexity of genre classification.
- Further improvements are needed for practical application.
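One way to act on these suggestions is stratified k-fold cross-validation with a more strongly regularized XGBoost configuration. The parameter values below are illustrative, not tuned, and `X_train`/`y_train` are assumed to come from the pipeline sketch above.

```python
import xgboost as xgb
from sklearn.model_selection import StratifiedKFold, cross_val_score

# More conservative XGBoost settings (illustrative values, not tuned).
model = xgb.XGBClassifier(
    objective="multi:softprob",
    n_estimators=300,
    max_depth=4,            # shallower trees
    learning_rate=0.05,
    subsample=0.8,          # row subsampling
    colsample_bytree=0.8,   # feature subsampling
    reg_lambda=2.0,         # L2 regularization
    eval_metric="mlogloss",
)

cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
scores = cross_val_score(model, X_train, y_train, cv=cv, scoring="accuracy")
print("CV accuracy: %.3f +/- %.3f" % (scores.mean(), scores.std()))
```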
For a detailed description, refer to report.pdf.