- π©βπ I'm a Statistics graduate from Malaysia.
- π I'm seeking for a job opportunity in data science and analytics fields, or any other related fields.
- π± Iβm currently exploring Data Science.
- β‘ In my spare time, I'm always involved in data mining and machine learning projects.
- π« How to reach me:
β Β Note: The dates indicate the month and year when each project was completed.
- Identification of Disaster Related Tweets using NLP based Text Classification β | β May 2023 β | β Show project
- Developed an NLP-based text classification model in Python to predict whether a given tweet is disaster-related.
- Utilized libraries such as
Pandas
,NumPy
,Seaborn
,Matplotlib
,SciPy
,Plotly
,NLTK
,re
,collection
,wordcloud
,TensorFlow
,scikit-learn
. - Techniques employed: EDA, Text Preprocessing, Classification Model Comparison (Linear SVC, Multinomial NB, Neural Network).
- Achieved an AUC score of 0.86 using a Linear SVC model. (Good separability between disaster and non-disaster tweets)
- Churn Customer Prediction using Machine Learning β | β Apr 2023 β | β Show project
- Visualized the IBM Community's Telco Churn Dataset to quickly gain insights using Python in Jupyter Notebook.
- Developed a churn prediction model using machine learning algorithms in Python to identify whether a customer has churned.
- Utilized libraries such as
Pandas
,NumPy
,Seaborn
,Matplotlib
,Plotly
,H3
,Folium
,TensorFlow
,imblearn
,scikit-learn
,XGBoost
. - Techniques employed: EDA, Data Visualization, Classification Model Comparison (Random Forest, Logistic Regression, AdaBoost, XGBoost).
- Obtained an AUC score of 0.86 using an XGBoost classifier.
- Web Scraping Booking.com β | β Apr 2023 β | β Show project
- Scraped valuable hotel data in Kuala Lumpur, Malaysia from Booking.com using Beautiful Soup library in Python.
- Extracted information such as hotel names, locations, room types, scores, ratings, number of reviews, distance from the center, and prices.
- Utilized libraries such as
Pandas
,Requests
,Beautiful Soup (bs4)
,RegEx (re)
. - Techniques employed: Data Extraction
- Kaggle Titanic - Machine Learning from Disaster Competition β | β Mar 2023 β | β Show project
- Developed a machine learning model in Python to predict survival on the Titanic.
- Utilized libraries such as
Pandas
,NumPy
,Seaborn
,Matplotlib
,scikit-learn
,TensorFlow
. - Techniques employed: EDA, Feature Engineering, Data Visualization, Classification Model Comparison (Random Forest, Logistic Regression, Complement Naive Bayes).
- Achieved a stratified k fold CV score of 0.85 using a Random Forest model.
- Feature Engineering - Convert UTC time to Local time β | β Mar 2023 β | β Show project
- Converted UTC time to Malaysia Standard Time.
- Utilized libraries such as
Pandas
,DateTime
,Dateutil
,pytz
. - Techniques employed: Feature Engineering.
- Data Visualization for Worldwide Movie Series β | β Jan 2023 β | β Show project
- Presented graphical visualizations using Python to highlight patterns and trends in movie series data.
- Utilized libraries such as
Pandas
,NumPy
,Seaborn
,Matplotlib
,wordcloud
. - Techniques employed: EDA, Feature Engineering, Data Visualization.
- Online Payment Fraud Detection using Machine Learning β | β Dec 2022 β | β Show project
- Trained machine learning models in Python to identify fraudulent and non-fraudulent payments.
- Utilized libraries such as
Pandas
,NumPy
,Seaborn
,Matplotlib
,Tabulate
,scikit-learn
. - Techniques employed: EDA, Data Visualization, Classification Model Comparison.
- Obtained an F1 score of 0.79 using a Random Forest model.
- Cookies Sales Dashboard β | β May 2023 β | β Show project
- Prepared data for the cookies sales dashboard using Query Editor in Power BI.
- Created a cookies sales dashboard using Power BI, showcasing cookies sales, cost, profit, lead time, flavour and customer.
- Flight Ticket Sales Analysis Dashboard β | β Jan 2023 β | β Show project
- Queried an airline flight ticket dataset from Airlines Database using PostgreSQL (SQL).
- Created a flight ticket sales dashboard using Tableau, showcasing ticket sales, fare conditions, booking period, aircraft, departure and arrival airports.
- KPMG Data Analytics Consulting Virtual Internship β | β Nov 2022 β | β Show project
- Participated in KPMG Virtual Experience Program with Forage to gain insight into working at KPMG to develop career skills and experience.
- Completed tasks including data quality assessment, data insights analysis using Python (Jupyter Notebook) and data dashboard presentation using Tableau.
- Non-parametric Test for Patient Health Status β | β Mar 2022 β | β Show project
- Analyzed the Patient Health Status using non-parametric tests such as Shapiro-Wilk, Wilcoxon Rank Sum, Ansari-Bradley, Kolmogorov-Smirnov, Kruskal-Wallis and Spearman's Correlation Tests as data distribution assumptions of parametric tests are not met.
- Ecommerce Dashboard β| β Jul 2021 β | β Show project
- Visualized ecommerce data in a dashboard using Google Data Studio, displaying sessions, transactions, revenue, product checkout, average order value, conversion rate, and more.
- Java Application -Simple Student Information System β | Nov 2019 | β Show project
- Writed a Java application to represent a simple Student Information System.