EDUCATION
Ngee Ann Polytechnic – Singapore, Singapore
School of InfoComm Technology, Diploma in Data Science
Expected Graduation: February 2026
- GPA: 3.98 / 4.00
- A*STAR Scholar
EXPERIENCE
Agency for Science, Technology and Research (A*STAR) – Singapore, Singapore
Data Scientist Intern
Feb 2025 – Apr 2025
Supervisors: Bin Wang, Xun Long
- Contributed to data collection and data processing for the National Multimodal LLM Project (NRF Grant: $70M) focused on multimodal and multilingual model evaluation, specifically for the MERaLiON AudioLLM
Ngee Ann Polytechnic – Singapore, Singapore
Student Helper for Module Creation (Data Lake and Warehouse)
Aug 2024 – Oct 2024
- Worked with a large team of 8 to develop slides, tutorials, and a comprehensive teaching plan for a new module taught in the Data Engineering specialization
- Learned more about data infrastructure, data management and code efficiency within Snowflake
PROJECTS
Airbnb Pricing Model – Singapore, Singapore
Python & feature engine | Nov 2024 – Feb 2025
- Built multiple regression models to predict Airbnb prices using features like location, amenities, and property type
- Deployed an interactive web app locally using Streamlit to allow users to input property details and receive real-time price estimate
- Achieved 27.60 MAE and 0.779 R² score on the test dataset
HR Analytics Model – Singapore, Singapore
Python & scikit-learn | Nov 2024 – Feb 2025
- Trained and evaluated multiple models (Logistic Regression, Catboost, etc.) to foster better promotion choices
- Improved classification performance using techniques such as SMOTE oversampling and hyperparameter tuning
- Achieved 96.75% Recall and 86.94% F-Beta (β = 1.5) score on the test dataset
Baseball MVP Prediction Model – Singapore, Singapore
Python & NumPy | July 2024 – Aug 2024
- Extracted, merged, and preprocessed data from multiple tables for machine learning model development
- Created a logistic regression model utilizing oversampling techniques to address the imbalanced dataset
- Achieved 98.47% accuracy and 98.11% F1 score for the oversampled training dataset
HONORS AND AWARDS
- A*STAR Science Awards (Polytechnic) – November 2024
- Second Most Outstanding Performance in Cohort, Year 2, Semester 1 – November 2024
- Third Most Outstanding Performance in Cohort, Year 1, Semester 2 – June 2024
- Second Most Outstanding Performance in Cohort, Year 1, Semester 1 – December 2023
- Director’s List, Year 2, Semester 1
- Director’s List, Year 1, Semester 2
- Director’s List, Year 1, Semester 1
CERTIFICATIONS
- Alteryx Designer Core Certification – Jan 2025 – Jan 2027
- Professional Scrum Master I – November 2024
- Amazon Web Services Cloud Practitioner – June 2024 – June 2027
SKILLS
Python
- Deep Learning (TensorFlow, Natural-language processing)
- Machine Learning (Scikit-Learn, Feature Engineering, Hugging Face)
- Data Preprocessing (NumPy, Pandas)
Extra
- Docker, AWS, Computer Vision, AIGC, PowerBI, Tableau