EDUCATION

Ngee Ann PolytechnicSingapore, Singapore
School of InfoComm Technology, Diploma in Data Science
Expected Graduation: February 2026

  • GPA: 3.98 / 4.00
  • A*STAR Scholar

EXPERIENCE

Agency for Science, Technology and Research (A*STAR)Singapore, Singapore
Data Scientist Intern
Feb 2025 – Apr 2025
Supervisors: Bin Wang, Xun Long

  • Contributed to data collection and data processing for the National Multimodal LLM Project (NRF Grant: $70M) focused on multimodal and multilingual model evaluation, specifically for the MERaLiON AudioLLM

Ngee Ann PolytechnicSingapore, Singapore
Student Helper for Module Creation (Data Lake and Warehouse)
Aug 2024 – Oct 2024

  • Worked with a large team of 8 to develop slides, tutorials, and a comprehensive teaching plan for a new module taught in the Data Engineering specialization
  • Learned more about data infrastructure, data management and code efficiency within Snowflake

PROJECTS

Airbnb Pricing ModelSingapore, Singapore
Python & feature engine | Nov 2024 – Feb 2025

  • Built multiple regression models to predict Airbnb prices using features like location, amenities, and property type
  • Deployed an interactive web app locally using Streamlit to allow users to input property details and receive real-time price estimate
  • Achieved 27.60 MAE and 0.779 R² score on the test dataset

HR Analytics ModelSingapore, Singapore
Python & scikit-learn | Nov 2024 – Feb 2025

  • Trained and evaluated multiple models (Logistic Regression, Catboost, etc.) to foster better promotion choices
  • Improved classification performance using techniques such as SMOTE oversampling and hyperparameter tuning
  • Achieved 96.75% Recall and 86.94% F-Beta (β = 1.5) score on the test dataset

Baseball MVP Prediction ModelSingapore, Singapore
Python & NumPy | July 2024 – Aug 2024

  • Extracted, merged, and preprocessed data from multiple tables for machine learning model development
  • Created a logistic regression model utilizing oversampling techniques to address the imbalanced dataset
  • Achieved 98.47% accuracy and 98.11% F1 score for the oversampled training dataset

HONORS AND AWARDS

  • A*STAR Science Awards (Polytechnic) – November 2024
  • Second Most Outstanding Performance in Cohort, Year 2, Semester 1 – November 2024
  • Third Most Outstanding Performance in Cohort, Year 1, Semester 2 – June 2024
  • Second Most Outstanding Performance in Cohort, Year 1, Semester 1 – December 2023
  • Director’s List, Year 2, Semester 1
  • Director’s List, Year 1, Semester 2
  • Director’s List, Year 1, Semester 1

CERTIFICATIONS

  • Alteryx Designer Core CertificationJan 2025 – Jan 2027
  • Professional Scrum Master INovember 2024
  • Amazon Web Services Cloud PractitionerJune 2024 – June 2027

SKILLS

Python

  • Deep Learning (TensorFlow, Natural-language processing)
  • Machine Learning (Scikit-Learn, Feature Engineering, Hugging Face)
  • Data Preprocessing (NumPy, Pandas)

Extra

  • Docker, AWS, Computer Vision, AIGC, PowerBI, Tableau