About Me
Introduction
Greetings! I’m on an exciting journey in the Master of Data Science program at the University of British Columbia, fueled by a solid foundation in statistics and neuroscience from my undergraduate days. I thrive on exploring the intricacies of data, uncovering trends, and seamlessly applying those insights in unexpected domains.
My latest role in the realm of data science was as a Data Science Intern at Teck Resources Limited. There, I delved into the world of Hugging Face transformers and the BERT model, tackling NLP challenges with gusto. When I am not exploring data, you’ll often find me indulging my creative side by sketching and listening to music across various genres.
I am thrilled to share my data journey through my portfolio! Please feel free to reach out through any of the contact information above if you’d like to chat with me. 🚀🎨🎶
Skills Toolbox
⌨️ Programming Languages
Python
R
Java
SQL (PostgreSQL, MySQL)
📈 Data Analysis
Data wrangling with pandas and numpy
Data retrieval and management with SQL
Static visualization with matplotlib and seaborn
Interactive dashboards with Plotly Dash, streamlit, and altair
📊 Machine Learning
Supervised learning
Unsupervised learning
Ensemble models
Neural networks
Deep learning
Familiar with scikit-learn, scipy, PyTorch, TensorFlow, and NetworkX
Career
University of British Columbia | Food Sustainability Data Analyst
09/2023 - Present
▸ Part-Time | Python R Git GitHub Docker
Description
- Engineered an automated workflow to evaluate the cumulative greenhouse gas emissions resulting from all procurement items acquired and utilized by UBC Food Services and AMS.
- Implemented machine learning models to gauge UBC’s ability to achieve the goals of the Climate Action Plan 2030. Where meeting a goal looks doubtful, the models offer recommendations to reduce annual greenhouse gas emissions.
University of British Columbia | Graduate Teaching Assistant
09/2023 - Present
▸ Part-Time | R Git GitHub
Description
- Member of the teaching team for STAT 200 (Elementary Statistics for Applications) and DSCI 100 (Introduction to Data Science) at UBC.
- Attended lectures to address students’ questions in real-time. Facilitated a lab section, guiding students through weekly lab materials. Assessed and graded student exams and assignments.
Teck Resources Limited | Data Science Intern
01/2023 - 08/2023
▸ Full-Time | Python Git GitHub Docker Databricks AWS
Description
- Developed statistical solutions to address machine learning model performance degradation resulting from data drift. Automated the process of re-scaling incoming data values upon detecting data drift.
- Explored and implemented alternative machine learning models aimed at replacing the existing high-performance but costly BERT model. Investigated models with enhanced interpretability, such as K-means clustering, logistic regression, and DBSCAN clustering.
University of British Columbia | Climate-Friendly Sustainability Data Analyst
08/2022 - 04/2023
▸ Part-Time | Python R Git GitHub Docker
Description
- Engineered an automated workflow to evaluate the overall impact of newly added food items to the UBC Food Services database.
- Categorized all food items offered by UBC Food Services into green, yellow, and red based on the total greenhouse gas emissions produced by 100g of each menu item.
- Labeled newly added food ingredients in the UBC database.
- Developed a web-based application for Food Services staff to efficiently search for climate-friendly labels. The application incorporates dynamic graphs that change based on the selection of particular residence halls or vendors.
Ciena | Machine Learning and Modelling Intern
09/2022 - 12/2022
▸ Full-Time | Python PostgreSQL Bitbucket NetworkX
Description
- Enhanced an outdated artificial neural network model through real-time data analysis. Conducted comprehensive testing using six different methods and generated a detailed report comparing the performance of each approach.
- Collaborated with colleagues to develop an interactive app using Plotly Dash. This app enables clients to effortlessly monitor their current network traffic and explore future predictions.
Cybera Inc. | Data Science Intern
04/2022 - 03/2024
▸ Full-Time | Python R Git GitHub
Description
- Analyzed, wrangled, and visualized data on trending topics sourced from an open-source database. Gained exposure to various Python and R packages for visualization, including Matplotlib, Seaborn, Plotly, Plotly Dash, and ggplot2.
- Collaborated with colleagues to create hackathon content for Albertans. Prepared and processed open-source datasets to serve as valuable resources for a data science hackathon.