List of Projects

Individual Projects

Data Analysis and Web-App Development

🌿 Alternative Recipe Python

Alternative Recipe is a web-based application that assesses the carbon footprint of meals, suggests more sustainable recipe modifications, and can send the modified recipes to users by email.

https://jlee2843.github.io/portfolio-book/alternative-recipe.html

NLP and Supervised Machine Learning

📰 Misinformation Analysis Python

This project investigates the impact of fake news, with emphasis on its role in societal disorganization, and explores NLP and supervised machine learning methods for distinguishing fake news from true news.

https://jlee2843.github.io/portfolio-book/misinformation-analysis.html

Data Wrangling and Interactive Visualizations

🍁 Environmental Health in Canada Python

This project contains an extensive set of data visualizations aimed at raising awareness among Canadians about current air and water quality conditions across provinces. Before creating these visualizations, we carried out thorough data wrangling on data from multiple sources.

https://jlee2843.github.io/portfolio-book/environmental-health.html

Statistical Hypothesis Testing

💉 Massachusetts General Hospital Study on COVID-19 R

This report analyzes and visualizes key characteristics of protein and clinical data from the Massachusetts General Hospital (MGH) COVID-19 study, using statistical tests such as t-tests, binomial GLMs, and chi-square tests to evaluate our hypotheses.

https://jlee2843.github.io/portfolio-book/covid-analysis.html

Collaborative Projects

Machine Learning and Containerization

💳 Credit Card Fraud Detection Python Docker

In this project, we built and compared logistic regression, random forest, and gradient boosting classifiers for detecting fraudulent transactions. Despite class imbalance in the dataset, logistic regression emerged as the preferred model due to its swift implementation, interpretability, and practical suitability.

https://ubc-mds.github.io/fraud_detection/fraud_detection_full.html

Research Projects

Data Pipeline and Web-App Development

🥗 Climate-Friendly Food Sustainability (CFFS) Labels Python SQL

The CFFS project supports UBC’s CAP 2030 commitment, which aims for a 50% reduction in GHG emissions by 2030. The project established a comprehensive pipeline that analyzes and labels new ingredients added to the database, and developed a user-friendly web application.

https://github.com/jlee2843/CFFS-22-23

LASSO Regression and Co-Expression Analysis

🧬 The SCOPE Algorithm Python R

This project applies the SCOPE-Stabilized algorithm, including bootstrapped LASSO regression, to a dataset of melanocytic tumor patients. It also involves co-expression and differential co-expression analyses, along with pathway enrichment through over-representation analysis.

https://github.com/jlee2843/SCOPE-GSE98394

Machine Learning and Containerization

📖 Peer Review Analysis Python

The ongoing Peer Review Analysis project evaluates the influence of peer review on scientific papers, examining both positive and negative impacts. The project compares preprint versions to printed versions using a range of metrics.

https://github.com/jlee2843/Peer-Review-22-23