Phoenix Pham

About

I am a fourth-year undergraduate student studying Computer Science and Applied Mathematics at University of California, Berkeley. I consider myself an avid learner, passionate about understanding new things and constantly challenging myself, hoping to expand my perspective on the world. My core interest lie in applying data science, machine learning, and math to push forward innovation in neuroscience and healthcare. Some of other hobbies include problem-solving, basketball, music, art, fashion, and culinary art.

Work

Berkeley Lab Affiliate (Machine Learning, Neuroscience) @ Lawrence Berkeley National Laboratory (Fall 2025)
Data Science Intern @ IDX Exchange (Summer 2025)
Software Backend Engineer @ Stealth beauty-tech startup “StylistGem” (Summer 2025)
Machine Learning Engineer Intern @ Mentia (Spring 2024)

Education

University of California, Berkeley. Computer Science, B.A. & Applied Mathematics, B.A. (August 2022 - May 2026)

Projects

These are some of the various projects I've worked on as of late.

CS 180: Intro to Computer Vision and Computational Photography

Engaging in a series of projects for my CS 180 course at UC Berkeley. Click this link here to see these projects.

Applying DCA to Movie Scenes

Small demo project working with Dr. Kristofer Bouchard on applying the linear dimensionality reduction method Dynamical Components Analysis (DCA) to scenes of a movie I watched. Inspired by ongoing work in the Dr. Bouchard's Neural Systems and Machine Learning Lab (NSML) at the Lawrence Berkeley National Laboratory, which developed Dynamical Components Analysis (DCA) as a method to find population-level dynamical structure in neuroscience data. The repo with my analysis can be found here.

IDX Exchange: Predict California MLS Residential Property Closing Prices

Developed an ML system to predict residential property closing prices using California MLS data, including data cleaning, feature engineering, and training regression models (e.g. OLS, Ridge, Lasso, Random Forest, Gradient Boosting, Extreme Gradient Boosting). I deployed the XGBoost model online via Streamlit you can check out here!

Convolutional Neural Network from Scratch

Using only numpy functions, implementing core components including convolutional, linear, and activation layers to classify the Iris dataset.

Spam/Ham Classification

Implemented and trained a logistic regression classifier via scikit-learn to distinguish spam emails from ham (non-spam) emails.

Exploring & Predicting Housing Prices in Cook County

Implemented various OLS regression models via scikit-learn to predict fair market housing prices and identify system overassessment of low-value homes. Applied data cleaning, feature engineering & selection to improve model accuracy.

Assist People Living with Dementia through Machine Learning

Tasked with designing and developing an ML system to "predict" early signs/symptoms of Dementia through an interactive game for people living with Dementia called DevaWorld. An ongoing project that goes through the whole ML life-cycle— from data collection, to model building, to training, and testing the model. Currently on the data gathering and preparation of the process. Used Vertex AI, Kaggle, and Google Colab services to test various models and parse/clean raw data.

Build Your Own World (CS61B: The Game)

Tasked with implementing my own custom 2D video game in Java during my Data Structures & Algorithms course. Uses A* to generate random hallways and rooms based on seed input.

ML Movie Recommender

Self-project using Python, scikit-learn, pandas, numpy. Recommends movies based on the user's input. Motivated in order to improve my ML skills in terms of data extraction/manipulation, choosing the optimal measure for the model, and model building. The dataset credited here.

JoJo Reference Finder Google Extension

A self-project Google Extension that checks for words that correlate to the Japanese manga series JoJo's Bizarre Adventure. Helped with improving my front-end skills in HTML, CSS, and JavaScript, as well as using Chrome's storage API. You can see the code and even try out the Google Extension here!

Skills

Here are some of my strongest skills (in no particular order) gained from academic & professional experiences.

  • Java
  • Python
  • C
  • HTML
  • CSS
  • JavaScript
  • SQL
  • git
  • Complex Problem Solving
  • Team Communication
  • PyTorch
  • Keras
  • TensorFlow
  • NumPy
  • Pandas
  • scikit-learn
  • Matplotlib
  • Seaborn
  • Abstract Linear Algebra+
  • Abstract Algebra
  • Multivariable Calculus+
  • Probability
  • Research Analysis
  • Technical Writing
  • Microsoft Suite
  • Google Cloud Platform, Vertex AI
  • FastAPI
  • Pydantic
  • Docker

Resume/CV

Download Resume

Contact Me

If you have any questions, please feel free to contact me here.