Education
University of California, Berkeley (Fall 2023 - Spring 2027)
B.S in Electrical and Computer Science
B.S in Industrial Engineering and Operations Research
Cool Math: Discrete Mathematics and Probability Theory, Probability and Random Processes, Introduction to Stochastic Processes
Cool CS: Algorithms, Machine Learning, CS61A/B
Activities: Cal Table Tennis Officer, Poker at Berkeley Member, IISE Member, Tau Beta Pi Candidate
College of San Mateo (Fall 2019 - Spring 2023)
A.S in Computer Science (Fall 2022)
A.S-Transfer in Mathematics & Computer Science & Physics (Spring 2023)
Cool Math: Discrete Math, Calculus 3, Linear Algebra, Differential Equations
Cool CS: Intro to CS 1 & 2, Data Structures & Algorithms, Computer Systems
San Mateo High School (Fall 2019 - Spring 2023)
High School Diploma
AP Calculus BC, AP Physics 1, AP CS A, AP Chinese, AP French, AP English Language
Work Experience
Incoming Algorithms and AI Software Development Intern (2024)
Genetic Sciences Division, Thermo Fisher Scientific
NLP Student Research Assistant (Spring 2022 - Fall 2022)
Yale Cardiovascular Data Science Lab (CarDS)
Developed a full-stack federated-learning demo tool for hospitals in Japan and US. Orchestrated the seamless integration of Yale AWS instances and databases, delivering substantial 2x efficiency of query intialization.
Spearheaded Yale-OHDSI by incorporating hierarchical administrative access and learning the Athena vocabulary for hosting the OHDSI SQL generation tool.
Deployed a federated BOW spaCy model for classifying patient features (Age, Gender, Risk) based on hospital texts, achieving 0.99 binary AUC and 0.87 macro-F on 77 categorical age labels. Presented model at lab meeting.
Autonomous Cognitive Assistants Software Developer (Summer 2022)
Beaver Works Summer Institute, MIT Lincoln Laboratory
Learned from Anthropic MLE Ryan Soklaski, who guided designing our own auto-differentiation package.
Led 3 team-based AI capstones: FaceID with Facenet and Whispers clustering, fine-tuning food-recognition on YoloV7 and Detectron2, and semantic search, weighted inverse document frequency image-query engine from text.
CEO and Algorithms Director(2020 - 2022)
USACO Tutor
Managed a team of tutors teaching multiple 1:1 weekly classes for USACO competitions
Prepared classes in algorithms and competitive programming for 30+ students
Optimized SEO to achieve #1 on Google & Designed a Learning Management System
Student Council President (2019 - 2020)
San Mateo High School
Reach out to community leaders and companies to promote fundraising and community
Started the petition to close SM schools due to COVID, gathering 5500+ signatures
Research
AFP and CEA Viability as EOC Biomarkers [2nd Place]
David Z. Yang
[Show/hide abstract]
[AntiAngio] [Presentation]
Women have a 1.3% chance in their lifetime of developing ovarian cancer. Out of 1000 women, 13 of them will develop malignant cancers, of which will likely be high-grade serous carcinomas, a subtype of epithelial ovarian carcinomas, which rapidly metastasize (spread) and evolve into stage 3 and 4. Ovarian cancer has one of the highest death rates of all female cancers - disproportionally contributing 5% of female cancer deaths despite only making up 2.5% of female cancers. Ovarian cancer is difficult to diagnose early, and given the name "the silent killer", because many of the current screening methods are unable to detect traces of ovarian cancer. Rather, only when the cancer has metastasized to different parts of the body, and in Stage 4, spread to unrelated places such as the lungs, breast, and brain, that current screening methods are able to pick up on the malignancy of the cancer. In our study, we analyzed different features, decided on AFP, CEA, CA125, HE4 as key features, and decided on Mean Imputation. 5-fold cross validation was implemented to prevent overfitting. My model increases sensitivity and specificity to both 89/90%.
Disease-associated Microbiome Interaction Identification [In Progress]
David Z. Yang, Advisor: Dr. Erliang Zeng
Awards
USACO Platinum (Top 2% of High Schoolers)
2020 Promotion List
USACO Problem Writer
2021 [Year of the Cow]
Facebook Hackercup T-Shirt Winner (Top 1500)
2021-2022
SM County Poetry Out Loud
2019-2020
Winner of the county competition and competed at the state
Enrichment (Mostly in progress)
Data Mining CS246
Stanford
Notes: Apriori and Park-Chen-Yu
Machine Learning CS229
Stanford
Machine Learning CS231N
Stanford
DeepLearning.AI Neural Networks
Andrew Ng
Computational Genomics with R
Book Link