CS + STATISTICS + APPLIED ML

HARESH RAJ

M.S. candidate in Data Science & Statistics at UT Dallas, applying statistical modeling and machine learning to messy, real-world problems, recently in healthcare analytics, energy, and consumer systems.

Open to work Grad May 2027

About

Background

Several of the projects on this site started as hackathon builds or class assignments that I continued developing after the deadline passed.

Recent work includes extracting structured signal from FDA drug-safety reports, modeling healthcare access gaps across Nigeria, and detecting anomalies in oil well sensor data. Across each of these, the focus has been building systems that hold up on messy, real-world data, not just clean benchmarks.

M.S. Data Science & Statistics AUG 2025 — PRESENT

The University of Texas at Dallas · Richardson, TX

B.S. Computer Science, Minor in Statistics AUG 2022 — MAY 2025

Texas A&M University · College Station, TX · GPA 3.586

Selected work

Projects

Toolbox

Stack

Languages

PythonSQLJavaJavaScriptTypeScriptC/C++

ML & Data

pandasNumPyscikit-learnXGBoostGeoPandas

Infra & Tools

PostgreSQL / AuroraAWSGitVercelNext.jsReact

Experience

Work & Research

Tech Intern FEB 2026 — PRESENT

AdiLabs.ai · Richardson, TX

Modeling rare-disease progression using longitudinal patient event data, in partnership with AdiLabs.ai through the UT Dallas Biotech Club. Exploring knowledge graph and ontology-based approaches to link clinical concepts and improve structured patient monitoring.

AI Data Processing Research Assistant AUG 2024 — MAY 2025

Texas A&M Aggie Research Scholars Program · College Station, TX

Improved image classification accuracy for cotton morphology by labeling, cleaning, and preprocessing large-scale agricultural datasets. Developed instance segmentation models and contributed to a mobile app for field use by farmers.

Analytics Development Officer FEB 2026 — PRESENT

UT Dallas Sports Analytics Club · Richardson, TX

Mentor student teams on sports analytics projects, including debugging Python code, improving data pipelines, and guiding end-to-end project execution.