Data Analyst · Data Engineer · ML Engineer
I build data pipelines, analytical dashboards, and machine learning systems that turn raw data into decisions companies act on. B.S. Computer Science, UMass Lowell. DP-300 Certified. Based in Lowell, MA — open to relocation.
Every project starts with a business question. Below are systems I designed, built, and shipped — each producing measurable output.
Cloud costs are the fastest-growing budget line — yet most teams lack real-time anomaly detection.
End-to-end pipeline processing 500,000 AWS billing records. IQR statistical anomaly detection flags $16,600+ in recoverable spend. Executive-ready visualizations delivered to finance and engineering leadership without analyst intervention.
Acquiring a new customer costs 5–7× more than retaining one.
ANN classification model scoring every customer by churn probability. High-risk segment captures 80.5% of actual churners — enabling targeted retention before revenue is lost.
1 in 3 students shows depression symptoms — institutions can't identify who.
Multivariable correlation across 5,000 students. Sleep deprivation and financial stress identified as top risk factors (p < 0.001). Interactive Tableau dashboard for program staff.
Fragmented Excel reports create inconsistent metrics and no forecasting.
Star-schema Power BI platform with 15+ DAX measures — YoY growth, margin decomposition, revenue forecasting. Row-level security for regional access control.
Academic failure is predictable — if you have the infrastructure to act early.
End-to-end ML pipeline with MLflow experiment tracking, multi-model comparison, Docker containerization deployed on AWS Elastic Beanstalk. Best model AUC 0.91.
Manual review analysis doesn't scale past a few hundred records.
TF-IDF feature extraction and classification pipeline for 2,000 movie reviews. Top positive/negative keyword identification with probability calibration.
Sequence modeling is the foundation of modern language systems.
N-gram language model implementing LSTM and GRU architectures. 100% top-5 accuracy on data science corpus with multi-model comparison output.
I'm actively looking for full-time Data Analyst, Data Engineer, and ML Engineer roles. Authorized to work in the U.S. on F-1 OPT. Open to nationwide relocation and H-1B sponsorship.