MS Data Analytics | Webster University (Dec 2024) 📍 Piscataway, NJ → Open to US opportunities (Remote & Relocate) | STEM OPT Active
I build data-driven solutions that solve real business problems, not just notebooks that sit on a laptop.
7+ years of combined experience across healthcare, recruitment, and business analytics - including managing data relationships with 30+ NHS hospitals in the UK. Now applying that domain knowledge to data science.
Lean Six Sigma Black Belt - I don't just find problems in data. I frame them as business solutions.
One new project is published every other week, Building in public.
| # | Project | Type | Status | Live |
|---|---|---|---|---|
| 1 | Healthcare Workforce Analytics Dashboard | Python · Streamlit | ✅ Live | 🔗 Open |
| 2 | Supply Chain KPI Dashboard + DMAIC + SQL | Python · SQL · Streamlit | ✅ Live | 🔗 Open |
| 3 | Supply Chain Power BI Dashboard | Power BI · DAX | 🔨 Building | Releasing in May |
| 4 | SQL Business Dashboard | SQL · Power BI · Tableau | 🔨 Udemy project | Releasing in May |
| 5 | SQL Healthcare Claims Analysis | SQL · SQLite · Python | 📋 Planned | Releasing in May |
| 6 | HR Analytics Dashboard | SQL · Power BI · Excel | 📋 Udemy project | Expecting release in June |
| 7 | Demand Forecasting + Inventory Optimizer | Python · Prophet · Streamlit | 📋 Planned | Expecting release June |
| 8 | LLM Business Intelligence Tool | LangChain · OpenAI · Streamlit | 📋 Planned | Expecting release June |
| 9 | Cricket Analytics Dashboard | Python · Plotly · Streamlit | 📋 Planned | Expecting release June |
Analyzed 9.6M real US Medicare records to identify physician staffing gaps across all 50 states
- Processed 1.1M unique providers across 104 medical specialties
- Built an interactive 5-tab Streamlit dashboard with US choropleth maps
- Applied Lean Six Sigma DMAIC framework to structure recruitment gap analysis
- Identified Wyoming (97.7%), Vermont, and Alaska as the most critically underserved states
- Full analysis run locally on 9.6M records — dashboard shows 50k representative sample
- Live: https://karan-healthcare-analytics.streamlit.app
- Stack: Python · Pandas · Plotly · Streamlit · CMS Medicare Data
Analyzed 180,519 actual orders - found that 57% of deliveries are late across 23 global regions
- Only 42.7% on-time delivery rate - Central Africa worst at 60.7% late rate
- 15 SQL queries via SQLite - late rate by region, revenue by category, customer segments
- ABC inventory segmentation identifying Class A products driving 80% of revenue
- Full DMAIC Six Sigma structured analysis - Define through Control
- Full analysis on 180,519 orders locally - dashboard runs on 50k representative sample
- Live: https://karan-supply-chain.streamlit.app
- Stack: Python · SQL · SQLite · Plotly · Streamlit · DMAIC
Same 180,519 order dataset rebuilt in Power BI - demonstrating Microsoft stack proficiency
- 4-page interactive report: Executive Summary, Delivery Performance, Revenue & Profit, ABC Inventory + DMAIC
- DAX measures for KPI calculations - On-Time Rate, Late Orders, Total Revenue, Avg Margin
- Designed for business stakeholders - not just technical audiences
- Status: In progress - releasing Monday May 12
- Stack: Power BI · DAX · DataCo Supply Chain Dataset
Predicting employee turnover to reduce hiring costs
- Analyzed 15,000+ employee records using Logistic Regression and Decision Trees
- Achieved 90% prediction accuracy - job satisfaction identified as top turnover driver
- Recommended strategies projected to reduce turnover by 20%
- Stack: R · Logistic Regression · Decision Trees · k-NN · SVM
- Repo: Human-Capital-Analysis
Loan default prediction reducing misclassification cost by $3M
- Built Logistic Regression and Decision Tree models on 5,960 loan applicants
- Improved sensitivity to 80.65%, reducing false negatives
- Demonstrated $3M cost reduction through optimized approval strategy
- Stack: R · Logistic Regression · Decision Trees
- Repo: Bank-Loan-Decision-Making-Analysis
Customer segmentation and brand loyalty prediction
- Segmented 600 consumer profiles using K-Means clustering
- Applied Random Forest and Logistic Regression for brand loyalty prediction
- Built for AXANTEUS market research agency
- Stack: R · K-Means · Random Forest · Logistic Regression
- Repo: Consumer-Segmentation-Analysis
| Course | Platform | Section | Target Complete |
|---|---|---|---|
| Data Analysis: SQL · Power BI · Tableau · Excel | Udemy | SQL section | May 9 |
| Google Data Analytics Professional Certificate | Coursera | Starting week 4 | May 19 |
| Microsoft PL-300 Power BI Associate | Microsoft Learn | After Power BI section | Jun 2 |
| Unilever Supply Chain Analytics | Coursera | Week 7 | Jun 9 |
Languages: Python · R · SQL
Visualization: Plotly · Streamlit · Power BI · Tableau · Seaborn
ML/Analytics: Scikit-learn · Logistic Regression · Decision Trees · Random Forest
Clustering · Time Series · Predictive Modeling · A/B Testing
Database: SQL · SQLite · MySQL · Excel (Advanced) · DAX
AI/LLM: LangChain · OpenAI API (coming soon)
Process: Lean Six Sigma Black Belt · DMAIC · SIPOC · RCA · FMEA
Domain: Healthcare Analytics · Supply Chain · Recruitment Analytics
- 🏆 Lean Six Sigma Black Belt - Benchmark Six Sigma (2021)
- 🏆 Lean Six Sigma Green Belt - Benchmark Six Sigma (2021)
- 📊 MS Data Analytics - Webster University, St Louis (Dec 2024) | GPA 3.31
- 🔄 Google Data Analytics - Coursera (in progress)
- 🔄 Microsoft PL-300 Power BI - Microsoft Learn (in progress)
Senior Accounts Manager - ID Medical LLP (Healthcare Staffing, UK) Managed data relationships with 30+ NHS hospitals · Improved forecasting accuracy 25% · 15% YoY revenue growth
Senior Recruitment Consultant - QX KPO Services 453 shifts booked in one month · £25,000 revenue · Led 4-member analytics team
International Peer Mentor & Writing Coach - Webster University CRLA Level 2 Certified · Improved student outcomes 94%
Open to Healthcare Analyst · Supply Chain Analyst · Data Analyst · Business Analyst roles 📍 Piscataway, NJ · Remote · Open to relocate within US