Senior Data Scientist & AI Engineer

Specialized in AI systems, machine learning, and data analytics with expertise in building innovative solutions that drive business value.

Professional Headshot

Education

Northeastern University

Aug 2024

Master of Science in Data Analytics Engineering

CGPA: 3.6/4.0

Key Coursework:

Foundation Data Analytics, Data Management for Analytics, Deterministic Operational Research, Statistical Learning, Deep Learning, Applied Natural Language Processing

International Islamic University Islamabad

Dec 2019

Bachelor of Science in Software Engineering

CGPA: 3.5/4.0

Key Coursework:

Programming, Mathematics, Software Design, Artificial Intelligence

Experience

Sr. Data Scientist

SHARE Mobility

Columbus, OH (Remote)

Sep 2024 – Present

  • Developed an AI-powered ride booking system for SMS, Chat, and Call, automating operational tasks for 5 account managers, handling around 60 calls and over 100 conversations daily.
  • Designed and developed an agentic system leveraging Network Graphs, Fine tuning Generative AI Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), Semantic Caching, GraphQL, Vector Databases, Prompt Engineering, and Prompt Caching to enable seamless execution of complex actions on a shared mobility platform through natural language communication.

Team Steward

SHARE Mobility

Columbus, OH (Remote)

Jan 2022 - Aug 2022

  • Led development, DevOps, data, and routes optimization team.
  • Enabled data-driven decision-making by performing statistical analysis and setting up A/B tests for product improvements.
  • Developed a web-based commuter analysis platform using Angular, Node.js, GraphQL, AWS Redshift, OpenStreetMap, NetworkX, and scikit-learn for predictive analytics and geospatial intelligence. Created interactive maps and visualizations with D3.js and Chart.js, enabling optimized commute programs and securing $12M in funding.

Data Scientist

SHARE Mobility

Columbus, OH (Remote)

Apr 2021 - Dec 2021

  • Implemented rich vehicle routing solutions optimizing routing, scheduling, and resource allocation with time window constraints, resulting in a remarkable 40% reduction in overall system costs.
  • Engineered a company-wide data warehouse using Matillion, AWS Redshift and produced informative reports using AWS Quicksight.
  • Implemented customer segmentation models for personalized marketing and churn reduction.

Data Scientist

MTBC/Care Cloud Inc.

Islamabad, Pakistan

Aug 2020 - Apr 2021

  • Led the development of predictive models and reports by supervising a team of Jr. Data Scientists.
  • Developed and implemented an automation solution to streamline the medical billing process for 11 billing teams, utilizing advanced technologies such as Computer Vision, Document Alignment Parsing, Optical Character Recognition (OCR), and Selenium. This system significantly reduced manual effort, enhanced accuracy, and improved process efficiency.

Jr. Data Scientist

MTBC/Care Cloud Inc.

Islamabad, Pakistan

Oct 2019 - Aug 2020

  • Achieved an 83% prediction accuracy by developing a model for 80% mostly used procedure codes from diagnosis codes.
  • Managed daily task assignment and practice performance reports, ensuring continuous monitoring and improvement.
  • Leveraged visualization tools for reporting and analysis to present actionable insights to stakeholders.

Skills

Programming

C
C++
JavaScript
HTML
Python
R
SQL
APIs
GraphQL

Database Skills

SQL Server
MySQL
Postgres
Oracle
Teradata
MongoDB
Redis
Redshift
Neptune
Pinecone
AWS
Matillion
ETL
Airflow

Artificial Intelligence

Pandas
Numpy
Matplotlib
Seaborn
Scikit-learn
SciPy
TensorFlow
OpenCV
NLTK
Rasa
BERT
Embedding
Generative AI
LangChain
Groq
OpenAI
LlamaIndex
Web Scraping
PyTorch
DeepSpeed

Analysis Tools

Power BI
Tableau
Excel
MicroStrategy
Quicksight

Optimization

Excel Solver
PuLP
NetworkX
Clustering

Cloud Computing

S3
Lambda
ECS
Docker
Athena
SageMaker
Kubernetes
Elasticsearch
Terraform

Certifications

AWS Solutions Architect – Associate

Mathematics for Machine Learning (Coursera)

Deep Learning (Dice Analytics)

Data Warehousing & Business Intelligence (Dice Analytics)