ATABAK (ART) MARDAN

Skills & Stack

GenAI & Agentic
AWS Bedrock / Claude Claude Code AI-Augmented Development (Vibe Coding) LLM Cost Optimization Prompt Engineering LLM Feature Gen Vision LLMs
Modeling & ML
XGBoost Classification & Regression Bayesian & Markov Models Time-Series Forecasting Discrete-Choice Modeling Monte Carlo Simulation
Data Engineering
Python (Pandas / PySpark) SQL AWS (Glue / Redshift / Lambda) Apache Airflow ETL Pipelines Data Warehousing Feature Engineering
Geospatial Systems
Python Geospatial ArcGIS / QGIS Shortest Path Algorithms NetworkX OSMnx OpenStreetMap
Analytics & Experiment
A/B Testing KPI Design R Stakeholder Communication
Dashboarding & BI
Dashboarding Tableau Streamlit / Plotly Advanced Excel

Work Experience

Data Scientist III

Choice Hotels International

Feb 2023 - Present
  • Sales Lead Scoring Platform: Built a product that scores and ranks prospective B2B customers for hotel sales teams. Cut sales lead research from an hour to minutes for teams covering 6,000 properties. Delivered as a Tableau dashboard integrating Salesforce, internal performance data, and external vendor data. Engineered multiple ML and LLM-based features that feed a single ranked score.
  • Customer Lifetime Value Model: Built an XGBoost CLV model on 50M+ transaction records. Improved customer segment accuracy by 25%. Informed targeting decisions on $100M+ in system fees for marketing and seasonal planning. Built a deduplication pipeline to clean the underlying data.
  • Vision-LLM Marketing Photo Compliance Pipeline: Built an AI system that scores and ranks marketing photos across 6,000 franchised hotel properties against brand and compliance rules. A contractor quoted $45K and 6,090 hours to do this manually. Delivered the same coverage in a single automated run for under $1K, a 97% cost reduction. Flags violations including bathroom hero images, parking-lot-heavy exteriors, competitor logos, and guests without releases. Routed full-portfolio runs through Bedrock Batch Inference and downsampled images to reduce per-call token cost.
  • External Data Matching Pipeline: Designed a multi-stage algorithm to match external data sources against 60,000 US hotels. Replaced a single-step name-matching approach. Added fuzzy matching, geospatial matching, and LLM-based vector similarity for hard cases. Lifted match rate from 50% to 96%. Deployed on AWS Glue.
  • Climate Risk & Opportunity Dashboard: Built a first-of-its-kind tool for area directors covering 6,000 properties. Replaced ad-hoc property reviews with a single, unified view. Translated raw climate, solar, EV charging, and national park data into user-friendly KPIs (high, medium, low). Layered in GenAI (Claude through AWS Bedrock) to generate actionable, property-specific summaries.

Lead Transportation Networks Data Scientist

ICF International

Sep 2021 - Feb 2023
  • Built a Python and GIS pipeline to audit GTFS data quality across FHWA's national transit inventory. Flagged misaligned road geometry, missing stops, and other structural errors. Ranked networks by severity so the worst offenders could be addressed first. Delivered findings via technical documentation and stakeholder dashboards.
  • Built Python and GIS isochrones to measure walking and biking access to bus stops along the Bronx highway corridor for NYSDOT. Mapped current transit access gaps. Modeled scenarios for proposed north/south access restrictions. Quantified how the changes would affect resident access to transit.
  • Powered the backbone of Virginia OIPI's public data inventory by computing statewide travel-time reliability KPIs in R. Partnered directly with stakeholders to define KPI methodology. Delivered detailed methodology reports for client review.

Transportation Systems Data Modeler

C&M Associates

Aug 2019 - Sep 2021
  • Owned the data and simulation pipeline behind long-term traffic and revenue forecasts for state DOT projects, including the I-495 toll road extension. Ran data ingestion into the company data warehouse, outlier detection, survey processing, travel-demand modeling, and calibration in Python, GIS, and Excel.
  • Built revenue forecasts and sensitivity analyses for the Dulles Toll Road 30-year planning project, feeding VDOT's long-term contracting and operator decisions. Modeled toll rate scenarios and demand sensitivities in Python and advanced Excel, working directly with VDOT stakeholders.

Data Science Graduate Research Assistant

George Mason University

Aug 2017 - May 2020
  • Developed a VDOT Tableau dashboard to visualize and expose intersection-level traffic metrics, enabling improved operational awareness for state transportation stakeholders.
  • Parsed and cleaned raw sensor text feeds into a structured, queryable database to support downstream data analysis and reporting requirements.
  • Delivered an end-to-end dashboard solution by transforming raw sensor data into actionable stakeholder insights, streamlining traffic monitoring and reporting processes.
Education

PhD, Transportation Systems

George Mason University (2025)

MS, Data Science

George Mason University (2019)

MS, Transportation Systems

Middle East Technical University (2015)

BS, Civil Engineering

Azad University (2009)

Publications
Expressed Toll Lane Pricing Policy

Transportmetrica A · 2026

Dynamic Pricing Toll Roads in the US

Case Studies on Transport Policy · 2024

Hidden Markov Modeling of Travel Behavior

Transportmetrica A · 2024

Generalized Bayesian Traffic Reliability

Transportation Research Part B · 2021

[Google Scholar Profile]