Available for Full-time Roles · F1 OPT Valid through 2028

Prudhvi Raj

I work across analytics, data engineering, and machine learning to build systems that turn data into decisions.

Python SQL Tableau Scikit-learn LLMs PySpark AWS Power BI

< About Me />

I Turn Data Into Intelligence

I'm a Data Analyst and AI/ML Engineer with a Master's in Computer Science from Pace University (3.7 GPA). I love uncovering patterns in data, building machine learning systems, and creating AI-powered applications. That means everything from LLM chatbots to fraud detection platforms with 96% accuracy. My work lives at the intersection of analytics, artificial intelligence, and scalable data engineering. I'm looking for full-time roles in Data Analytics, AI Engineering, or ML Engineering.

What Drives Me

I'm genuinely obsessed with one question: what is this data actually telling us? Whether that's building an AI assistant that explains customer churn in plain language, training models that catch fraud in real time, or designing dashboards that turn 10K+ data points into one clear story, I love making data actionable. Give me rigorous analysis, clean pipelines, and ML that actually holds up in production.

prudhvir1509@gmail.com

Pace University

New York, USA

MS, Computer Science

F1 OPT valid through 2028

DA / AI / ML Roles

Immediate Start

Age: 22

Records Processed Daily

Processing Time Reduction

Fraud Detection Accuracy

GPA @ Pace University

< Tech Stack />

Skills & Expertise

A diverse toolkit spanning AI/ML, analytics, data engineering, and cloud infrastructure

AI & ML Engineering

Scikit-learn XGBoost LSTM SHAP / LIME LLMs Prompt Engineering MLflow

Data Analysis & BI

Pandas NumPy Matplotlib Seaborn Tableau Power BI

Languages

Python SQL PySpark Git

Data Engineering

Airflow ETL / ELT Data Warehousing Dimensional Modeling Data Quality Batch Processing

Cloud & Infrastructure

AWS S3 AWS Lambda BigQuery Docker CI/CD AWS SageMaker

Databases

PostgreSQL MySQL Redshift DynamoDB

APIs & Interfaces

REST APIs FastAPI Streamlit

< Work History />

Professional Experience

Data Analyst

Tech Mynds Inc

Sep 2025 – Present United States · Hybrid

Current

Migrated 8 Excel-based monthly reports into parameterized SQL views across MySQL and a central data warehouse, cutting reporting time from 6 hours down to 45 minutes (87% reduction) and eliminating all engineering tickets for routine data pulls
Enforced 4 Python data quality checks at the staging layer covering row counts, null thresholds, schema validation, and referential integrity, protecting 12 Tableau dashboards in weekly leadership meetings and cutting data incidents by 60% in Q1
Refactored 15+ one-off SQL scripts into a shared version-controlled Git library, removing 30% of duplicate logic and cutting new-hire ramp-up from two sprints to one
Ran A/B tests on dashboard variants using Python statistical analysis; the winning layout cut time-to-insight by 22% and was rolled out across all 12 dashboards
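
The four staging-layer checks named above (row counts, null thresholds, schema validation, referential integrity) can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the production code: the function names, the `CheckResult` type, the sample `orders` rows, and the thresholds are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class CheckResult:
    name: str
    passed: bool
    detail: str


def check_row_count(rows, min_rows):
    """Fail when the staged batch is smaller than expected."""
    return CheckResult("row_count", len(rows) >= min_rows,
                       f"{len(rows)} rows (min {min_rows})")


def check_null_threshold(rows, column, max_null_ratio):
    """Fail when too large a share of values in `column` is missing."""
    nulls = sum(1 for r in rows if r.get(column) is None)
    ratio = nulls / len(rows) if rows else 0.0
    return CheckResult(f"nulls:{column}", ratio <= max_null_ratio,
                       f"{ratio:.0%} null")


def check_schema(rows, expected_types):
    """Fail on missing/extra columns or a non-null value of the wrong type."""
    for i, r in enumerate(rows):
        if set(r) != set(expected_types):
            return CheckResult("schema", False, f"row {i}: columns {sorted(r)}")
        for col, typ in expected_types.items():
            if r[col] is not None and not isinstance(r[col], typ):
                return CheckResult("schema", False,
                                   f"row {i}: {col} is not {typ.__name__}")
    return CheckResult("schema", True, "ok")


def check_referential_integrity(rows, fk_column, parent_keys):
    """Fail when a non-null foreign key has no matching parent key."""
    orphans = [r[fk_column] for r in rows
               if r.get(fk_column) is not None and r[fk_column] not in parent_keys]
    return CheckResult(f"fk:{fk_column}", not orphans, f"{len(orphans)} orphan(s)")


# Hypothetical staged batch: the third order points at a customer that does not exist.
orders = [
    {"order_id": 1, "customer_id": 10, "amount": 25.0},
    {"order_id": 2, "customer_id": 11, "amount": None},
    {"order_id": 3, "customer_id": 99, "amount": 12.5},
]
customer_ids = {10, 11}

results = [
    check_row_count(orders, min_rows=1),
    check_null_threshold(orders, "amount", max_null_ratio=0.5),
    check_schema(orders, {"order_id": int, "customer_id": int, "amount": float}),
    check_referential_integrity(orders, "customer_id", customer_ids),
]
failed = [r.name for r in results if not r.passed]
print(failed)  # ['fk:customer_id']
```

In a setup like this, a non-empty `failed` list would block the load before it reaches the dashboards; how the real pipeline reacts to a failure is not stated here.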

Data Engineer

Urpan Technologies Inc

Sep 2024 – Jul 2025 Remote, USA

Consolidated churn and revenue data from 4 separate sources (relational DB, CSV, REST API, webhooks) into a unified Python ELT pipeline, trimming per-sprint analytics prep by 15% and freeing the team from weekly manual data-stitching
Rewrote time-series processing on 50K+ customer records using vectorized Pandas with chunked I/O, cutting pipeline run time by 20% and enabling daily trend refreshes instead of weekly batch runs
Added a query-result caching layer; response time dropped from 1.2 seconds to 0.8 seconds, redundant API calls fell 40%, and the 3 most-used reports loaded in under 1 second for the first time
Modeled star-schema tables across 3 domains (churn, revenue, product usage), giving analysts direct access to clean fact tables and cutting ad-hoc engineering requests by 35% within one quarter
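
One common way to build a query-result caching layer like the one mentioned above is an in-memory cache with a time-to-live. The sketch below assumes that approach; the actual design is not described here. `QueryCache`, `fake_backend`, the table name in the query, and the 300-second TTL are all hypothetical.

```python
import time


class QueryCache:
    """In-memory cache keyed by (query, params) with a time-to-live."""

    def __init__(self, ttl_seconds, clock=time.monotonic):
        self.ttl = ttl_seconds
        self.clock = clock          # injectable so expiry can be tested without sleeping
        self._store = {}            # key -> (stored_at, result)
        self.hits = 0
        self.misses = 0

    def get_or_run(self, query, params, run_query):
        key = (query, tuple(params))
        now = self.clock()
        entry = self._store.get(key)
        if entry is not None and now - entry[0] < self.ttl:
            self.hits += 1
            return entry[1]
        # Missing or expired: run the query and remember the result.
        self.misses += 1
        result = run_query(query, params)
        self._store[key] = (now, result)
        return result


# Stand-in for the real database or API call; records every invocation.
calls = []


def fake_backend(query, params):
    calls.append((query, params))
    return [("2025-01", 42)]


fake_time = [0.0]
cache = QueryCache(ttl_seconds=300, clock=lambda: fake_time[0])
sql = "SELECT month, churned FROM churn_monthly WHERE segment = %s"

cache.get_or_run(sql, ("smb",), fake_backend)   # miss: backend is called
cache.get_or_run(sql, ("smb",), fake_backend)   # hit: served from memory
fake_time[0] = 301.0
cache.get_or_run(sql, ("smb",), fake_backend)   # entry expired: backend is called again

print(len(calls), cache.hits, cache.misses)  # 2 1 2
```

The TTL trades freshness for speed: a longer window removes more redundant calls but lets reports lag further behind the source data.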

Data Analyst

Vimtra Ventures Pvt. Ltd

Dec 2021 – Jul 2023 Chennai, India · On-site

Built a PySpark ETL pipeline loading 120K+ real estate transaction records daily via incremental refreshes, replacing a manual spreadsheet process that consumed 10 to 12 hours per week
Designed a Python NLP pipeline (spaCy, regex, NER) parsing 50+ investment memos per week into structured database tables, shrinking deal screening from 5 days to 3 days
Delivered Power BI dashboards tracking 8 investment KPIs (including occupancy rate, rent yield, cap rate, and ROI) used by 5+ executives in bi-weekly investment committee meetings, saving 4 hours of prep per session
Built an ML deal-scoring model ranking opportunities across 7 financial criteria (including IRR, cap rate, and DCR), cutting manual review from 7 days to 2 days

< Academic Background />

My Education

Master of Science

Computer Science

Pace University — New York, USA

Sep 2023 – May 2025 GPA: 3.7 / 4.0

Champion, Pace University Intercollegiate Chess Tournament: outplayed 10 participants across elimination rounds, demonstrating strategic thinking.

Bachelor of Technology

Computer Science Engineering

KL University Hyderabad — India

Jun 2019 – May 2023 CGPA: 8.55 / 10.0

Best Vice-President, Cybersecurity Club — organized 12+ workshops, grew membership 35%, forged industry partnerships.

< Featured Work />

AI, ML & Analytics Projects

Building intelligent systems from data, end to end

AI / LLM

Muffin — GenAI Assistant

AI chatbot powered by Gemini 2.0 Flash that explains customer churn risk in plain language using advanced NLP and semantic similarity. Bridges the gap between raw ML predictions and human-readable insights.

Gemini 2.0 SentenceTransformers Streamlit NLP

ML / AWS

Fraud Detection Platform

Real-time fraud detection on AWS with 96% accuracy and <120ms response time. XGBoost + PySpark pipeline with MLflow experiment tracking and DynamoDB for real-time storage.

XGBoost PySpark AWS SageMaker MLflow

Analytics + ML