Vraj Patel

M.S. Data Science student at Stony Brook University. I build low-latency financial analytics engines, distributed ML pipelines, and scalable data-driven applications. Experienced in optimizing WebSocket streams and implementing real-time options strategies.

Data Ingest: 182 snaps/s
Telemetry: <100 ms
Markets: 35k+
Technical Skills

Languages, frameworks, and tools I use to build scalable systems.

Python
JavaScript
TypeScript
React
Next.js
FastAPI
Node.js
Tailwind
AWS
GCP
Docker
Kubernetes
Linux
Git
PostgreSQL
MySQL
MongoDB
Redis
TensorFlow
PyTorch
scikit-learn
Pandas
NumPy
R
Spark
Kafka
Airflow
Java
Scala
Streamlit
Plotly
Tableau
GraphQL
HTML5
CSS3

Featured Projects

A selection of my recent work in full-stack development, quantitative data engineering, and real-time systems.

Lot Lab – Urban Planning Simulator
🏆 Environmental Track Winner – NVIDIA SparkHack NYC ($5,000)

Apr 2026

Built an on-device urban planning simulator for NYC vacant lots on the Acer GN100 (NVIDIA GB10 Grace Blackwell Superchip), ingesting 7+ NYC open datasets and scoring parcels across 12 human & environmental use types with GPU-accelerated RAPIDS pipelines. Implemented NVIDIA cuOpt district-level optimization producing population-weighted plans; wrapped the platform as an OpenClaw skill with Nemotron-driven recommendations and a local Flux render pipeline for concept visuals.

NVIDIA RAPIDS, cuOpt, Nemotron, Flux, FastAPI, React, MapLibre GL, OpenClaw
Polymarket Arbitrage – DeFi Prediction Market Engine
Mar 2026

Built a high-performance async data pipeline ingesting Polymarket order books at 182 snaps/sec across 35k+ active on-chain markets. Currently building a leading-indicator engine that applies signal processing to crowd-wisdom data, and testing the hypothesis that decentralized markets are immune to Goodhart’s Law.

Python, asyncio, SQLite, WebSockets, RESTful APIs, Pandas, Quantitative Finance
CampusPool (WolfiePool)
Feb 2026

Architected a geospatial clustering engine matching users by directional bearing, proximity, and time windows to dynamically calculate distance-weighted Uber fare splits. Integrated Google Gemini 2.5 Flash API for a context-aware chatbot and personalized CO2 reduction insights.

React, TypeScript, FastAPI, Supabase, Gemini API, ElevenLabs, Mapbox, Uber API
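
As a rough illustration of the bearing matching and distance-weighted fare split described above, the sketch below computes great-circle distance and initial bearing, then divides a fare in proportion to each rider's trip distance. The function names, the proportional weighting rule, and the sample values are assumptions for illustration, not CampusPool's actual implementation.

```python
import math

EARTH_RADIUS_KM = 6371.0


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two (lat, lon) points, in kilometres."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def initial_bearing_deg(lat1, lon1, lat2, lon2):
    """Compass bearing (0 = north, 90 = east) from point 1 towards point 2."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dlmb = math.radians(lon2 - lon1)
    y = math.sin(dlmb) * math.cos(phi2)
    x = math.cos(phi1) * math.sin(phi2) - math.sin(phi1) * math.cos(phi2) * math.cos(dlmb)
    return (math.degrees(math.atan2(y, x)) + 360.0) % 360.0


def split_fare(total_fare, distances_km):
    """Split a fare in proportion to each rider's distance (assumed weighting rule)."""
    total_km = sum(distances_km.values())
    if total_km <= 0:
        raise ValueError("total distance must be positive")
    # Rounding to cents can leave the shares a cent off the total.
    return {rider: round(total_fare * km / total_km, 2) for rider, km in distances_km.items()}


# Hypothetical riders sharing a $30 fare.
shares = split_fare(30.0, {"rider_a": 10.0, "rider_b": 20.0})
```

Riders whose bearings from the pickup point fall within some tolerance of each other could be grouped before the split; that threshold would be a tuning choice.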
SBU VibeCheck – Real-Time Campus Event Platform
Nov 2025

Hackathon-winning, production-grade event platform in talks for official university integration. Features a precision RAG chatbot using OpenAI gpt-4o over ChromaDB, automated AI pipelines with ~90% API cost reduction, and sub-200ms real-time sync via WebSockets.

React, FastAPI, PostgreSQL, ChromaDB, OpenAI gpt-4o, WebSockets, Auth0, GCP Cloud
VociTrade – Voice-First Automated Trading Engine
Dec 2024

Engineered a multimodal trading interface executing real-time NSE/BSE orders, proven to reduce latency by 60% and input errors by 80% in A/B testing. Integrates RAG for context-aware market sentiment analysis, demonstrating agentic AI workflow capabilities.

React, Python, FastAPI, WebSockets, Gemini Flash, Dhan API, ElevenLabs

Work Experience

Building scalable infrastructure and extracting signals from noise.

FinTech Data Scientist Intern

Flits (India)

Feb 2025 – May 2025
  • Built real-time options analytics engine processing NSE/BSE tick data, implementing asyncio WebSocket client with 40% reduction in monitoring overhead.
  • Engineered ETL pipeline parsing 170K-row instrument master file, applying SIC code mapping and outputting normalized PostgreSQL schema.
  • Developed sector rotation tracker correlating NIFTY movements, implementing z-score normalization to identify top decile outperformers.
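
The z-score step in the last bullet can be sketched roughly as follows. This is a minimal standard-library example: the sector names, the return figures, and the rule of taking the top tenth by rank are assumptions for illustration, not the production logic.

```python
import math
import statistics


def zscores(returns):
    """Map each key to its z-score: (value - mean) / population standard deviation."""
    if not returns:
        return {}
    values = list(returns.values())
    mu = statistics.mean(values)
    sigma = statistics.pstdev(values)
    if sigma == 0:
        # All values identical: no sector stands out.
        return {name: 0.0 for name in returns}
    return {name: (value - mu) / sigma for name, value in returns.items()}


def top_decile(returns):
    """Return the names in the top 10% by z-score (at least one if non-empty)."""
    scores = zscores(returns)
    if not scores:
        return []
    k = max(1, math.ceil(len(scores) / 10))
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:k]


# Hypothetical period returns for 20 sectors: S0 = 0.0, S1 = 1.0, and so on.
sample = {f"S{i}": float(i) for i in range(20)}
leaders = top_decile(sample)
```

Within a single window the ranking matches a ranking on raw returns; the normalization matters when scores are compared across windows with different volatility.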

Data Analyst Intern

YHONK – Noise Pollution Mitigation

May 2024 – Jun 2024
  • Built distributed web scraping pipeline (Selenium Grid) with rotating proxies, extracting 50K+ school records with exponential backoff retry logic.
  • Executed geospatial analysis on 2.6M GPS-tagged honking events, implementing PostGIS queries and R-tree indexing to compute violation hotspots.
  • Applied ARIMA time-series decomposition on hourly violation counts, detecting 171% weekday anomaly and 130% seasonality.
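
The exponential backoff retry logic mentioned in the first bullet follows a common pattern, sketched below. The function name, default delays, and jitter amount are assumptions; the scraper's real parameters are not given here.

```python
import random
import time


def retry_with_backoff(fetch, max_attempts=5, base_delay=1.0, max_delay=30.0, sleep=time.sleep):
    """Call fetch() until it succeeds, doubling the wait after each failure."""
    for attempt in range(max_attempts):
        try:
            return fetch()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the last error
            # Waits grow as base_delay * 1, 2, 4, ... and are capped at max_delay.
            delay = min(max_delay, base_delay * (2 ** attempt))
            # Jitter (up to 10% extra) keeps parallel workers from retrying in lockstep.
            sleep(delay + random.uniform(0, 0.1 * delay))
```

The `sleep` parameter exists only so the waits can be stubbed out in tests; in normal use the default `time.sleep` applies.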

International Research Intern

AIT Brain Lab (Thailand)

Jul 2023 – Aug 2023
  • Fine-tuned T5-base and FactorSum (BART-based) transformers on ParaSCI dataset, achieving 0.42 ROUGE-L score for abstractive summarization.
  • Built Flask REST API with Celery task queue for async inference, implementing request batching to reduce P95 latency from 4.2s to 1.8s.
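
Request batching of the kind mentioned in the last bullet can be illustrated with a small micro-batcher. This sketch uses plain asyncio rather than the Flask and Celery stack named above, and the class name, batch size, and wait window are all assumptions.

```python
import asyncio


class MicroBatcher:
    """Group individual requests into batches before calling the model once per batch."""

    def __init__(self, infer_batch, max_batch=8, max_wait_s=0.05):
        self._infer_batch = infer_batch  # callable: list of inputs -> list of outputs
        self._max_batch = max_batch
        self._max_wait_s = max_wait_s
        self._queue = asyncio.Queue()

    async def submit(self, text):
        """Enqueue one request and wait for its individual result."""
        future = asyncio.get_running_loop().create_future()
        await self._queue.put((text, future))
        return await future

    async def run(self):
        """Worker loop: take one request, wait briefly for more, then run one batch."""
        while True:
            batch = [await self._queue.get()]
            await asyncio.sleep(self._max_wait_s)  # let more requests accumulate
            while len(batch) < self._max_batch and not self._queue.empty():
                batch.append(self._queue.get_nowait())
            try:
                outputs = self._infer_batch([text for text, _ in batch])
            except Exception as exc:
                # Propagate the failure to every caller in this batch.
                for _, future in batch:
                    if not future.done():
                        future.set_exception(exc)
                continue
            for (_, future), output in zip(batch, outputs):
                if not future.done():
                    future.set_result(output)
```

The fixed wait window trades a small added delay per request for fewer model calls; whether that improves tail latency depends on the model's per-call overhead.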

Education

Academic background and degrees.

Stony Brook University
2025 – 2027
M.S. in Data Science
New York, USA
Pandit Deendayal Energy University
2021 – 2025
B.Tech in Computer Engineering
India
CGPA: 8.72

Certifications

AWS Solutions Architect (Dec 2025)
Alteryx Designer Core (2027)
NLP Program, AIT (2023)

Get In Touch

Have a question or want to work together? Drop me a message and I'll get back to you as soon as possible.