Open to roles · available nowFrankfurt + Remote DACHEU work-authorizedFull Stack AI · Data · GenAI

Siddharth
Jain.

|

Built production AI across manufacturing, healthcare, and research. Now leaping into Finance × AI — MSc at Frankfurt School, CFA Level 1 in progress, building agentic systems for fintech and banking.

14
BI Dashboards Shipped
7
GenAI Bots in Production
300+
Daily Enterprise Users
3
Countries Worked In

RECRUITER MODE

Too long to read?

Skip the scroll. Ask my AI anything — get the whole story in 60 seconds.

Scroll

01 / About

More than a
resume.

I'm a Full Stack AI Engineer who's led enterprise AI transformation at CEO level, conducted medical-imaging research at the Coulter BME department (Georgia Tech × Emory), and done cybersecurity work at Georgia Tech — all before finishing my master's degree.

Off the screen, I'm a national-level basketball player, a geopolitics obsessive, an active debater, and a die-hard Manchester United supporter. I think the best engineers are curious about everything — not just code.

22
Years old
3
Countries
7
GenAI chatbots shipped
siddharth.json
{
"name": "Siddharth Jain",
"age": 22,
"location": "Frankfurt, Germany",
"role": "Full Stack AI Engineer",
"currently": {
"studying": "MSc AI @ Frankfurt School",
"open_to": "new opportunities"
},
"shipped": {
"bi_dashboards": 14,
"genai_bots": 7,
"kpis_consolidated": "250+",
"enterprise_users": "300+",
"plants": 10
},
"experience": [
"Suzlon Energy (AI Eng)",
"Georgia Tech (Cybersec)",
"Coulter BME, GT × Emory",
"IIT Jammu (5G Research)"
],
"interests": [
"basketball", "geopolitics", "MUFC", "debate"
],
"status": "open_to_opportunities"
}

Resume

The full one-page PDF — ATS-friendly and always current.

Download Resume

Life in frames

Georgia Tech, Atlanta

Georgia Tech, Atlanta

with the squad

with the squad

good times

good times

Kashmir, India

Kashmir, India

out exploring

out exploring

adventure mode

adventure mode

family first

family first

somewhere in the world

somewhere in the world

02 / Skills

Four domains. One stack.

Pick a role — see what I ship, what I ship it with, and where it overlaps with the others.

01 / 04

Data Engineer

Pipelines & lakehouses at scale.

50+ tables fed downstream

What I ship

  • 01Production pipelines moving TB-scale data daily
  • 02Bronze → Silver → Gold lakehouse layers
  • 03Real-time ingestion with Kafka + Spark Streaming

Stack

SparkKafkaAirflowdbtSnowflakeDatabricksAWS Glue

Overlaps with

Tap a role above — or click an overlap to jump

FOUNDATION

In every single project.

These seven carry everything above — whichever role I step into, whichever team I join.

Python

SQL

AWS

Azure

Docker

Git · CI/CD

Linux

Frameworks & tools in daily use

LangChainLangGraphChromaDBPineconeStreamlitFastAPIOpenAIAnthropic ClaudeGroqGeminiPySparkHadoopHugging FaceMLflowDockerKubernetesTerraformGitHub ActionsAWS GlueSnowflakeDatabricksdbtAirflowKafkaPower BITableauLookerPyTorchTensorFlowScikit-learnLangChainLangGraphChromaDBPineconeStreamlitFastAPIOpenAIAnthropic ClaudeGroqGeminiPySparkHadoopHugging FaceMLflowDockerKubernetesTerraformGitHub ActionsAWS GlueSnowflakeDatabricksdbtAirflowKafkaPower BITableauLookerPyTorchTensorFlowScikit-learn

03 / Projects

Things I've Built

Live systems you can try yourself — each with a running demo.

Gen AI
LIVE
Personal Project

CADUCEUS — Virtual Molecular Tumor Board

Give it a full oncology case — pathology, radiology, genomics, labs — and 55+ specialist agents across 7 layers deliberate in parallel (map-reduce fan-out pushes genuine concurrency past 40, ~150 at peak) to produce a fully-cited, guideline-concordant treatment recommendation with documented dissent, a provenance audit trail, and human-in-the-loop checkpoints. Runs offline on synthetic cases, or LIVE on a real de-identified TCGA patient — variants annotated against CIViC, drug safety from openFDA, trials from ClinicalTrials.gov, literature from Europe PMC. Every claim traces to a retrievable source; a clinician holds the final gate. You watch the 56-agent board light up live over SSE.

56 agents · 100% claims cited · real TCGA patients · keyless
MethodologyHierarchical supervisor over departmental sub-graphs; the four diagnostic departments run as one parallel wave, while the Variant / Trial / Abstract agents fan out one worker per item (~150 concurrent at peak). A dedicated Dissent / Red-Team agent argues against the consensus and a Citation Validator blocks any unsourced claim — 100% citation grounding. Decision support, not decision making; built only on free, public, synthetic or de-identified data; not an FDA-cleared device.
Multi-AgentNext.jsSSECIViCTCGAopenFDAHealthcare AI
Gen AI
LIVE
Personal Project

PRAETOR — Agentic M&A Due-Diligence Engine

Pick any company on a 3D world globe (searched live across the global GLEIF registry), and a team of 9 specialist agents — integrity, financials, legal, tax, commercial, HR, cyber, ESG, and a live web pulse — run due diligence in parallel on real public data. They cross-reference each other into one red-flag register, then TRIBUNAL renders a defensible GO / NO-GO / CONDITIONAL verdict. Every claim traces to its source (SEC EDGAR filing, OFAC/GLEIF record, or web article); a human holds the final gate. You watch the agents work — and reference each other — live over SSE.

9 agents · every claim cited · live web pulse
MethodologyAgents EXTRACT findings deterministically from real sources (always cited); the LLM only narrates and ranks. A provenance invariant is enforced centrally — no finding without a source_ref. An adversarial 2-verifier audit drove false-positive control (e.g. an offensive IP suit is an asset, not a liability; 0 false-HIGH on the First Solar case).
Multi-AgentGroqFastAPISSEReact3D Globe
Gen AI
LIVE
Personal Project

RegRadar — EU Regulatory-Impact Engine

Agentic system that watches the EU regulatory firehose (EUR-Lex / CELLAR), extracts the concrete obligations from each act, maps them to a bank's systems, ranks them by deadline and risk, and drafts the gap-assessment memo in English and German. Every claim is citation-verified against the live EUR-Lex source — programmatically, with a human-approval gate.

100% citation integrity · F1 0.957 on DORA
MethodologyF1 0.957 — live extraction graded vs a hand-labeled DORA oracle (Reg (EU) 2022/2554, 64 EUR-Lex articles); precision 0.917 · recall 1.000; 100% citation integrity on accepted obligations.
Multi-AgentEU RegulationRAGFastAPIGroqEUR-Lex
Data Science
LIVE
Personal Project

CreditForge — Bank-Grade Credit Risk Platform

End-to-end credit-risk stack built to Basel / IRB methodology on Freddie Mac mortgage data. WoE scorecard plus a LightGBM challenger for PD, joined with LGD and EAD into Expected Loss. Leakage-safe point-in-time targets, out-of-time validation, SHAP reason codes, fairness testing, and drift monitoring. A "Risk Copilot" agent team sits on top of the platform tools.

PD · LGD · EAD → EL · Basel / IRB
MethodologyOut-of-time vintage split on the Freddie Mac Single-Family sample; PD validated by Gini/KS, isotonic calibration, and PSI stability — CI-gated against thresholds.
LightGBMPD / LGD / EADSHAPBasel / IRBFastAPINext.js
Data Engineering
LIVE
Personal Project

AEGIS Live — Real-Time Streaming AML

A live surveillance wall that scores real Bitcoin-mempool transactions for money-laundering risk the instant they arrive. Resilient WebSocket ingestion → event bus with backpressure → a sliding-window transaction graph → an ensemble of sanctions screening, LightGBM, and anomaly detection → subgraph-explained alerts with LLM-drafted SARs. Fast-path p95 in single-digit milliseconds. Built solo for $0.

Live BTC mempool · single-digit-ms p95 latency
MethodologyFast-path p95 in single-digit milliseconds (CI-gated); detection PR-AUC measured on a synthetic AMLSim-style typology set (Elliptic-swappable). Sanctions exact-hits are ground truth; live risk scores are predictions pending review.
StreamingAMLLightGBMGraphWebSocketFastAPI
Gen AI
LIVE
Personal Project

Recoupe — Autonomous Subrogation

Multi-agent platform that reads closed insurance claims, assigns fault by jurisdiction, computes the recoverable amount, drafts demand letters, and works counter-offers. Seven specialized agents over a RAG layer of US negligence law + carrier behaviour. Every decision citation-grounded and auditable.

7 agents · citation-grounded
MethodologyGraded on synthetic claims with known-true fault & recoverable values: liability MAE, quantum error, and citation integrity. Deterministic math; the LLM only extracts and narrates.
Multi-AgentRAGGroqFastAPIInsurance AI
Source privateCase study
Gen AI
LIVE
Personal Project

QUORUM — AI Investment Committee

A simulated investment committee of specialized agents — bull, bear, macro strategist, quant/risk officer, PM, and critic — that argue from real market data across structured debate rounds and converge on a documented allocation, with a human holding the final gate. Every number is computed deterministically in Python (SEC EDGAR, prices, FRED); the LLM only narrates. Paper-only, point-in-time backtested vs SPY.

6 agents · deterministic numbers · backtested vs SPY
MethodologyPoint-in-time backtest vs SPY on a $10k paper book, trading costs included — reported as directional, not an alpha claim. Numbers are deterministic Python; a grounding guardrail rejects any unsourced figure.
Multi-AgentSEC EDGARBacktestingFastAPINext.jsSSE
Gen AI
LIVE
Personal Project

AEOLUS — Renewable Fleet Operations Brain

A governance-first multi-agent "operations brain" for a wind fleet: it detects a degrading turbine, root-causes it, prices acting now vs later against the live German power market and weather, schedules the cheapest safe maintenance window with an OR-Tools CP-SAT solver, drafts the work order, and routes it for one-click human approval — every action behind a policy gate and an immutable hash-chained audit log. The LLM reasons; OR-Tools does the math.

OR-Tools economic scheduling · human-gated agents
MethodologyReal Kelmarsh SCADA (6× Senvion MM92, 2016, CC-BY-4.0) scaled to a 20-turbine fleet; live German day-ahead prices (energy-charts) + Open-Meteo. "Value protected" = lost generation avoided by the cheapest safe OR-Tools window + P(failure)×(unplanned−planned); normality models trained on the clean baseline only.
LangGraphOR-ToolsMulti-AgentLakehouseFastAPIReact
Gen AI
LIVE
Personal Project

Dam Rehabilitation Chatbot

AI-powered chatbot for dam condition assessment and rehabilitation planning. Guides engineers through structural inspection, interprets damage data, and recommends maintenance strategies.

Civil Infrastructure AI · Live
MethodologyDemonstration assistant — not a benchmarked model.
StreamlitPythonLLMAI ChatbotCivil AI
Code

More on github.com/sidnov6

04 / Stack

From source systems to intelligent action.

Most data engineers stop at the warehouse. Most ML engineers start there. I own the entire journey — from connecting to enterprise systems to the AI products built on top.

SOURCES

SAP

SAP ERP

Enterprise Resource Planning

SQL

SQL Server

.NET Portals & Internal Apps

SF

Salesforce

CRM & Sales Pipeline

SP

SharePoint

Documents & Files

WD

Workday

HR & Payroll Records

TRANSFORM

Pipeline

ETL · ELT · Real-time + Batch

Spark · dbt · Airflow

LAKEHOUSE

LAKEHOUSE
BRONZERaw
SILVERCleaned
GOLDBusiness-ready

Snowflake · Databricks

AI / BI

BI Dashboards

Power BI · Tableau

GenAI Chatbots

LangChain · RAG · LLMs

ML Models

PyTorch · Scikit-learn

Predictions

Forecasting · Anomaly

ONE PERSON · FULL STACK · END-TO-END OWNERSHIP

LIVE DATA FLOW
01

Source Integration

Direct connectors to enterprise systems — SAP ERP, SQL Server, Salesforce, SharePoint, Workday. Real-time CDC streams or scheduled batch pulls, whatever the system demands.

02

Transformation & Modeling

ETL/ELT pipelines built with Airflow, dbt, and Apache Spark. Data quality, lineage, and orchestration baked in. Refresh cadence configurable from real-time streaming to nightly batches.

03

Lakehouse Architecture

Medallion design on Snowflake or Databricks. Bronze (raw) → Silver (cleaned) → Gold (business-ready). One governed source of truth for the entire organization.

04

AI & BI Layer

Power BI dashboards, GenAI chatbots, ML models, and predictive analytics — all consuming from the same governed gold layer. End users get answers, not pipelines.

05 / Journey

Full-time · India's leading wind-energy company

Suzlon Energy

AI Engineer — CEO Office, Manufacturing

Manufacturing AI, Data & Digitisation Strategy

Jun 2025 – Jun 2026 · Pune, India

Drove AI, BI, and data initiatives across Suzlon's manufacturing organisation — 10 plants in Asia and Europe, 300+ daily users, 14 BI dashboards, 7 production GenAI bots, 250+ KPIs.

Wind Energy

scroll to explore

05 → The Pivot

Manufacturing was yesterday.
Finance is today.

One year of manufacturing AI at the CEO Office showed me what AI can do at scale. Now — back at Frankfurt School full-time — I am taking that playbook to a $50 trillion industry: finance.

Jun 2025 — Jun 2026

Yesterday

Manufacturing AI

Suzlon Energy · CEO Office

14
dashboards
10
plants
300+
users
SAPSparkPower BIGenAI

THE LEAP

2026 →

2026 →
LIVE

Today

Finance AI

Quant · Fintech · Bank AI

$50T+
industry
CFA
L1 — pursuing
potential
Agentic AIQuant MLRAGCFA

Why now

Finance needs AI — urgently.

$15.7T

AI's contribution to global GDP by 2030

PwC, Sizing the Prize

$340B

Annual GenAI value potential in banking

McKinsey GenAI Report

$97B

Financial-services AI spend by 2027

IDC, 2024

95%

Finance leaders deploying GenAI

Accenture / Citi GPS

What the industry says

The most influential voices are aligned.

JD

Jamie Dimon

CEO, JPMorgan Chase

AI may prove to be as transformational as some of the major technological inventions of the past several hundred years — printing press, electricity, the internet.

Annual Letter, 2024

JF

Jane Fraser

CEO, Citigroup

GenAI will turn knowledge workers into super-knowledge workers. It will revolutionize financial services in the next decade.

Citi GPS Report, 2024

LF

Larry Fink

CEO, BlackRock

AI has the potential to transform every aspect of our business and the broader financial system — from research, to risk, to client service.

Davos 2024

TP

Ted Pick

CEO, Morgan Stanley

Wealth management may well be the most powerful AI use case in our industry. We are only scratching the surface.

Q1 Earnings Call, 2024

What I want to build

AI × Finance: the use cases I am excited about.

Algorithmic Trading

ML-driven strategies, sentiment models, high-frequency signals

Fraud Detection

Real-time anomaly detection across millions of transactions

Credit Risk ML

Alt-data scoring beyond FICO — fairer, faster, more accurate

Robo-Advisors

Autonomous portfolio construction, rebalancing, tax loss harvest

Document AI

10-Ks, prospectuses, contracts, ISDA agreements — at scale

Agentic Research

AI agents that read filings, news, and build investment theses

RegTech & Compliance

AML, KYC, transaction monitoring — automated, auditable

Earnings NLP

Sentiment + tone analysis on earnings calls and shareholder letters

Bridging the gap

Technical depth meets finance fluency.

To build the right AI agents for finance, you need to speak the language of money. So I am not just leveling up technically — I am pursuing the CFA Level 1.

CFA · LEVEL 1 · PURSUING

Currently studying

2026 sitting · not yet cleared

Ethics
Quant
Econ
FSA
Corp
Equity
Fixed
Deriv
Alt
PM
Study progress35%
+

AI ENGINEERING · LEVELING UP

Deeper in the stack

Agentic + RAG systems

LangGraph
Agentic AI
RAG
Vector DBs
MCP
LLM Eval
FastAPI
Streaming
PyTorch
Quant ML
Daily buildingACTIVE
=

AI agents that actually understand finance.

How you can help

I’m looking for people who
want to take this leap with me.

If you build, hire, mentor, or invest in Finance + AI — or know someone who does — please reach out.

Roles in fintech, quant, or bank AI teams

Mentors with finance + ML backgrounds

Collaborators on agentic finance projects

Intros to teams shipping AI in production

06 / Education

Academic Foundations

From VIT Vellore to Frankfurt School — building the foundations that power real-world enterprise AI.

Frankfurt School of Finance & Management
In Progress
FS

FT #32 (Finance & Mgmt) · Frankfurt

MSc Artificial Intelligence & Data Science

Frankfurt School of Finance & Management

2026 – 2028 · Frankfurt, Germany 🇩🇪

Pursuing an MSc in AI & Data Science at one of Europe's leading finance and management schools, located in the heart of Frankfurt — Germany's financial capital.

  • Ranked #32 worldwide — among the strongest global positions in Finance & Management (FT Global Rankings)
  • Specialising in applied AI for financial and industrial systems
  • Located in Frankfurt — the financial capital of continental Europe
  • Research focus: enterprise LLMs, agentic finance, and data-intensive AI systems

Key Courses

Machine LearningDeep LearningNLP & LLMsData EngineeringAI EthicsCloud ComputingResearch Methods
VIT Vellore
Completed
VIo

#12 India · Vellore

B.Tech — Information Technology

VIT Vellore

2021 – 2025 · Vellore, Tamil Nadu, India 🇮🇳

Completed B.Tech in IT from one of India's top-ranked private engineering universities, while completing 3 international research internships across the USA and India.

  • Ranked #12 in India among top universities (NIRF + QS rankings)
  • Completed 3 international research internships during undergrad — Georgia Tech, Coulter BME (GT × Emory), IIT Jammu
  • National-level basketball player throughout all 4 years
  • Led the ACM Student Chapter as Operations & Marketing Head — raised $11K in sponsorships

Key Courses

Data Structures & AlgorithmsDatabase SystemsMachine LearningComputer NetworksOSSoftware EngineeringLinear Algebra & Probability

07 / Life

Beyond the Terminal

What happens when the terminal is closed.

🇮🇳 Vellore🇮🇳 Jammu🇺🇸 Atlanta🇮🇳 Pune🇩🇪 Frankfurt— all by 22
Basketball team lineup
DPS VASANT KUNJ · NATIONAL LEVEL
Point guard · Played nationally

National Level Basketball

Basketball isn't a hobby — it's where I learned everything that matters: reading situations in real time, leading under pressure, trusting teammates. Playing at national level shaped how I think about systems, strategy, and performing when it counts.

National-level competitorPoint guardTeam leadershipPressure performance
Manchester United
GLORY GLORY MAN UNITED
Red Devils · Theatre of Dreams

Manchester United

Die-hard Red Devil through every trophy and every rebuild. Supporting United taught me loyalty, resilience through brutal seasons, and absolute belief in systems even when short-term results disagree. Glory Glory Man United — we always come back.

Die-hard supporterPremier League analystChampions LeagueGlory Glory
Geopolitics
THE GRAND CHESSBOARD
The grand chessboard

Geopolitics & World Affairs

I follow geopolitics the way some people follow football — obsessively, analytically, with strong opinions. From Indo-Pacific power shifts to EU economic architecture, from energy markets to defense policy. Understanding power structures makes me a sharper systems architect.

Indo-Pacific dynamicsEU economic policyEnergy geopoliticsDefense strategy
Debate
Evidence-based argumentation

Debate & Oratory

Debating sharpens the mind in ways no technical course can. I participated in collegiate competitions and Model UN, and I believe deeply in structured argumentation and evidence-based reasoning. It's made me a sharper thinker and a much better presenter to CXO stakeholders.

Collegiate debateModel UN delegateCXO communicationEvidence-based reasoning

07 → Giving Back & Learning

Teach what I know. Learn what I need.

Taught 250+ students, raised $11,000 as a student ops lead — and every day, the next skill I need.

Giving Back

Volunteer Educator

Becoming I Foundation

Mar 2022 – Aug 2024 · Vellore, India

Python & Mathematics — 4 Government Schools

  • Taught Python programming and Mathematics to ~200 students across 4 government schools in Tamil Nadu.
  • Designed a beginner-friendly curriculum for students with no prior computer exposure.
  • Built long-term mentoring habits and public-speaking confidence through grassroots teaching.

Volunteer Educator

Suzlon Energy CSR

Jun 2025 – Jun 2026 · Pune, India

AI Literacy & Digital Inclusion

  • Ran weekly AI literacy sessions for children of plant staff and underserved communities.
  • Taught foundational AI, digital literacy, and logical problem-solving to 50+ students.
  • Translated complex ML concepts into accessible, beginner-friendly modules.

Operations & Marketing Head

ACM Student Chapter — VIT

Mar 2022 – Aug 2024 · VIT Vellore

Association for Computing Machinery

  • Raised ~$11,000 in sponsorships through 200+ cold calls and corporate outreach.
  • Ran end-to-end ops for multi-day events with 500+ participants — hackathons, ideathons, talks.
  • Led cross-functional student teams across operations, marketing, design, and tech.
250+ students taught$11K in sponsorships raised4 government schools reached3+ years volunteering

Always Learning

Agentic AI Systems

Daily · always shipping

Building agentic finance projects in LangGraph, MCP servers, RAG pipelines over financial documents, and LLM evaluation harnesses. Treat learning as a build practice, not a course list.

Reading & Research

Capital markets · geopolitics · AI papers

Following the agentic finance shift across JPMorgan, Citi, Goldman, Two Sigma. Reading earnings letters, CEO letters, and frontier AI research weekly. I write the best of it up on the blog.

"If I am not learning, I am not building. If I am not teaching, I am not learning."

personal mantra

08 / Writing

Notes on AI in Finance.

Where the agentic moment meets capital markets — what is happening, why it matters, and what I am building toward.

09 / Contact

Let's Connect

AI opportunity, research collaboration, or you just want to debate geopolitics — I'm all ears.

Open to Opportunities

Actively looking for full-time AI/ML engineering roles, research collaborations, and interesting problems to solve.

Full-timeRemote OKRelocation OpenResearch

Message sent directly — no email client needed