AI Engineer & Full-Stack Developer

I build advanced query systems & LLM pipelines that ship to production—fast.

  • • 14+ years in IT • 4+ years AI/Vector DB (pgvector, Redis, Pinecone)
  • • AWS Solutions Architect • Snowflake Core Pro
  • • Full‑stack: FastAPI, Streamlit, Next.js, WordPress
Works with
AWSAzure AIOpenAILlamaIndexPostgreSQL/pgvectorRedis
Saravanakumar Subramani portrait

About

I’m an AI application engineer, not a data scientist. I design and deliver advanced query systems, LLM pipelines, and robust APIs with a focus on retrieval quality, observability, and performance. No model fine‑tuning services here— I specialize in application engineering and production delivery.

  • • 14 years overall • 4 years AI & vector DB
  • • Certifications: AWS SAA, Developer, CCP • Snowflake Core Pro
  • • Stack: LlamaIndex, FastAPI, Streamlit, PostgreSQL/pgvector, Redis, Next.js

Core Focus

  • • RAG on pgvector/Redis
  • • Hybrid retrieval & filters
  • • Prompt & tool orchestration
  • • Caching & evals

Delivery

  • • FastAPI microservices
  • • Streamlit prototypes → prod
  • • Docker & CI/CD
  • • Cloudflare, AWS/Azure

Services

Outcome‑focused engineering. No buzzwords, just shipped systems.

Advanced Query Systems (RAG)

  • LlamaIndex on pgvector/Redis
  • Hybrid retrieval, metadata filters

LLM Pipelines

  • Prompts, tools, guardrails
  • Evals, caching, observability

APIs & Microservices

  • FastAPI, Streamlit
  • Docker, CI/CD

Cloud & WordPress

  • AWS/Azure, Cloudflare, Hetzner
  • Perf & cost tuning
LlamaIndexFastAPIPostgreSQL/pgvectorRedisBedrock/OpenAIAzure AINext.js

Featured Projects

DealCloser Assist placeholder

DealCloser Assist

  • • Live cues for reps during calls; fewer context switches.
  • Metrics: coming soon.
Next.jsFastAPILlamaIndexOpenAIpgvector
Clinical Summarization placeholder

Clinical Summarization (POC)

  • • Notes → structured summaries; faster clinical review.
  • Metrics: coming soon.
PythonBiomedBERTStreamlitPostgreSQL
Appeal Letter Automation placeholder

Appeal Letter Automation

  • • Drafts consistent medical appeal letters from docs/images.
  • Metrics: coming soon.
AWS Comprehend MedicalBedrockOpenAIS3Lambda

Deep Case Study — Clinical Summarization

  • Problem: Manual review of unstructured clinical notes is slow.
  • Approach: Extraction + rules → standardized summaries; human‑in‑the‑loop.
  • Stack: Python, BiomedBERT, Streamlit, PostgreSQL.
  • Result: Review time ↓, edits ↓, consistency ↑ (validating).

Planned metrics (to instrument):

  • • Avg. review time per case
  • • Extraction coverage (% fields filled)
  • • Manual edits per summary
  • • Clinician agreement rate (spot‑checks)

Content

Let’s Chat

Book time or drop a message. I respond quickly.