Hi! I'm Varun 👋🏼
I am a Software Engineer with over 3 years of experience building resilient, enterprise-grade backend systems. I'm currently working on decentralized edge architectures, focusing on reducing latency and resource usage in constrained environments.
Through my work and research, I aim to (A) design high-throughput data architectures that transform unstructured noise into verifiable signals, and (B) engineer privacy-first Generative AI systems that prioritize data sovereignty and architectural efficiency.
I am deeply interested in the systems side of AI: specifically, how we architect the infrastructure required to make large-scale models reliable, maintainable, and safe for real-world deployment.
Core Competencies
Languages
Java, Python, C++, TypeScript, JavaScript, SQL
Frameworks
Spring Boot, Spring, React, Angular, Node.js, LangChain, FastAPI
Cloud & DevOps
Docker, Kubernetes, CI/CD, GCP, Firebase
API & Data
RESTful Services, RabbitMQ, Kafka, Elasticsearch, Vector DBs
Practices & Tools
Agile, TDD, System Design, Postman, VS Code
Selected Work
Founder, Engineer
March 2024 – Present
Engineered a privacy-native Generative AI platform that runs entirely on local hardware, eliminating cloud dependencies and data latency. The system orchestrates a microservices architecture (Docker) connecting a vector database (Qdrant), a local LLM inference engine (Ollama/Llama 3), and asynchronous file-system watchers (Watchdog) to ingest and index multi-modal data streams in real time. Designed a dual-pipeline ingestion engine that separates high-level user profiling (via Mem0) from raw documentation indexing, enabling millisecond-latency semantic search across gigabytes of proprietary data.
python, docker, qdrant, llama-3, whisper, fastapi
Systems Executive
Nov 2021 – March 2022
Designed an automated Role-Based Access Control (RBAC) system using complex SQL/PL-SQL stored procedures to govern Power BI data security. Institutionalized Test-Driven Development (TDD) protocols for analytics modules, establishing rigorous unit and integration testing standards that eliminated regression defects.
sql, power-bi, tdd, database, automation
Event-Driven Log Analysis System
Winter 2024
Architected an event-driven microservices ecosystem for high-volume log ingestion, using Docker, FastAPI, and RabbitMQ so that processing stays asynchronous and resilient. Integrated a self-hosted Hugging Face model for real-time semantic enrichment, channeling data into an Elasticsearch-backed React dashboard for live visualization and full-text search.
python, data-engineering, docker, fastapi, rabbitmq, elasticsearch, hugging-face, react
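To illustrate the asynchronous decoupling described above, here is a minimal in-process sketch. It is not the project's code: `queue.Queue` stands in for the RabbitMQ broker, and `enrich`, `consume`, and `run_pipeline` are hypothetical names, with a keyword check replacing the self-hosted model.

```python
import queue
import threading


def enrich(record: dict) -> dict:
    """Hypothetical enrichment step; a keyword check stands in for the model."""
    level = "error" if "error" in record["message"].lower() else "info"
    return {**record, "level": level}


def consume(inbox: queue.Queue, sink: list) -> None:
    """Pull records off the queue until a None sentinel arrives."""
    while True:
        record = inbox.get()
        if record is None:
            inbox.task_done()
            break
        sink.append(enrich(record))
        inbox.task_done()


def run_pipeline(messages: list[str]) -> list[dict]:
    """Publish messages on one thread while a worker enriches them on another."""
    inbox: queue.Queue = queue.Queue()
    sink: list[dict] = []
    worker = threading.Thread(target=consume, args=(inbox, sink))
    worker.start()
    for message in messages:
        inbox.put({"message": message})  # the producer never waits on enrichment
    inbox.put(None)  # sentinel: tell the worker to stop
    worker.join()
    return sink
```

The producer only enqueues, so a slow enrichment step delays the consumer without blocking ingestion, which is the property a real broker provides across processes.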
Code Refactoring Agent
Spring 2025
Developed an autonomous, privacy-centric code refactoring agent utilizing LangChain and Llama 3.1 via Ollama for fully local execution. Architected a ReAct-based workflow integrated with ChromaDB for retrieval-augmented generation (RAG), enabling the agent to maintain cross-file context during static analysis. Implemented robust fail-safes using Python’s AST module to enforce strict syntax validation and prevent regression errors prior to committing file modifications.
langchain, python, code-refactoring, react, chroma-db, local-execution
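A minimal sketch of the kind of AST-based fail-safe described above, assuming the agent produces a complete replacement source string for each file. The helper names `is_valid_python` and `safe_write` are hypothetical and not taken from the project.

```python
import ast
import os
import tempfile


def is_valid_python(source: str) -> bool:
    """Return True if `source` parses as Python, False otherwise."""
    try:
        ast.parse(source)
    except (SyntaxError, ValueError):
        return False
    return True


def safe_write(path: str, new_source: str) -> bool:
    """Write `new_source` to `path` only if it parses; report whether it was written."""
    if not is_valid_python(new_source):
        return False  # reject the edit and leave the original file untouched
    # Write to a temporary file in the same directory, then swap it into place.
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    with os.fdopen(fd, "w", encoding="utf-8") as handle:
        handle.write(new_source)
    os.replace(tmp_path, path)
    return True
```

Parsing catches syntax errors only; it does not prove the refactored code behaves the same, so a check like this would sit in front of a test run rather than replace one.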
Product Engineer
March 2025 – June 2025
Engineered high-availability solutions for enterprise data pipelines, resolving complex race conditions in REST API workflows and database synchronization. Optimized query performance and data ingestion latency across distributed mobile/web platforms, directly hardening the product architecture against connection timeouts and SSL failures.
java, rest-api, debugging, sql, data-pipeline, customer-facing
TKnowScape
Technical Writer Intern
Dec 2020 – Jan 2021
Developed detailed technical documentation, including setup guides, feature specifications, and release notes, distilling complex engineering concepts into precise, user-oriented content. Standardized terminology, diagrams, and workflows to improve cross-team collaboration among developers, QA, and customer success.
technical documentation engineering, markdown, git, confluence
Research-OS
Spring 2025
Designed an autonomous RAG agent to eliminate source hallucination and automate academic curation within a local-first Llama 3 environment. Implemented a custom ETL workflow using n8n and JavaScript to stamp metadata directly onto vector embeddings in Qdrant, effectively resolving context fragmentation. This architecture delivers verifiable citation accuracy and demonstrates end-to-end system ownership.
generative-ai, n8n, qdrant, rag, docker
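The metadata-stamping idea can be sketched as follows. The original workflow used n8n and JavaScript; this Python version is only an illustration, with a placeholder `fake_embed` instead of a real embedding model and plain dictionaries shaped like id/vector/payload records instead of calls to a Qdrant client. All names here are hypothetical.

```python
from dataclasses import dataclass
import hashlib


@dataclass(frozen=True)
class SourceMeta:
    title: str
    authors: str
    year: int


def chunk_text(text: str, size: int) -> list[str]:
    """Split text into fixed-size character chunks (a deliberately naive splitter)."""
    return [text[i:i + size] for i in range(0, len(text), size)]


def fake_embed(chunk: str, dims: int = 4) -> list[float]:
    """Placeholder embedding: hash bytes scaled to [0, 1]. Not a real model."""
    digest = hashlib.sha256(chunk.encode("utf-8")).digest()
    return [byte / 255 for byte in digest[:dims]]


def build_points(text: str, meta: SourceMeta, size: int = 200) -> list[dict]:
    """Attach the source metadata to every chunk so no chunk is stored without it."""
    points = []
    for index, chunk in enumerate(chunk_text(text, size)):
        points.append({
            "id": index,
            "vector": fake_embed(chunk),
            "payload": {
                "text": chunk,
                "title": meta.title,
                "authors": meta.authors,
                "year": meta.year,
                "chunk_index": index,
            },
        })
    return points


def cite(point: dict) -> str:
    """Build a citation string from the metadata carried by a retrieved chunk."""
    p = point["payload"]
    return f'{p["authors"]} ({p["year"]}). {p["title"]}, chunk {p["chunk_index"]}'
```

Because each stored record carries its own title, authors, and year, a retrieved chunk can be cited from its payload alone, without asking the model to recall the source.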
HeatMap - Location Data Visualization Web Application
Spring 2024
react, google-maps-api, express, jquery
System Associate Engineer
March 2022 – March 2024
Architected a scalable Pega-based enterprise framework supporting 10,000+ concurrent users, integrating legacy systems via SOAP/REST web services for real-time data synchronization. Implemented zero-downtime deployment strategies using Pega Deployment Manager and enforced strict TDD patterns to ensure architectural resilience.
java, system-design, pega, rest-api, soap, tdd, agile, ci-cd
Local Llama RAG
Fall 2025
Engineered a fully offline, privacy-centric Retrieval-Augmented Generation (RAG) system using React, Flask, and Docker to host Meta’s Llama 3. Architected a persistent containerized vector pipeline utilizing ChromaDB to enable dynamic context ingestion and retrieval from local datasets. This solution demonstrates end-to-end ownership of secure generative AI infrastructure without reliance on external APIs.
python, generative-ai, llama-3, langchain, react, flask, docker, vector-db
BFF Playground
Spring 2025
Developed a local-first, reactive notebook environment that unifies backend logic and frontend UI execution within a single interface. Engineered a hybrid runtime architecture combining direct local Node.js access with sandboxed polyglot support (Python, Go) via the Piston API. Implemented a spreadsheet-like reactive data graph to automate state synchronization between backend processes and React components, significantly streamlining the full-stack prototyping workflow.
node.js, react, piston-api, full-stack, polyglot-runtime, reactive-data-graph
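A spreadsheet-like reactive data graph can be sketched in a few lines. This is an illustrative Python model, not the project's Node.js implementation; `ReactiveGraph` and its methods are hypothetical names, and the sketch assumes the dependency graph has no cycles.

```python
from typing import Any, Callable


class ReactiveGraph:
    """Cells hold plain values or formulas; changing a cell recomputes its dependents."""

    def __init__(self) -> None:
        self._values: dict[str, Any] = {}
        self._formulas: dict[str, tuple[list[str], Callable[..., Any]]] = {}
        self._dependents: dict[str, set[str]] = {}

    def set_value(self, name: str, value: Any) -> None:
        """Store an input value and push the change downstream."""
        self._values[name] = value
        self._propagate(name)

    def define(self, name: str, deps: list[str], fn: Callable[..., Any]) -> None:
        """Register a formula cell computed from the cells named in `deps`."""
        self._formulas[name] = (deps, fn)
        for dep in deps:
            self._dependents.setdefault(dep, set()).add(name)
        self._recompute(name)
        self._propagate(name)

    def get(self, name: str) -> Any:
        return self._values.get(name)

    def _recompute(self, name: str) -> None:
        deps, fn = self._formulas[name]
        # Only evaluate once every dependency has a value.
        if all(dep in self._values for dep in deps):
            self._values[name] = fn(*(self._values[dep] for dep in deps))

    def _propagate(self, name: str) -> None:
        # Depth-first walk; a shared dependent may be recomputed more than once.
        for dependent in sorted(self._dependents.get(name, ())):
            self._recompute(dependent)
            self._propagate(dependent)
```

A short usage example, where a backend value feeds a derived UI label:

```python
graph = ReactiveGraph()
graph.set_value("price", 10)
graph.set_value("qty", 3)
graph.define("total", ["price", "qty"], lambda price, qty: price * qty)
graph.define("label", ["total"], lambda total: f"Total: {total}")
graph.set_value("qty", 5)  # "total" and "label" update without manual wiring
```

A production version would likely sort cells topologically to avoid redundant recomputation and would detect cycles before they recurse.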
Conducted a comprehensive analysis of current literature contrasting the resurgence of ConvNeXt architectures against Vision Transformers (ViTs) alongside the paradigm shift toward Self-Supervised Learning. Identified critical technical gaps regarding model robustness against adversarial examples and data efficiency within medical imaging applications.
computer-vision, deep-learning, transformers, self-supervised-learning, literature-review











