
Project Overview
A multimodal AI system that analyzes recorded video interviews using audio, video, and LLM-based semantic evaluation to generate objective, time-evolving candidate assessments.

System Architecture
Multimodal Temporal Analysis Pipeline (Hybrid Signal + LLM)
6-stage Analysis Pipeline
Audio/Video Feature Extraction
Whisper Speech-to-Text
AI Semantic Scorer (Qwen2.5-3B)
Temporal Performance Tracker
Interactive Analytics Dashboard
Input (AV/JD) → Parallel Extraction → Temporal Sync → Behavioral/Semantic Analysis → Output (JSON/Dashboard)
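The "Parallel Extraction → Temporal Sync" step can be illustrated with a minimal sketch. It assumes each modality yields (timestamp, value) pairs on its own sampling grid and that synchronization means matching each audio sample to the nearest video sample; the function names and sample data are hypothetical, not taken from the project.

```python
from bisect import bisect_left


def nearest(timestamps, t):
    """Return the index of the entry in sorted `timestamps` closest to t."""
    i = bisect_left(timestamps, t)
    if i == 0:
        return 0
    if i == len(timestamps):
        return len(timestamps) - 1
    # Pick whichever neighbour is closer; ties go to the earlier sample.
    return i if timestamps[i] - t < t - timestamps[i - 1] else i - 1


def align(audio, video):
    """Pair each audio sample with the nearest-in-time video sample."""
    video_times = [t for t, _ in video]
    return [(t, a, video[nearest(video_times, t)][1]) for t, a in audio]


# Hypothetical features sampled on slightly different grids (seconds).
audio = [(0.0, "a0"), (0.5, "a1"), (1.0, "a2")]
video = [(0.0, "v0"), (0.4, "v1"), (0.96, "v2")]
aligned = align(audio, video)
```

Nearest-neighbour matching is only one option; interpolation or fixed-window binning would serve the same purpose.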

Core Features
Multimodal analysis (audio + video + text)
LLM-based evaluation
Time-based scoring
Automated Q&A pairing
Behavioral metrics tracking
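As an illustration of time-based scoring, the sketch below averages per-segment scores into fixed time windows and reports a simple trend. The `ScoredSegment` type, the 60-second window, and the [0, 1] score range are assumptions made for the example; the project's actual scoring scheme is not described here.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class ScoredSegment:
    start: float  # seconds from the start of the recording
    end: float
    score: float  # hypothetical per-segment score in [0, 1]


def windowed_scores(segments, window_s=60.0):
    """Average segment scores per fixed time window, keyed by window index."""
    buckets = {}
    for seg in segments:
        # Assign each segment to the window that contains its midpoint.
        idx = int(((seg.start + seg.end) / 2) // window_s)
        buckets.setdefault(idx, []).append(seg.score)
    return {idx: mean(vals) for idx, vals in sorted(buckets.items())}


def trend(window_means):
    """Last window mean minus first window mean; positive means improvement."""
    values = list(window_means.values())
    return values[-1] - values[0] if len(values) > 1 else 0.0


segments = [
    ScoredSegment(0, 20, 0.4),
    ScoredSegment(25, 55, 0.6),
    ScoredSegment(65, 110, 0.8),
]
timeline = windowed_scores(segments)
change = trend(timeline)
```

Here the first two segments fall into window 0 and the third into window 1, so the timeline shows the score rising between the first and second minute.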
6-Stage Pipeline Breakdown
Stage 1: Timebase (Synchronization)
Stage 2: Extraction (Multimodal Features)
Stage 3: Temporal Segmentation
Stage 4: Behavioral Analysis
Stage 5: Semantic Scoring (LLM)
Stage 6: Aggregation (Final Verdict)
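One way to picture the six stages is as functions applied in order to a shared context. The sketch below is a skeleton under stated assumptions: every stage body is a placeholder (hard-coded features, a constant in place of the LLM score, an equal-weight blend for the verdict), and the file name is hypothetical and never opened.

```python
def timebase(ctx):
    # Stage 1: agree on one shared clock for all later measurements.
    return {**ctx, "unit": "seconds"}


def extraction(ctx):
    # Stage 2: placeholder (timestamp, value) features; real ones would
    # come from the audio, the video, and the transcript.
    return {**ctx, "features": [(0.0, 0.2), (4.0, 0.7), (12.0, 0.9)]}


def segmentation(ctx):
    # Stage 3: group features into fixed 10-second segments.
    segments = {}
    for t, value in ctx["features"]:
        segments.setdefault(int(t // 10), []).append(value)
    return {**ctx, "segments": segments}


def behavior_analysis(ctx):
    # Stage 4: per-segment mean as a stand-in behavioral metric.
    means = {k: sum(v) / len(v) for k, v in ctx["segments"].items()}
    return {**ctx, "behavior": means}


def semantic_scoring(ctx):
    # Stage 5: a constant stands in for the LLM-produced score.
    return {**ctx, "semantic": {k: 0.5 for k in ctx["segments"]}}


def aggregation(ctx):
    # Stage 6: equal-weight blend of both scores (an assumption).
    verdict = {
        k: 0.5 * ctx["behavior"][k] + 0.5 * ctx["semantic"][k]
        for k in ctx["segments"]
    }
    return {**ctx, "verdict": verdict}


PIPELINE = [
    ("Timebase", timebase),
    ("Extraction", extraction),
    ("Temporal Segmentation", segmentation),
    ("Behavior Analysis", behavior_analysis),
    ("Semantic Scoring", semantic_scoring),
    ("Aggregation", aggregation),
]


def run_pipeline(ctx):
    """Run each stage in order, recording which stages have completed."""
    for name, stage in PIPELINE:
        ctx = stage(ctx)
        ctx["completed"] = ctx.get("completed", []) + [name]
    return ctx


result = run_pipeline({"video": "interview.mp4"})
```

Keeping each stage a pure function of the context makes it easy to test a stage in isolation or swap a placeholder for a real implementation.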
Project Impact
Reduces hiring subjectivity using data-driven evaluation
Supports scalable automated interview analysis
Tracks candidate performance evolution over time

Explore More Work
Deep dive into other high-performance solutions.

A distributed file storage system that transforms GitHub repositories into a personal cloud filesystem using an HDFS-inspired architecture with encryption, deduplication, and multi-layer caching.
System Architecture
Distributed Node-Based Storage
NameNode (Index Repository) • DataNodes (GitHub Repositories) • Client (Web/Mobile App) • Edge Proxy (Cloudflare Workers)
Flow: Client → AES Encryption → Chunking → Parallel Upload → DataNodes → NameNode Index Update
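The chunking, deduplication, and index-update steps of this flow can be sketched in memory. This is an illustration only: plain dictionaries stand in for the DataNodes and the NameNode, the class and method names are hypothetical, chunks are placed round-robin, and the AES encryption and parallel upload steps are left out.

```python
import hashlib

CHUNK_SIZE = 4  # bytes; deliberately tiny so the example is easy to follow


def split_chunks(data, size=CHUNK_SIZE):
    """Split bytes into fixed-size chunks (the last one may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


class MiniStore:
    def __init__(self, n_nodes=2):
        # Each DataNode maps chunk hash -> chunk bytes.
        self.datanodes = [dict() for _ in range(n_nodes)]
        # Chunk hash -> index of the DataNode that holds it.
        self.locations = {}
        # NameNode index: file name -> ordered list of chunk hashes.
        self.namenode = {}

    def put(self, name, data):
        hashes = []
        for chunk in split_chunks(data):
            digest = hashlib.sha256(chunk).hexdigest()
            if digest not in self.locations:
                # New content: place it round-robin across DataNodes.
                node = len(self.locations) % len(self.datanodes)
                self.datanodes[node][digest] = chunk
                self.locations[digest] = node
            # Known content is only referenced again, never stored twice.
            hashes.append(digest)
        self.namenode[name] = hashes

    def get(self, name):
        return b"".join(
            self.datanodes[self.locations[d]][d] for d in self.namenode[name]
        )


store = MiniStore()
store.put("a.txt", b"abcdabcdwxyz")
restored = store.get("a.txt")
```

The file splits into three chunks, but only two distinct chunks are stored because `abcd` occurs twice.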

A microservices-based AI platform that analyzes products using real-time data, generates objective recommendations with Gemini AI, and helps users make data-driven purchasing decisions.
System Architecture
Microservices Architecture
Frontend (React) • Backend (Spring Boot) • AI Service (FastAPI + Gemini) • Database (PostgreSQL)
Flow: Search → Fetch → Analyze → Recommend

A full-stack AI-powered survey platform that generates intelligent questions using LLMs and provides real-time analytics with interactive visualizations.
System Architecture
AI-Powered Survey Architecture
Frontend: React + Vite + Tailwind • Backend: Node.js + Express • Database: Supabase • AI: OpenRouter (Claude 3.5)
Flow: User Prompt → OpenRouter LLM → Generated Survey → Supabase DB → Interactive Analytics

A multi-agent orchestration system where a central commander agent coordinates specialized sub-agents (researcher, analyzer, coder, assistant) to execute complex tasks autonomously, including web interaction and real-world action simulation.
System Architecture
Commander–Subagent Orchestration
Commander Agent (Orchestration) • Researcher Agent (Data Gathering) • Analyzer Agent (Reasoning) • Coder Agent (Execution) • Assistant Agent (Coordination)
Flow: Input → Task Decomposition → Delegation → Parallel Execution → Result Aggregation
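The decomposition, delegation, parallel execution, and aggregation steps can be sketched with a thread pool. In this sketch the sub-agents are plain functions returning strings, the decomposition is a fixed plan rather than an LLM decision, and the assistant agent is omitted; all names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def researcher(task):
    return f"research notes for: {task}"


def analyzer(task):
    return f"analysis of: {task}"


def coder(task):
    return f"code draft for: {task}"


AGENTS = {"research": researcher, "analyze": analyzer, "code": coder}


def decompose(request):
    # Fixed plan for illustration: every sub-agent receives the request.
    return [(role, request) for role in AGENTS]


def commander(request):
    """Delegate subtasks to sub-agents in parallel and collect the results."""
    subtasks = decompose(request)
    with ThreadPoolExecutor(max_workers=len(subtasks)) as pool:
        futures = [
            (role, pool.submit(AGENTS[role], task)) for role, task in subtasks
        ]
        # result() blocks until each sub-agent has finished.
        return {role: future.result() for role, future in futures}


report = commander("compare two sorting algorithms")
```

Threads suit this pattern when sub-agents mostly wait on network calls; CPU-bound agents would need processes instead.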

A scalable data annotation platform that manages the full labeling lifecycle with multi-level quality assurance, task orchestration, and integrated payment workflows.
System Architecture
Multi-Stage Workflow Engine
Task Ingestion & Splitting • L1 Annotation Queue • L2/L3 Review Pipeline • Escrow Payment Engine • Audit & Logging Layer
Flow: Upload → Split → L1 Annotate → L2 Review → L3 Approve → Export + Payout
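The flow above reads naturally as a small state machine. The sketch below assumes that a rejection at L2 or L3 sends the task back to L1 annotation, which the flow itself does not state; the state names and the `advance` function are hypothetical.

```python
from enum import Enum


class State(Enum):
    UPLOADED = "uploaded"
    SPLIT = "split"
    L1_ANNOTATE = "l1_annotate"
    L2_REVIEW = "l2_review"
    L3_APPROVE = "l3_approve"
    EXPORTED = "exported"  # export and payout happen on entering this state


# Forward transitions along the happy path.
NEXT = {
    State.UPLOADED: State.SPLIT,
    State.SPLIT: State.L1_ANNOTATE,
    State.L1_ANNOTATE: State.L2_REVIEW,
    State.L2_REVIEW: State.L3_APPROVE,
    State.L3_APPROVE: State.EXPORTED,
}

REVIEW_STATES = (State.L2_REVIEW, State.L3_APPROVE)


def advance(state, approved=True):
    """Return the next state; a rejected review goes back to L1 (assumption)."""
    if state is State.EXPORTED:
        raise ValueError("task is already exported")
    if not approved and state in REVIEW_STATES:
        return State.L1_ANNOTATE
    return NEXT[state]


# Walk one task through the happy path.
state = State.UPLOADED
history = [state]
while state is not State.EXPORTED:
    state = advance(state)
    history.append(state)
```

Making the transitions explicit data also gives the audit layer a single place to log every state change.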

A conversational AI system that allows users to interact with MySQL databases using natural language, generating and executing SQL queries with intelligent responses.

A dual-mode AI chatbot powered by Gemini that supports general conversations and intelligent movie recommendations using natural language queries and OMDB integration.

A full-stack inventory management system with real-time tracking, invoicing, barcode generation, and analytics for retail and warehouse operations.