MIDS Capstone Project Spring 2026

InterviewPro: Agentic AI Mock Interviewer

Problem & Motivation

Millions of job candidates face high-stakes interviews without adequate preparation—not due to lack of effort, but due to limited access to quality resources. Traditional mock interviews with human coaches are expensive, inconsistent, and difficult to scale. As a result, candidates struggle with:

  • Lack of access to expert feedback
  • No way to practice on demand
  • Inability to track improvement over time
  • Inconsistent evaluation standards

InterviewPro was built to bridge this interview readiness gap by providing an always-available, scalable, and objective mock interview system. 


Data Source & Data Science Approach

Data Source

The system leverages the RecruitView dataset, a multimodal behavioral dataset designed for interview performance research. It includes:

  • 2,011 video responses
  • 300+ participants
  • 76 curated interview questions
  • Multimodal signals: video, audio, and transcripts

This dataset provides realistic, diverse, and human-labeled benchmarks for evaluating interview performance. 

Data Science & Modeling Approach

InterviewPro integrates multiple AI components into an end-to-end pipeline:

1. Resume Parsing

  • Extracts structured information (projects, roles, experience) from PDFs/DOCX
  • Enables personalized question generation

2. Interview Planning (LLM)

  • Uses Amazon Nova Lite to generate tailored behavioral and resume-based questions
  • Produces a structured interview plan

3. Real-Time Interview Agent

  • Uses OpenAI Realtime for the live voice interview experience
  • Conducts live interviews with dynamic follow-up questions
  • Maintains conversational flow via tool-based orchestration

5. LLM-Based Scoring System

  • Uses a rubric-driven prompt to evaluate responses across dimensions:
    • Answer quality
    • Speaking skills
    • Confidence
    • STAR methodology adherence
  • Outputs structured JSON with:
    • Scores
    • Strengths
    • Weaknesses
    • Actionable suggestions

The system is designed with anti-score-inflation prompts to ensure consistent and fair evaluation. 


Evaluation

The model’s performance was validated against human-labeled RecruitView benchmarks:

  • 72% of scores fall within ±1.0 of ground truth
  • Mean difference: 0.77 (on a 0–10 scale)
  • Correlation (Pearson r): 0.40
  • Evaluation size: 200 samples (0 failures)

These results demonstrate strong alignment with human evaluation, validating the effectiveness of the LLM-based scoring rubric. 


Key Learnings & Impact

Key Learnings

  • Prompt design is critical: Structured rubrics significantly improve scoring consistency
  • Agentic systems enable realism: Dynamic questioning improves interview authenticity
  • Strict scoring constraints reduce bias and inflation

Impact

  • Enables 24/7 interview practice with no scheduling constraints
  • Provides objective, standardized feedback at scale
  • Empowers users to track improvement over time
  • Democratizes access to high-quality interview preparation

Overall, InterviewPro transforms interview preparation from a scarce, human-limited resource into a scalable, AI-driven experience. 


Acknowledgements

  • Course: DATASCI 210 Capstone, UC Berkeley
  • Instructor: Frederick Nugen, Ramesh Sarukkai
  • Team: Hamza Islam, Kyle Ruan, Rui Sun
  • Technologies & Platforms:

    • Amazon Nova Lite via AWS Bedrock for interview planning and feedback generation
    • OpenAI Realtime for the live voice interview runtime
    • WebRTC for low-latency live audio interaction
    • RecruitView dataset for benchmarking and evaluation analysis

     

Last updated: April 10, 2026