CareAssist Banner Logo
MIDS Capstone Project Spring 2026

CareAssist: An AI-Driven Case Prioritization Tool for Foster Care

Motivation

There are over 400,000 children in the U.S. foster care system at any given time, and roughly 38% of placements end in disruption—meaning the child is uprooted and moved again. Each move compounds trauma, severs relationships with teachers, therapists, and peers, and disrupts the stability children need to heal. For our team, this problem is deeply personal: one of our teammates, Samantha Townsend, grew up as the eldest sister in a foster home and witnessed firsthand how placement instability and fragmented information systems affect children and the families who care for them. Her experience directly shaped how we approached every design decision in CareAssist.

Social workers are the frontline defense for these children, yet they manage caseloads of 15–30+ cases using monthly visits, scattered notes, and county-specific systems that do not communicate with each other. When a child moves placements, sometimes across county lines, their medical history, behavioral data, school records, and prior case notes do not automatically follow them. Critical context is lost at the exact moment it matters most.

CareAssist addresses this gap with a unified, AI-powered platform that preserves a child's complete history, surfaces machine-learning-driven early warnings of placement disruption, explains why each child is flagged using SHAP-based feature attribution, and provides an AI assistant that lets caseworkers query their caseload in natural language—helping them intervene before placements fail, not after.

Data Source

The primary data source is the federal AFCARS (Adoption and Foster Care Analysis and Reporting System) dataset, containing case-level records submitted by all 50 states to the U.S. Children's Bureau. We used approximately 5.76 million de-identified records spanning fiscal years 2020–2024, with 350,000–400,000 new records per year. Key fields include placement history, removal reasons, demographics, disability and behavioral indicators, permanency goals, and case outcomes.

Data Science Approach

Predictive Modeling

CareAssist predicts placement disruption—whether a child's current living arrangement will break down within 12 months—using a weighted ensemble of four models: XGBoost (35%), LightGBM (35%), CatBoost (20%), and a multilayer perceptron (10%). Each model was independently tuned with 50 Optuna trials and 5-fold cross-validation across 65 features (20 baseline AFCARS fields plus 45 engineered features including interaction terms, severity composites, and nonlinear transformations). The weighted ensemble outperformed all individual models and alternative combination strategies including stacking and rank averaging.

Ensemble Architecture

Weights were optimized through 50 Optuna trials and 5-fold stacking cross-validation:

ModelWeightKey Hyperparameters
XGBoost35%822 rounds, depth 13, lr 0.046
LightGBM35%1,911 rounds, depth 13, 377 leaves
CatBoost20%1,387 iterations, depth 12
MLP10%scikit-learn neural network

Individual Model Evaluation

Each model was independently tuned with 50 Optuna trials and 5-fold cross-validation. LightGBM narrowly emerged as the strongest standalone performer, with XGBoost close behind. The MLP, while outperforming the baseline, reflected the relative difficulty of capturing feature interactions with a shallow neural architecture compared to tree-based methods.

ModelROC-AUCAvg PrecisionF1 Score
Baseline XGBoost (20 features)0.9041N/AN/A
Tuned XGBoost (65 features)0.91960.85990.7815
Tuned LightGBM (65 features)0.92000.86050.7823
Tuned CatBoost (65 features)0.91660.85480.7773
MLP (65 features)0.90810.84060.7442

Ensemble Model Evaluation

We evaluated several ensemble strategies to determine the most effective way to combine predictions. While all approaches produced strong results, the weighted averaging method consistently achieved the best overall performance, slightly outperforming stacking and simple averaging. Compared to stacking, the weighted approach provided a more stable and interpretable aggregation while effectively leveraging the strengths of each base learner.

Ensemble MethodROC-AUCAvg PrecisionF1 Score
Weighted Average (XGB 35%, LGB 35%, CAT 20%, MLP 10%)0.92050.86150.7836
Stacking (Logistic Regression meta-learner, 5-fold)0.92030.86120.7722
Simple Average (3 GBDT only)0.92010.86090.7827
Rank Average0.91910.85930.7781
Simple Average (all 4 models)0.91900.85880.7796

Performance Summary

0.9205 ROC-AUC
0.8615 Avg Precision
0.784 F1 Score
92% Recall

We optimized for high recall because missing a child at risk is worse than a false alarm.

Features (65 total)

20 baseline features from AFCARS + 45 engineered features including: age², disability severity composite, abuse severity composite, substance abuse score, LOS ratio, interaction terms (age × disability, removals × abuse), one-hot placement types, permanency goals, and removal reason flags.

Top Feature Importances (Ensemble-Level)

Feature importance analysis reveals that placement trajectory variables—length of stay, placement type, and prior removals—dominate predictive signal. The relative importance of these core features remained consistent across models, suggesting that predictive performance was driven by effectively capturing a stable set of high-signal variables.

RankFeatureImportanceInterpretation
1Length of stay (latest removal)13.0%Longer time in current removal episode elevates risk
2Length of stay (current setting)12.5%Extended stay in a single setting signals potential stagnation
3Placement type (group home)11.0%Group home placements carry highest disruption rates
4Placement type (residential)9.5%Residential/institutional care is high-risk
5Total prior removals5.2%Repeated removals compound instability
6Age at removal4.8%Older children face more placement challenges
7Disability severity composite3.8%Multiple disabilities increase placement strain
8Behavioral indicators3.8%Behavioral needs are a key disruption driver
9Abuse severity composite3.5%Severity of maltreatment history compounds risk
10Placement type (foster family)3.5%Traditional foster care — moderate baseline risk
11Placement type (kinship)3.2%Kinship placements are more protective
12Age² (quadratic)3.2%Captures nonlinear age effects (teens at highest risk)

SHAP Explainability

A risk score alone is not enough—caseworkers need to understand why a child is flagged to trust and act on the prediction. We integrate SHAP (SHapley Additive exPlanations) to decompose every prediction into per-feature contributions. Each factor is shown with its direction and magnitude: red for risk-increasing drivers (e.g., multiple prior placements, group home setting) and green for protective factors (e.g., kinship care, short length of stay). Workers can verify the model's reasoning against their own clinical knowledge, building the trust necessary for adoption in high-stakes environments.

AI Case Assistant

Beyond prediction, CareAssist integrates a containerized Ollama/Llama 3.2-based AI assistant backed by a SQL data layer. Caseworkers can ask natural-language questions like "Which of my cases have the highest risk?" or "What are the active flags for Child X?" and receive grounded, data-driven answers pulled directly from the case database—not hallucinated text. The assistant is restricted to queryable data, ensuring responses are always factual and auditable.

The Application: Four Roles, One Platform

CareAssist is a production-grade, full-stack web application built with Angular 17 (TypeScript, standalone components, lazy-loaded routes) on the frontend, FastAPI (Python, async, Pydantic schemas) on the backend, and SQLite via SQLAlchemy async ORM (swappable to PostgreSQL) as the data layer. It is deployed on AWS (Lambda, API Gateway, S3, CloudFront) and serves four distinct user roles, each with a tailored experience:

Social Worker

The core user. A command-center dashboard surfaces flagged cases with ML-generated risk scores, urgency alerts, and a sortable caseload table. Clicking any case opens a detail view with four tabs: Overview (risk gauge, 6-month trend chart, SHAP explanation panel, active flags with severity and recommendations, and a sibling linkage map), Timeline (color-coded placement history), Family (interactive family tree with contact information and safety badges), and Notes (shared notes with foster parents plus official case documentation). The AI assistant is accessible from every page.

Supervisor

A team-level overview dashboard showing aggregate metrics across all social workers: total cases, flagged cases, average caseload, and workload distribution. Supervisors can expand any worker's card to review their individual caseload, click into any case for the full detail view including SHAP, and access team-level reports and the AI assistant.

Foster Parent

A Family Portal designed for caregivers, not case managers. Foster parents see their children's profiles, need tags (medical, behavioral, disability), and can message their social worker, upload documents, and access a curated resource directory covering training, support groups, financial assistance, healthcare, and legal rights. No risk scores or ML outputs are shown; access is scoped to what caregivers need.

Aged-Out Youth

A self-advocacy portal for young adults who have aged out of care. When youth turn 18, they often lose access to their own records. CareAssist gives them a unified timeline of their medical history, school records, and placement history, plus a resource hub for transitional housing, job training, education grants, extended Medicaid, and legal aid. They can stay connected with former caseworkers and mentors through built-in messaging.

Evaluation

Methodology

We evaluated model performance using AUC-ROC, recall, precision, average precision (AP), and F1 score to capture both overall predictive power and practical decision-making performance. AUC-ROC assessed the model's ability to distinguish between disruption and non-disruption cases across all thresholds, while AP summarized precision-recall performance in the presence of class imbalance. We intentionally prioritized recall over precision—a decision directly informed by practicing social workers, including veteran worker Otoya Brown, who emphasized that no child should fall through the cracks. A missed at-risk case carries far greater consequences than an extra review. The final operating threshold was validated in collaboration with caseworkers to balance sensitivity with operational feasibility.

Results

Our final model, a weighted ensemble of tuned tree-based methods, achieved the strongest overall performance: AUC 0.9205, 92% recall, and 63% precision. This represents a clear improvement over the baseline XGBoost model (AUC 0.9041, 91% recall, 58% precision) and outperformed all individual models and alternative ensemble strategies including stacking and rank averaging.

ModelFeaturesAUC-ROCRecallPrecisionF1 Score
Baseline XGBoost200.904191%58%N/A
Final Weighted Ensemble650.920592%63%0.7836

Performance gains were driven by combining complementary model signals rather than increasing complexity within a single model. Predictive power was concentrated in a stable set of high-impact features, with prior placements emerging as the dominant driver of disruption risk. SHAP explanations expose these drivers at the individual case level, making each prediction transparent and actionable.

Key Learnings

Human-centered evaluation is essential.

Traditional metrics alone are insufficient for real-world deployment. Determining the right balance between false positives and false negatives required close collaboration with social workers, who helped us calibrate the system to flag high-risk cases without overwhelming workers with unnecessary alerts.

Responsible AI requires built-in safeguards.

Deploying ML in foster care demands careful attention to access control and AI safety. We implemented role-based access aligned with the principle of least privilege—foster parents never see risk scores; youth see only their own records. The AI assistant is restricted to grounded, queryable data to prevent hallucination in a domain where inaccurate information could directly harm children.

Usability drives impact.

Model performance means nothing if practitioners cannot act on it. Every prediction is paired with a SHAP explanation, every flag includes a recommended action, and the AI assistant lets workers query their caseload in plain English. CareAssist bridges the gap between statistical performance and real-world decision-making.

Impact

There are over 600 child welfare agencies in the U.S. and more than $30 billion spent annually on child welfare programs, yet existing systems are built for compliance and reporting—not for identifying risk proactively. CareAssist reimagines casework as a proactive, insight-driven practice. By surfacing AI-driven early warnings with transparent explanations, the system has the potential to increase placement stability, reduce time in care, improve conditions for successful reunification, and—most importantly—ensure that no child falls through the cracks.

Acknowledgements

We would like to express our sincere gratitude to our capstone instructors, Joyce Shen and Zona Kostic, for their invaluable feedback, support, and guidance throughout this project.

We also extend a heartfelt thank you to the social workers, particularly Otoya Brown and her network; the foster parents, Amanda and Christopher Townsend; the current and former foster children, including Hannah and Hailey Townsend and Nichol Flowers; and the various government leaders and supervisors, including Robert Metthe, for their continued support and interest in this application. Their perspectives helped ensure this decision-support tool is grounded in real-world needs.

Looking ahead, our goal is for CareAssist to become the system of record for the U.S. foster care system — one intelligent platform serving the over 400,000 children in care nationwide.

Lastly, we are deeply appreciative of our team for their dedication, collaboration, and commitment to building a solution with meaningful impact.

Last updated: April 22, 2026