
Insurance Claim Prediction Pipeline

vijayashreev2002
March 22, 2026

About

Health Insurance Claim Risk Prediction using LightGBM

A machine learning solution that predicts the probability that an insurance customer will file a health insurance claim. It is built with LightGBM to handle highly imbalanced data (a 95:5 ratio of non-claims to claims) and is optimized for the PR-AUC metric. It includes comprehensive EDA, stratified validation, and an interactive prediction system for risk assessment.


Tech Stack: Python, LightGBM, Scikit-learn, Pandas, Matplotlib

Metric: PR-AUC (Precision-Recall Area Under Curve)
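
To make the metric concrete, here is a small dependency-free sketch of average precision, a common way to estimate the area under the precision-recall curve. The function name and the toy labels and scores are hypothetical, and the code assumes binary 0/1 labels with no tied scores.

```python
def average_precision(y_true, scores):
    """Average precision, assuming binary labels (0/1) and distinct scores."""
    n_pos = sum(y_true)
    if n_pos == 0:
        raise ValueError("need at least one positive label")

    # Rank examples from highest to lowest predicted score.
    ranked = sorted(zip(scores, y_true), key=lambda pair: pair[0], reverse=True)

    hits = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            # Precision at the rank where this positive is retrieved.
            precision_sum += hits / rank

    # Mean of the precision values taken at each positive's rank.
    return precision_sum / n_pos


y_true = [1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.3, 0.1]
ap = average_precision(y_true, scores)
print(round(ap, 4))  # 0.8333
```

The two positives sit at ranks 1 and 3, giving precisions of 1/1 and 2/3, whose mean is about 0.8333. For context, a model that ranks at random scores close to the positive rate, which is about 0.05 on data with a 95:5 imbalance, so PR-AUC values should be read against that baseline rather than against 0.5.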
