OSHA intelligence platform for workers' comp underwriting — employer risk assessment, pricing triage, and renewal monitoring powered by public safety data and ML analytics.
Find any employer in our database of 4,000,000+ establishments with fuzzy name matching powered by PostgreSQL trigram similarity. Filter by state, city, ZIP code, or NAICS industry.
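For readers unfamiliar with trigram matching, here is a minimal pure-Python sketch of the idea. It approximates the padding and scoring rules documented for PostgreSQL's pg_trgm extension; the function names are illustrative, and in the platform the comparison presumably runs inside the database rather than in application code.

```python
def trigrams(text: str) -> set[str]:
    """Return the set of padded, lowercased trigrams for every word in text."""
    grams = set()
    # Treat any non-alphanumeric character as a word separator.
    cleaned = "".join(ch.lower() if ch.isalnum() else " " for ch in text)
    for word in cleaned.split():
        padded = f"  {word} "  # two leading spaces, one trailing space
        grams.update(padded[i:i + 3] for i in range(len(padded) - 2))
    return grams


def trigram_similarity(a: str, b: str) -> float:
    """Shared trigrams divided by all distinct trigrams across both strings."""
    ta, tb = trigrams(a), trigrams(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)
```

Identical names (ignoring case and punctuation) score 1.0, while a name with an extra word such as "Inc" scores somewhat lower, which is what lets a search tolerate small spelling and suffix differences.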
Complete inspection history, violation details, cited safety standards, and penalty amounts sourced from the DOL API. Over 5.2M inspections and 13.2M violations. Updated nightly with incremental syncs.
Establishment-specific data from OSHA's Injury Tracking Application (ITA), including Total Recordable Incident Rate (TRIR), DART rate, employee counts, hours worked, and annual trends. 1.4M ITA summaries from 2016 to present.
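Both rates follow the standard OSHA incidence-rate formula, which normalizes case counts to 200,000 hours (100 full-time employees working a year). The sketch below shows the arithmetic; the function name and the sample figures are illustrative only.

```python
def incident_rate(cases: int, hours_worked: float) -> float:
    """Cases per 200,000 hours worked (100 full-time-equivalent employees)."""
    if hours_worked <= 0:
        raise ValueError("hours_worked must be positive")
    return cases * 200_000 / hours_worked


# Illustrative establishment: 5 recordable cases, 2 of which involved
# days away, restricted duty, or job transfer, over 250,000 hours worked.
trir = incident_rate(5, 250_000)  # 4.0
dart = incident_rate(2, 250_000)  # 1.6
```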
103K+ hospitalizations, amputations, loss-of-eye incidents, and fatalities reported to OSHA since 2015. Includes event descriptions and outcome details.
Automated 0-100 risk score combining eight weighted components — violation severity, injury rates, severe incidents, enforcement frequency, penalty magnitude, financial health, litigation risk, and news sentiment — each percentile-ranked within the employer's NAICS cohort. Letter grades A-F.
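As a rough illustration of how a weighted, percentile-based composite could be assembled, consider the sketch below. The weights, the grade cutoffs, and the assumption that a higher score means higher risk are all hypothetical; the platform's actual values are not stated here.

```python
# Hypothetical weights in percentage points (they sum to 100).
WEIGHTS = {
    "violation_severity": 20,
    "injury_rates": 20,
    "severe_incidents": 15,
    "enforcement_frequency": 10,
    "penalty_magnitude": 10,
    "financial_health": 10,
    "litigation_risk": 10,
    "news_sentiment": 5,
}


def composite_score(percentiles: dict[str, float]) -> float:
    """Weighted mean of 0-100 cohort percentiles, renormalized over the
    components that are actually present (an assumption for missing data)."""
    present = {k: v for k, v in percentiles.items() if k in WEIGHTS}
    total_weight = sum(WEIGHTS[k] for k in present)
    if total_weight == 0:
        raise ValueError("no recognized components supplied")
    return sum(WEIGHTS[k] * v for k, v in present.items()) / total_weight


def letter_grade(score: float) -> str:
    """Map a 0-100 score to A-F using hypothetical 20-point bands."""
    for cutoff, grade in ((20, "A"), (40, "B"), (60, "C"), (80, "D")):
        if score < cutoff:
            return grade
    return "F"
```

An employer sitting at the 50th percentile on every component would score 50.0 and receive a "C" under these illustrative bands.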
Automated red/yellow/green flag system covering fatalities, willful violations, repeat offenders, high penalties, failure-to-abate, escalating patterns, above-average TRIR, bankruptcy filings, and poor credit scores.
Two-year directional indicators for violations, penalties, severe injuries, TRIR, and DART rate. Instantly see whether an employer's safety posture is improving, stable, or worsening.
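One simple way to derive a directional indicator is to compare the two most recent annual values against a tolerance band. The sketch below assumes metrics where lower is better and uses a hypothetical 10% tolerance; the platform's actual rule may differ.

```python
def trend_direction(prior: float, current: float, tolerance: float = 0.10) -> str:
    """Classify the year-over-year change for a lower-is-better metric."""
    if prior == 0:
        # No baseline to divide by: any new activity counts as worsening.
        return "stable" if current == 0 else "worsening"
    change = (current - prior) / prior
    if change > tolerance:
        return "worsening"
    if change < -tolerance:
        return "improving"
    return "stable"
```

For example, a TRIR that falls from 4.0 to 3.0 is a 25% drop and reads as "improving", while a move from 4.0 to 4.2 stays inside the band and reads as "stable".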
Intelligent matching engine links records across OSHA enforcement, ITA, and Severe Injury Report (SIR) data sources to canonical employer profiles using trigram pre-filtering and Jaro-Winkler scoring. Human-in-the-loop review queue for low-confidence matches.
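To make the scoring step concrete, here is a self-contained sketch of the textbook Jaro-Winkler similarity, followed by a routing helper for the review queue. The routing thresholds (0.92 and 0.80) and the name `route_match` are hypothetical; the source does not state the cutoffs used.

```python
def jaro(s1: str, s2: str) -> float:
    """Jaro similarity between two strings, in [0, 1]."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1, matched2 = [False] * len1, [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        # Look for an unused matching character within the sliding window.
        for j in range(max(0, i - window), min(len2, i + window + 1)):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    seq1 = [c for c, m in zip(s1, matched1) if m]
    seq2 = [c for c, m in zip(s2, matched2) if m]
    # Half the number of matched characters that appear out of order.
    transpositions = sum(a != b for a, b in zip(seq1, seq2)) // 2
    return (matches / len1 + matches / len2
            + (matches - transpositions) / matches) / 3


def jaro_winkler(s1: str, s2: str, prefix_scale: float = 0.1) -> float:
    """Boost the Jaro score for a shared prefix of up to four characters."""
    score = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1[:4], s2[:4]):
        if a != b:
            break
        prefix += 1
    return score + prefix * prefix_scale * (1 - score)


def route_match(score: float, auto_accept: float = 0.92,
                review_floor: float = 0.80) -> str:
    """Hypothetical triage: accept, queue for human review, or reject."""
    if score >= auto_accept:
        return "auto_accept"
    return "review" if score >= review_floor else "reject"
```

On the classic test pair "MARTHA" and "MARHTA" this yields roughly 0.961, which the illustrative thresholds would accept automatically.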
Underwriter-facing pages and analytics that deliver immediate value — hazard profiling, peer benchmarking, forward risk assessment, portfolio monitoring, and PDF export.
Redesigned employer profile with sidebar navigation, executive summary card, 6 KPI cards with trend arrows, risk component breakdown, and a merged recent activity feed across all data sources.
Violations grouped into ~30 hazard categories (fall protection, lockout/tagout, machine guarding, etc.) weighted by severity and recency. Maps each hazard to likely workers' comp injury types and average claim costs.
Percentile-ranked comparison across TRIR, DART, penalty rates, and inspection frequency against NAICS peer groups. Includes credibility notes on data coverage and peer cohort size.
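A percentile rank against a peer cohort can be computed as sketched below. This version uses the common "mean rank" convention, where ties count as half below and half above; which convention the platform uses is not stated, so treat this as one reasonable choice.

```python
from bisect import bisect_left, bisect_right


def percentile_rank(value: float, peers: list[float]) -> float:
    """Percent of peer values below `value`, counting ties as half."""
    if not peers:
        raise ValueError("peer cohort is empty")
    ordered = sorted(peers)
    below = bisect_left(ordered, value)
    equal = bisect_right(ordered, value) - below
    return 100.0 * (below + 0.5 * equal) / len(ordered)
```

With peer TRIR values of 1, 2, 3, 4, and 5, an employer at 3.0 lands at the 50th percentile. Small cohorts make this figure noisy, which is why the credibility notes on cohort size matter.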
12-month probability estimates for new inspections, serious violations, high penalties, and SIR events based on employer history and NAICS base rates. Recidivism pattern analysis detects repeat-standard citations.
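The repeat-standard part of this analysis can be illustrated with a short sketch: group citations by standard and keep those cited in more than one inspection. The input shape (pairs of inspection ID and standard code) is an assumption made for the example.

```python
from collections import defaultdict


def repeat_standards(citations: list[tuple[int, str]]) -> dict[str, int]:
    """Map each standard cited in two or more distinct inspections to the
    number of inspections that cited it."""
    inspections_by_standard = defaultdict(set)
    for inspection_id, standard in citations:
        inspections_by_standard[standard].add(inspection_id)
    return {
        standard: len(ids)
        for standard, ids in inspections_by_standard.items()
        if len(ids) > 1
    }


# Illustrative data: 1910.147 (lockout/tagout) is cited in two inspections,
# while 1910.212 (machine guarding) is cited twice within a single one.
sample = [(1, "1910.147"), (2, "1910.147"), (2, "1910.212"), (2, "1910.212")]
```

Counting distinct inspections rather than raw citations keeps multiple instances within one inspection from being mistaken for recidivism.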
One-page PDF summary with configurable sections — risk score, enforcement history, injury rates, peer benchmarks, hazard profile, and forward risk. JSON export for actuarial teams. Server-side generation via ReportLab.
Automatic monitoring for quoted and bound accounts. New inspections, serious violations, SIR events, and risk score changes are flagged on your portfolio dashboard daily. Add any employer or organization to your portfolio directly from their profile page.
Auto-generated investigation questions based on each employer's hazard profile, risk drivers, and anomaly flags — e.g., 'What LOTO program is in place?' or 'Request OSHA 300 logs to verify low reported injury rates.'
Expanded defensible flag system: RED (fatality, willful, repeat, high penalty, failure-to-abate), YELLOW (hospitalization, high TRIR, escalating, underreporting suspicion), GREEN (clean history, below-median TRIR, full abatement compliance).
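A rule-based flag system of this kind might look like the sketch below. It is a simplification: field names and the penalty cutoff are hypothetical, the "escalating" and "underreporting suspicion" rules are omitted, and GREEN is treated as the default when no trigger fires rather than checked against the positive criteria listed above.

```python
HIGH_PENALTY_THRESHOLD = 100_000  # hypothetical dollar cutoff


def classify(facts: dict) -> tuple[str, list[str]]:
    """Return (color, reasons); RED outranks YELLOW, which outranks GREEN."""
    red = [
        label
        for label, hit in (
            ("fatality", facts.get("fatalities", 0) > 0),
            ("willful", facts.get("willful_violations", 0) > 0),
            ("repeat", facts.get("repeat_violations", 0) > 0),
            ("high penalty", facts.get("max_penalty", 0) >= HIGH_PENALTY_THRESHOLD),
            ("failure-to-abate", facts.get("failure_to_abate", False)),
        )
        if hit
    ]
    if red:
        return "RED", red
    trir, median = facts.get("trir"), facts.get("peer_median_trir")
    yellow = []
    if facts.get("hospitalizations", 0) > 0:
        yellow.append("hospitalization")
    if trir is not None and median is not None and trir > median:
        yellow.append("high TRIR")
    if yellow:
        return "YELLOW", yellow
    return "GREEN", ["no red or yellow triggers"]
```

Returning the reasons alongside the color is what makes a flag defensible: an underwriter can see exactly which rule fired.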
Calibrated ML risk scoring, NLP hazard extraction, anomaly detection, user authentication, safety news intelligence, and financial data integration.
Calibrated LightGBM + Logistic Regression ensemble predicting the probability of a material adverse OSHA event within 12 months using 30+ features. SHAP-based explainability shows the top 10 risk drivers per employer. Optuna hyperparameter tuning with isotonic calibration. Retrained monthly, with nightly batch prediction.
Aggregate risk scores across all establishments linked to an organization. Location-level feature aggregation with concentration risk analysis (worst vs. average location). Org-level ML predictions stored separately.
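The worst-versus-average comparison can be sketched in a few lines. The function name and the choice to report the gap as a simple difference are assumptions; the source does not say how concentration is quantified.

```python
from statistics import mean


def concentration_risk(location_scores: list[float]) -> dict[str, float]:
    """Summarize how far the worst location sits above the org average."""
    if not location_scores:
        raise ValueError("organization has no scored locations")
    worst = max(location_scores)
    average = mean(location_scores)
    return {"worst": worst, "average": average, "gap": worst - average}
```

An organization with location scores of 20, 30, and 70 averages 40 but has a 30-point gap to its worst site, a pattern that an org-level average alone would hide.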
Real-time news search via GNews.io with VADER sentiment analysis and keyword relevance scoring. Integrates into the composite risk score as a weighted component. DB caching with configurable TTL and URL-based deduplication.
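URL-based deduplication typically relies on reducing each link to a canonical form first. The sketch below uses only the standard library; the normalization rules (dropping `utm_` tracking parameters, fragments, and trailing slashes) and the assumption that each article is a dict with a `url` key are illustrative choices, not confirmed details of the platform.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def canonical_url(url: str) -> str:
    """Normalize a URL so trivially different links compare equal."""
    parts = urlsplit(url.strip())
    # Drop tracking parameters and sort the rest for a stable ordering.
    query = sorted(
        (k, v) for k, v in parse_qsl(parts.query)
        if not k.lower().startswith("utm_")
    )
    path = parts.path.rstrip("/") or "/"
    return urlunsplit(
        (parts.scheme.lower(), parts.netloc.lower(), path, urlencode(query), "")
    )


def dedupe(articles: list[dict]) -> list[dict]:
    """Keep the first article seen for each canonical URL."""
    seen, unique = set(), []
    for article in articles:
        key = canonical_url(article["url"])
        if key not in seen:
            seen.add(key)
            unique.append(article)
    return unique
```

Under these rules, `https://Example.com/story/?utm_source=x&id=7#top` and `https://example.com/story?id=7` collapse to the same key and only the first is kept.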
On-demand credit report pulls with score tracking, public filings count, and financial health indicators. Cached with configurable TTL. Feeds into composite risk score as a financial health component.
JWT-based authentication with role-based access control (user, manager, admin). Rate-limited login, HttpOnly secure cookies, and persistent portfolios tied to user accounts.
Structured activity logging for compliance: login events, failed auth attempts, match decisions, data exports, password changes, and user management actions.
Admin-only smoke test page monitoring database health, Redis status, Celery workers, data pipeline freshness, external API connectivity, data completeness checks, security configuration, and integration tests. Auto-refreshes every 30 seconds.
Extract latent hazard themes from SIR event descriptions and accident investigation narratives using TF-IDF topic modeling and sentence embeddings. Surfaces risk signals that structured standard codes miss.
Isolation forest detects employers whose metrics are anomalous relative to their true peer groups: suspiciously low TRIR with high violations (underreporting), violation rate spikes, penalty outliers, and abatement failure clusters.
Causal impact studies, survival analysis, FMCSA carrier data, carrier API, bulk scoring, corporate hierarchy rollups, and geographic risk clustering.
Full FMCSA carrier data and DOT number lookup — safety ratings, BASIC percentile scores, crash history, inspection results, violation records, and out-of-service rates. Linked to OSHA establishments for cross-system risk analysis.
Quasi-experimental analysis using propensity score matching and difference-in-differences to estimate whether abatement completion actually reduces future adverse outcomes. Results displayed per employer and per NAICS cohort.
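The difference-in-differences step reduces to simple arithmetic once comparable groups exist. The sketch below assumes the propensity score matching has already produced the treated (abatement completed) and control groups, and the sample outcome values are made up for illustration.

```python
from statistics import mean


def did_estimate(treated_pre: list[float], treated_post: list[float],
                 control_pre: list[float], control_post: list[float]) -> float:
    """Change in the treated group minus change in the control group."""
    treated_change = mean(treated_post) - mean(treated_pre)
    control_change = mean(control_post) - mean(control_pre)
    return treated_change - control_change


# Illustrative violation counts per employer, before and after abatement.
effect = did_estimate(
    treated_pre=[4, 6], treated_post=[2, 4],
    control_pre=[5, 5], control_post=[4, 6],
)  # -2
```

A negative estimate means the treated group improved by more than the control group over the same period. The estimate is only credible if the two groups would have followed parallel trends without the treatment.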
Cox proportional hazards and Random Survival Forest models for time-to-next-event prediction. Competing risks framework separates serious violations, SIR events, and fatality/catastrophe (FAT/CAT) inspections. Calibrated 12-month risk probabilities.
Documented, versioned REST API for carrier rating engines to query risk scores, hazard profiles, and ML predictions. Rate-limited and authenticated with usage tracking.
Upload a CSV of employer names and addresses to batch-process through entity resolution and scoring. Returns a downloadable report with risk scores, red flags, and peer percentiles for every matched employer.
Parent-company detection clusters multi-site employers by normalized name and NAICS code. Aggregate corporate risk scores across all linked establishments for enterprise-level underwriting.
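Clustering by normalized name and NAICS code could work roughly as sketched below. The suffix list and the input shape (establishment ID, name, NAICS code) are assumptions for illustration; a production normalizer would likely handle many more variants.

```python
import re
from collections import defaultdict

# Hypothetical list of legal suffixes to strip from the end of a name.
SUFFIXES = {"inc", "incorporated", "llc", "corp", "corporation",
            "co", "company", "ltd", "lp"}


def normalize_name(name: str) -> str:
    """Lowercase, drop punctuation, and strip trailing legal suffixes."""
    tokens = re.sub(r"[^a-z0-9 ]", " ", name.lower()).split()
    while tokens and tokens[-1] in SUFFIXES:
        tokens.pop()
    return " ".join(tokens)


def cluster_sites(sites: list[tuple[int, str, str]]) -> dict[tuple[str, str], list[int]]:
    """Group establishment IDs by (normalized name, NAICS code)."""
    clusters = defaultdict(list)
    for establishment_id, name, naics in sites:
        clusters[(normalize_name(name), naics)].append(establishment_id)
    return dict(clusters)
```

With this normalizer, "Acme Steel, Inc." and "ACME STEEL CO" under the same NAICS code fall into one cluster keyed by "acme steel".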
PostGIS-powered spatial analysis of regional hazard concentration, enforcement intensity corridors, and state-plan vs federal-plan patterns. Heatmap overlays on employer profiles.