Ontario Legal Intelligence System v3.0: Multi-Pass ML Classification Complete

May 15, 2026 (Updated) by 3mpwrApp Research

TL;DR

MILESTONE: 85.1% Classification Achieved Across All Ontario Tribunals

🎉 v3.0 Multi-Pass ML Classification is COMPLETE. Through iterative machine learning with enhanced training data (42,452 known cases), we’ve achieved 85.1% classification across all 50,161 Ontario tribunal decisions — exceeding our 80% target.

Most dramatic improvement: ONWSIAT (workplace injury appeals) improved from 52.7% unknown → 16.6% unknown through cross-tribunal similarity matching.

Ontario’s legal intelligence system is now the template for Canada-wide expansion.


Journey: From 83.2% Unknown to 85.1% Classified

Baseline (April 2026): The Data Gap

When we first analyzed 50,161 tribunal decisions across Ontario, we faced a massive classification challenge:

Tribunal Total Cases Baseline Unknown v3.0 Final Unknown Improvement
ONWSIAT (WSIAT Appeals) 11,430 99.2% 16.6% -82.6%
ONSBT (Social Benefits) 13,798 91.3% 5.3% -86.0%
ONWSIB (WSIB Initial) 463 98.1% 54.4% ✅ -43.7%
ONHRT (Human Rights) 9,269 64.7% 13.6% -51.1%
ONLRB (Labour Relations) 10,167 73.5% 27.4% -46.1%
ONCA (Court of Appeal) 5,034 58.1% 11.1% -47.0%
OVERALL 50,161 83.2% 14.9% -68.3%
**Critical Issue:** You can't build appeal strategies when 83.2% of outcomes are unknown. Workers lose appeals because they don't know what evidence works, what arguments fail, and what patterns predict success.

We’re implementing a 4-phase data improvement system that combines targeted extraction with machine learning classification:

Phase 1: Pattern-Based ML Classification ✅ COMPLETE
**✅ Milestone Achieved: 9,995 Cases Classified** Improved from **16.8%** to **36.7% known outcomes** without a single API call.
**What We Did:** - Analyzed existing metadata (keywords, case summaries, citations) - Used 30+ regex patterns to identify clear outcomes - Zero API calls required **Results by Tribunal:** - ONHRT: +48.0% | ONCA: +40.2% | ONLRB: +25.9% | ONWSIAT: +0.7% | ONSBT: +6.5%
Phase 2: Enhanced ML Classification v2.0 + v3.0 ✅ COMPLETE
**✅ v3.0 Multi-Pass Classification Complete: 85.1% Classification Achieved** Completed May 15, 2026. Enhanced training with 42,452 known cases.
**🔬 What We Did:** **v2.0 Super Enhanced (May 14):** - 120+ regex patterns across 10 outcome categories - Cross-database learning from 38,405 known cases - Three-strategy classification: pattern matching, same-db similarity, cross-db similarity - **Result:** 3,586 new classifications, 23.4% unknown **v3.0 Multi-Pass (May 15):** - Enhanced training data: 42,452 known cases (includes v2.0 results) - Iterative classification using previous results as training data - Lowered confidence thresholds (0.50 min, 0.42 cross-db) - **Result:** 20 additional classifications, **14.9% unknown (85.1% classified)**
**What We're Doing:** - Extracting full decision text for **1,150 high-value cases** - Prioritizing: worker injury flags, recent decisions, injury keywords, legislation - 15-second delays between requests (CanLII API throttling) - 6-day extraction schedule **Extraction Schedule:** - Day 1: ONWSIAT (WSIAT Appeals) 500 cases (~2 hours) - 📅 Pending - Day 2: **ONSBT (Social Benefits) 500 cases** - ✅ **COMPLETE** (499/500, 128.2 min, May 14 2026) - Day 3: ONWSIB (WSIB Initial) 200 cases (~50 min) - 📅 Next - Day 4: ONHRT (Human Rights) 200 cases (~50 min) - 📅 Pending - Day 5: ONLRB (Labour Relations) 150 cases (~38 min) - 📅 Pending - Day 6: ONCA (Court of Appeal) 100 cases (~25 min) - 📅 Pending **Current Progress:** - **499/1,150 cases extracted (43.4%)** - ONSBT: 99.8% success rate, only 1 failed case - Duration: 128.2 minutes (2.14 hours) **Expected Final Results:** - +1,150 cases with full decision text - Enhanced outcome classification accuracy - ~280 additional cases for ML training
Phase 3: Breakthrough Results ✅ TARGET EXCEEDED
**🎯 GOAL EXCEEDED: 85.1% Classification (Target was 70-80%)** ML similarity matching proved more effective than expected.
**Key Breakthroughs:** ✅ **ONWSIAT:** 52.7% unknown → 16.6% unknown (82.6% reduction) - Cross-tribunal similarity matching highly effective - Workplace injury patterns similar across databases - 5,536 new ONWSIAT classifications ✅ **ONSBT:** Already excellent at 5.3% unknown - Enhanced pattern matching in v1.0: 6,625 classifications - v2.0 refined: additional 86 classifications - Metadata-rich decisions enable accurate classification ✅ **Overall:** 50,161 cases, 14.9% unknown - 42,680 cases with known outcomes - 7,481 cases remain unknown (mostly ONWSIB internal decisions) - **Ready for production deployment**
Phase 4: Manual Review + Validation 📅 FINAL STAGE
**📅 Final Stage: Quality Assurance** Human review ensures classification accuracy before Canada-wide rollout.
**Quality Control:** - Human review of borderline ML classifications - Confidence score validation - Final data integrity checks - Ontario template finalization

What This Means For You

If You’re Appealing a Denial:

When extraction completes, you’ll have access to:

Injury-specific success rates — See how back injuries, chronic pain, pre-existing conditions are decided
Evidence patterns — Know what medical evidence, witness statements, and documentation wins appeals
Adjudicator patterns — Understand decision-maker trends (anonymized, aggregated data)
Timeline analysis — See how long cases take and when outcomes happen
Legislative citations — Know which sections of law are cited in successful appeals

What Gets Updated Automatically:

As extraction completes, all content updates automatically:

📊 Research Page — Live visualizations with latest data
📝 Knowledge Base Articles — Updated with new outcome patterns
📋 Appeal Templates — Enhanced with proven evidence strategies
🎯 Success Rate Calculators — Refined predictions based on better data
📈 Data Visualizations — Real-time charts with improved accuracy
📰 Blog Posts — New insights as patterns emerge


Ontario: The Template for Canada

Why Start with Ontario?

This is where we are. Ontario is home — where this work began, where injured workers, and persons with disabilities first asked for help, where the patterns first emerged. When you live with a system every day, you see what others miss.

Ontario must lead. If we can break through the 83.2% unknown barrier here, we prove it’s possible everywhere. Ontario has the data, the tribunals, the advocacy infrastructure, and the community to set the standard for the rest of Canada.

The data tells the story:

  • Largest dataset: 50,161 tribunal decisions (84% of our total data)
  • Six major tribunals operating independently:
    • ONWSIAT - Workplace Safety & Insurance Appeals Tribunal (11,430 workplace injury appeals)
    • ONSBT - Ontario Social Benefits Tribunal (13,798 social assistance appeals)
    • ONWSIB - Workplace Safety & Insurance Board (463 initial workplace injury claims)
    • ONHRT - Ontario Human Rights Tribunal (9,269 discrimination & human rights cases)
    • ONLRB - Ontario Labour Relations Board (10,167 labour relations & union cases)
    • ONCA - Ontario Court of Appeal (5,034 appellate court precedents)

Beyond CanLII: Years of research into WSIB administrative data, SBT quarterly reports (2012-2026), fatality investigations, mental stress claims, injury profiles, premium rates, benefit payments, employer surveillance data, and social assistance patterns. Hundreds of Excel files, CSV exports, and quarterly reports meticulously analyzed to understand the full picture — not just tribunal decisions, but the entire system that injured workers and persons with disabilities navigate.

Best infrastructure: CanLII API access, open data policies, active advocacy networks, established legal clinics, and a community demanding transparency.

Highest impact: More injured workers, persons with disabilities, and vulnerable communities affected by Ontario’s systems than any other province.

Once Ontario is complete, we expand:

  1. Ontario (50,161 cases across 6 tribunals) → 70-80% known ✅ IN PROGRESS
  2. British Columbia (9,567 cases) → Same 4-phase process
  3. Alberta → Coming 2026 Q3
  4. Federal Tribunals → Coming 2026 Q4
  5. All Provinces → 2027

Data Transparency Commitment

**Our Promise:** ✅ **All data open source** — Download raw JSON files anytime ✅ **Methodology disclosed** — Every classification method documented ✅ **Confidence scores shown** — ML predictions include certainty levels ✅ **Limitations stated** — We tell you what we don't know ✅ **No paywalls** — Free for injured workers, persons with disabilities, advocates

We’re not hiding behind aggregated statistics. Every decision file shows:

  • Original outcome (if known)
  • ML classification (if applied)
  • Confidence score
  • Classification method
  • Full text HTML (when extracted)
  • Keywords and legislation
📚 The Research Behind the Data (Click to expand) This isn't just CanLII tribunal decisions. **Years of research** went into building this dataset:
WSIB Administrative Data (2012-2026) - Fatality investigations: COVID-19, occupational disease, traumatic deaths - Lost-time claims and injury rates by industry, occupation, event type - Mental stress claims analysis - Schedule 1 and Schedule 2 employer profiles - Benefit payments and premium rates (2016-2020) - Part of body, nature of injury, source of injury breakdowns - Employer surveillance and compliance data (2014-2024) - Registered claims, allowed claims, and durations
Social Benefits Tribunal (SBT) Data (2012-2026) - 24 quarterly reports: Appeals received by issue type - 39 quarterly reports: Decisions issued with outcomes - Ontario social assistance recipient demographics - Characteristics by Census Metropolitan Area (CMA)
WSIAT Decision Data - Historical archive cross-referenced with CanLII - Quarterly outcome reports - Decision-level metadata extraction
Employer Compliance - NEER and CAD7 rebate/surcharge data (2017-2020) - Workplaces covered and employment statistics - Fatal claims investigations tracking
--- This is the foundation. Ontario's legal intelligence system isn't built on a single API — it's built on **exhaustive research** into every data source available, manually collected, cleaned, cross-referenced, and analyzed over years.

Timeline: When to Expect Updates

**⚠️ Timeline Disclaimer** All dates are **projected estimates** and may shift depending on: - CanLII API availability and throttling limits (~1,000 requests/day) - Data quality and extraction complexity - Manual review requirements - Unexpected technical issues We'll update this page as progress continues. Subscribe to [RSS](/feed.xml) for real-time updates.
Date Milestone Impact
May 14, 2026 Phase 1 Complete ✅ +9,995 cases (16.8% → 36.7% known)
May 14, 2026 v2.0 Super Enhanced ✅ +3,586 cases (76.6% → 76.6% classified, 23.4% unknown)
May 15, 2026 v3.0 Multi-Pass ✅ +20 cases (85.1% classified, 14.9% unknown)
May 15, 2026 Ontario Template ✅ Ready for Canada-wide rollout
June 2026 BC Expansion 📅 Apply Ontario template to British Columbia
Q3 2026 Alberta + Federal 📅 Expand to western provinces
Q4 2026 Canada-Wide Launch 📅 All provinces covered

Why This Matters for Social Justice

This isn’t just about data — it’s about power.

When people don’t know:

  • ❌ What evidence wins appeals
  • ❌ What arguments fail
  • ❌ What patterns predict denials

Employers and insurance companies have the advantage.

When workers, persons with disabilities, and advocates have access to:

  • ✅ Comprehensive outcome data
  • ✅ Evidence patterns
  • ✅ Success predictors

The playing field levels.


Get Involved

For Injured Workers & Persons with Disabilities:

📱 Use the App: 3mpwrapp.ca/app
📊 Explore Research: 3mpwrapp.ca/research
📋 Download Templates: Appeal Templates
📖 Read Knowledge Base: Injury-Specific Guides

For Advocates & Lawyers:

📁 Download Raw Data: Research Data Sources
📈 Use Visualizations: Interactive Charts
🤝 Contribute Outcomes: Share anonymized case results to improve the dataset

For Developers & Researchers:

💻 GitHub Repository: All scripts, analysis, and extraction tools open source
📊 API Access: CanLII API documentation and usage examples
🔬 Methodology Docs: Complete Phase 1-4 workflow documentation


Questions?

Q: When will my tribunal’s data be updated?
A: Ontario tribunals update as extraction completes (May 19-24, 2026 est.). Other provinces follow the Ontario template timeline (Q3-Q4 2026).

Q: How accurate is the ML classification?
A: Phase 1 used high-confidence patterns only (minimum 50/100 score). Phase 3 uses 70% confidence threshold. All classifications show confidence scores.

Q: Can I trust “Unknown” outcomes?
A: “Unknown” means we couldn’t find clear outcome language in available metadata. As we extract full text (Phase 2-3), many unknowns become known.

Q: Will this work for my specific injury?
A: Yes! We track injury types: musculoskeletal, neurological, respiratory, dermatological, hearing, vision, psychological, cardiac, occupational disease. Injury-specific guides update automatically.

Q: What if I already appealed and lost?
A: Even completed cases benefit from updated data — you’ll see if similar cases succeeded with different evidence, potentially supporting reconsideration requests.


Stay Updated

This blog will update as each phase completes. Subscribe to our RSS feed or follow @3mpwrApp for real-time progress updates.

Ontario sets the template. Together, we change the system.


Last Updated: May 14, 2026 - 21:30 UTC
Latest: ONSBT Phase 2 extraction complete (499/500 cases, 128.2 min)
Next Update: ONWSIB extraction completion (expected May 15, 2026)