Subject: Partnership Request: Improving Tribunal Outcome Metadata for Vulnerable Communities

To: feedback@canlii.org
CC: info@canlii.org
Date: April 29, 2026

Dear CanLII Team,

I am writing to request your assistance in addressing a significant data quality issue affecting vulnerable communities across Canada: the lack of standardized outcome metadata in workers' compensation and human rights tribunal decisions published on CanLII.

As developers of an open-source legal aid platform serving injured workers and persons with disabilities, we have discovered that **95.8% of workers' compensation appeal decisions (18,816 cases across Ontario and BC, 2020-2026) lack outcome categorization** in your API responses. This data gap prevents marginalized communities from conducting evidence-based legal research without expensive legal representation.

## Background: Our Platform

**3mpwrApp** is a free mobile and web app that helps injured workers understand their legal rights and navigate tribunal appeals. Our app provides:

- **Legal Case Database:** Searchable library of 42,000+ tribunal decisions with plain-language explanations
- **Outcome Predictions:** AI tools that estimate appeal success chances based on case history
- **Self-Help Resources:** Step-by-step guides, templates, and winning argument examples
- **Community Support:** Connection to legal aid clinics and peer support networks
- **Fully Accessible:** Free forever, works offline, designed for users with disabilities

**Who we serve:** Low-income injured workers, persons with disabilities, and marginalized communities who cannot afford expensive legal representation.

## The Problem: Missing Outcome Metadata in CanLII API

We have analyzed **42,314 tribunal decisions** from your API and discovered a **critical data quality problem affecting workers' compensation tribunals across Canada:**

### Cross-Provincial Analysis (2020-2026):

| Tribunal | Province | Cases | Unknown Outcomes | Rate |
|----------|----------|-------|------------------|------|
| **WSIAT** | Ontario | 11,430 | 10,491 | **91.8%** |
| **BC WCAT** | British Columbia | 7,386 | 7,386 | **100.0%** |
| **ONWSIB** | Ontario | 120 | 112 | **93.3%** |
| **ONSBT** | Ontario | 818 | 780 | **95.4%** |
| **HRTO** | Ontario (Human Rights) | 200 | 80 | **40.0%** |
| **BCHRT** | BC (Human Rights) | Running now | TBD | TBD |

**Key Finding:** Workers' compensation tribunals have **90-100% unknown outcome rates** in CanLII API responses, while human rights tribunals have only 30-40% unknown rates.

### Technical API Challenges:

1. **No Standardized Outcome Field:** API responses lack an `outcome` field entirely
2. **Inconsistent Keywords:** The `keywords` field contains injury terms but rarely outcomes ("chronic pain" vs "appeal dismissed")
3. **No Full Text Access:** API returns metadata only, not full decision text needed for NLP extraction
4. **Non-Semantic Titles:** Outcome language varies by tribunal without standardized tags

### Impact on Vulnerable Communities:

This data opacity **disproportionately harms marginalized people:**

- **18,816 workers' comp decisions** (ON + BC alone) are invisible for outcome research
- **Injured workers** cannot assess appeal chances without hiring lawyers
- **Legal aid clinics** cannot conduct evidence-based case triage
- **Researchers** cannot study systemic discrimination in tribunal decisions
- **Self-reps** waste 6-12 months on doomed appeals

## Immediate Technical Barriers

**We are currently blocked from collecting this data** to serve vulnerable communities due to CanLII's security measures:

### API Throttling
- **Current Experience:** Our data collection scripts are being rate-limited after ~50-100 requests
- **Impact:** Collecting 42,314+ tribunal decisions requires multiple days of interrupted scraping
- **Our API Key:** `5VMAI9UyXp1syvy4nEAM58QfpGZInsTF` (registered for community legal research)

### Browser-Based Scraping Blocked
- **CAPTCHA challenges** appear when attempting to scrape full decision text from CanLII.org
- **HTTP 403 Forbidden** responses block automated access even with proper user-agent headers
- **Bot detection** prevents fallback methods when API data is incomplete

### Questions for CanLII:

1. **Can you increase our API rate limits?** We are a non-profit community project serving marginalized workers, not a commercial scraper. Higher limits would enable us to collect complete datasets for evidence-based research.

2. **Do you have bulk data exports available?** Rather than scraping 42,000+ decisions via API, could CanLII provide:
   - Complete tribunal decision exports (CSV/JSON) with full text and metadata?
   - Annual data dumps for researchers and legal tech developers?
   - FTP/S3 access for large-scale data collection?

3. **Can you clarify API usage terms?** We want to comply with CanLII's policies while serving vulnerable communities:
   - What is the intended rate limit for research/non-commercial use?
   - Are there separate limits for legal aid organizations vs commercial users?
   - Can we apply for a "researcher" or "legal aid" tier with higher quotas?

**Why This Matters:** Without access to complete tribunal data, **18,816 injured workers' compensation cases remain invisible** to the communities they affect most. Legal aid clinics, disabled workers, and self-represented litigants cannot conduct evidence-based research because we are blocked from accessing public legal information.

## Our Request: Help Us Fix This Problem

We respectfully request **CanLII's partnership** to improve tribunal data access for vulnerable communities. We propose solutions in order of implementation priority:

### Priority 1: Immediate Data Access (Weeks)

**Help us access the data that already exists:**

**A) Increase API Rate Limits for Non-Profit Use**
- Current limit appears to be ~50-100 requests before throttling
- Requesting: 500-1000 requests/hour for registered legal aid projects
- Justification: We are a non-commercial platform serving marginalized communities, not a for-profit scraper

**B) Provide Bulk Data Exports**
- **Ideal format:** Annual CSV/JSON dumps of all tribunal decisions with full text
- **Alternative:** FTP/S3 access for large-scale collection by researchers
- **Precedent:** Many government open data portals provide bulk downloads (e.g., Ontario Open Data, BC Data Catalogue)

**Why this helps:** Eliminates technical barriers blocking vulnerable communities from accessing public legal information that already exists on CanLII.

### Priority 2: API Enhancement (Months)

Add a standardized `outcome` field to CanLII API responses for tribunal decisions:

```json
{
  "caseId": "2024onwsiat123",
  "title": "Decision No. 123/24",
  "outcome": {
    "category": "appeal_allowed",
    "worker_win": true,
    "confidence": "high"
  }
}
```

**Standardized Categories:**
- `appeal_allowed`, `appeal_dismissed`, `varied`, `remitted`, `withdrawn`, `consent`, `no_jurisdiction`

**Benefit:** Helps all researchers, legal tech developers, and vulnerable communities using CanLII data.

### Priority 3: Full Text API Access (Months)

Provide full decision text (not just metadata snippets) in API responses, enabling natural language processing for outcome extraction.

**Precedent:** Many courts/tribunals provide full-text APIs (e.g., US Courts PACER API)

**Why this helps:** Enables us to extract outcomes from full text when metadata is incomplete, reducing unknown rates from 90%+ to <20%.

### Priority 4: Long-Term Partnership with Tribunals (Years)

Work with tribunals (WSIAT, BC WCAT, etc.) to standardize outcome reporting in their submissions to CanLII.

**Precedent:** Human rights tribunals (HRTO, BCHRT) already do this better (40% unknown vs 100% for workers' comp)

**Why this helps:** Fixes the problem at the source - tribunals submit structured outcome data with every decision.

## Public Benefit

Improving outcome metadata in CanLII will directly support **injured workers, persons with disabilities, and vulnerable communities** by enabling:

1. **Transparency for Marginalized Communities:** Injured workers (often low-income, disabled, racialized) can see actual tribunal success rates by issue type (chronic pain, PTSD, pre-existing conditions) without hiring expensive counsel

2. **Evidence-based Self-Advocacy:** Workers can assess appeal chances based on historical outcomes, reducing reliance on costly representation they cannot afford

3. **Systemic Accountability:** Researchers and advocates can identify potential discrimination patterns or systemic barriers affecting vulnerable populations

4. **Improved Free Legal Tech:** Enhance outcome prediction accuracy from 79% to 85%+ for tribunal-specific cases, making AI tools more reliable for disabled and injured workers

5. **Cross-Provincial Equity:** Compare outcomes across provinces (WSIAT vs BC WCAT vs AB) to identify best practices and advocate for access to justice improvements

6. **Legal Aid Clinic Efficiency:** Clinics can triage cases based on outcome likelihood, serving more clients with limited resources

**All our work is open source** under Creative Commons licensing, following CanLII's principles of open legal information.

## Why This Aligns with CanLII's Mission

CanLII's mission is **"free access to Canadian law for all Canadians."** However, access to *law* without access to *outcomes* means:

- Workers cannot assess their legal positions
- Self-reps cannot make informed decisions
- Researchers cannot study access to justice barriers
- Legal AI tools cannot serve vulnerable communities

**CanLII already does this well for higher courts:** Supreme Court, Court of Appeal decisions have clear outcomes. We're asking for the same standard for tribunals serving vulnerable populations.

## Timeline & Collaboration

**Immediate Need (Priority 1):** We need data access solutions within 2-4 weeks to continue serving vulnerable communities. Without increased API limits or bulk data exports, our platform cannot provide accurate legal guidance to 18,816+ workers' compensation cases.

**Long-Term Partnership (Priorities 2-4):** We understand API enhancements and tribunal partnerships require planning. We are happy to:

- **Pilot test** any new outcome metadata fields
- **Provide feedback** on standardized outcome taxonomies
- **Share our extraction code** (open-source NLP models) to help tribunals tag decisions
- **Wait 6-12 months** for API enhancements
- **Collaborate with tribunals** on long-term data quality improvements

If CanLII cannot provide bulk data or increased API limits directly, we would appreciate being connected to:
- **CanLII technical team** to discuss API quota increases or bulk export options
- **Tribunal IT contacts** who may have alternative data sources
- **Other researchers/legal aid orgs** who have solved similar data access challenges

## About 3mpwrApp

3mpwrApp is a **non-commercial, community-driven platform** built to support injured workers across Canada. Key features:

- 100% free and open source (no paywalls, no subscriptions)
- Fully accessible (WCAG 2.2 AA+)
- Privacy-first architecture (local-first data storage)
- Developed by advocates for injured workers

**Website:** [https://3mpwrapp.ca](https://3mpwrapp.ca)  
**GitHub:** [https://github.com/3mpwrApp](https://github.com/3mpwrApp) (planned - code will be published here)  
**Research Page:** [https://3mpwrapp.github.io/research/](https://3mpwrapp.github.io/research/)

## Contact Information

**Name:** [Your Name]  
**Email:** empowrapp08162025@gmail.com  
**Organization:** 3mpwrApp (Community Project)  
**Role:** Project Lead & Developer

## Thank You

Thank you for CanLII's continued commitment to open access to justice. Your platform is invaluable to Canadians who cannot afford legal representation. We hope to partner with you to extend that access from *legal text* to *legal outcomes* - the missing piece that prevents vulnerable communities from truly understanding their rights.

If you have questions about this request, our technical approach, or how we can support CanLII's mission, please don't hesitate to contact me.

Sincerely,

[Your Name]  
3mpwrApp Project Lead  
empowrapp08162025@gmail.com

---

## Appendix: Technical Details

### Our Current Data Collection Methodology

- **Source:** CanLII API v1 (`https://api.canlii.org/v1/caseBrowse/`)
- **API Key:** `5VMAI9UyXp1syvy4nEAM58QfpGZInsTF` (registered for legal research)
- **Coverage:** 42,314 decisions across 8 tribunals, 7 provinces (2020-2026)
- **Outcome Extraction:** Keyword matching + NLP on `title` and `keywords` fields (limited by API data quality)
- **Success Rate:** 79% accuracy for human rights tribunals, <10% for workers' comp tribunals

### Technical Barriers We're Experiencing

**API Rate Limiting:**
- Throttled after ~50-100 requests within short time period
- No clear documentation on rate limits for non-commercial use
- Forces us to spread collection across multiple days with manual delays

**Browser Scraping Blocked:**
- HTTP 403 Forbidden when using automated tools (even with proper headers)
- CAPTCHA challenges appear on CanLII.org when attempting bulk collection
- Bot detection prevents fallback when API data is incomplete

**Impact:** These barriers prevent us from providing complete legal research to injured workers who need it most.

### Sample API Response (Current - Missing Outcomes):

```json
{
  "caseId": "2024onwsiat123",
  "title": "Decision No. 123/24",
  "citation": "2024 ONWSIAT 123",
  "keywords": "chronic pain, pre-existing condition, entitlement",
  "decisionDate": "2024-03-15"
  // No outcome field!
}
```

### Proposed API Response (With Outcome Metadata):

```json
{
  "caseId": "2024onwsiat123",
  "title": "Decision No. 123/24",
  "citation": "2024 ONWSIAT 123",
  "keywords": "chronic pain, pre-existing condition, entitlement",
  "decisionDate": "2024-03-15",
  "outcome": {
    "category": "appeal_allowed",
    "description": "Appeal allowed in part",
    "worker_win": true
  }
}
```
