OBP Methodology

Comprehensive Validation Package for Academic Review

This package contains methodology documentation, source code, data schemas, sample datasets, and interactive demos for the OBP analyses. All data sources are publicly available. All code is anonymized. Nothing in this package identifies specific attorneys, firms, or cases.

Contents: 8 documentation files, 4 Python scripts, 1 interactive demo, 1 sample dataset.

1. Section 1328(f) Discharge Eligibility Analysis

Cross-references FJC IDB prior filing history against case disposition across 4.9 million Ch.13 cases to identify 391,951 cases where prior filers received discharge with no evidence of eligibility verification.

Methodology DOC

Full description of the FJC cross-reference approach, data sources, filtering logic, and limitations.

View methodology.md

Findings DOC

National summary: 391,951 cases (27.4%), policy impact, Advisory Committee citation.

View findings.md

District Breakdown - All 94 Districts DATA

The 391,951 finding broken down by all 94 federal judicial districts. Generated from actual FJC IDB data. Includes per-district prior filer rates, discharge rates, and estimated violations.

View district_breakdown.md

Screener Logic (Python) CODE

Standalone Python implementation of the 1328(f) screening algorithm. Takes PACER CSV exports and outputs violations. Also includes FJC IDB national screening mode. Zero dependencies beyond Python 3.8 stdlib.

View screener_logic.py

Interactive Demo DEMO

Self-contained HTML page that runs the screening algorithm in-browser against the sample dataset. Open in any browser - no server needed. Inspect the JavaScript to verify the logic matches the Python implementation.

Open interactive_demo.html

2. Attorney Behavior Analytics

Statistical tools for comparing attorney performance against district baselines. Two approaches: targeted comparison (specific attorney vs. baseline) and blind outlier detection (scan all attorneys, find anomalies).

Methodology DOC

Docket entry classification, response matching, rate calculation, and controls.

View methodology.md

District Comparison Tool (Python) CODE

Compares any attorney's metrics against their district's baseline. Computes Z-scores on 6 dimensions (dismissal, discharge, OSC, MTD, fee density, stay relief). Anonymized - no firm/attorney names in code.

View district_comparison.py

Blind Outlier Detection (Python) CODE

Point-and-shoot anomaly scanner. Give it a court's data, it finds the statistical outliers. No target specified. Ranks all attorneys by composite Z-score. Includes anonymization mode.

View blind_outlier.py

Opposition Rate Calculator CODE

Python script for calculating opposition rates from downloaded PACER dockets. Anonymized.

View opposition_rate.py

RSS Monitoring Methodology DOC

How free, public court RSS feeds are used for real-time case monitoring at $0 cost. Covers architecture, event classification, attorney cross-reference, and replication instructions.

View rss_methodology.md

Sample Output DATA

Anonymized results table showing response rates by motion type.

View sample_output.md

3. Data Documentation

FJC IDB Schema Reference DATA

Complete documentation of every FJC field used in the analysis. Field names, types, code meanings, and how each maps to derived metrics. Includes the local analysis database schema.

View fjc_schema.md

Sample Dataset (CSV) DATA

70-row anonymized dataset for testing the screener. Sequential case IDs (SAMPLE_001-070), generic debtor labels (Debtor_001-040), synthetic dates. Run screener_logic.py against this to verify the algorithm works.

Download sample_dataset.csv

4. Validation Guide

Reproduction Steps DOC

Step-by-step instructions for independently reproducing both analyses using public data.

View VALIDATION.md

Data Access

All analyses use publicly available data:

The GitHub repository with all source code is available on request: github.com/openbankruptcyproject/obp-methodology (private, collaborator access required).

You May Also Find Helpful