AI & ML interests
None defined yet.
Recent Activity
Papers
Future-as-Label: Scalable Supervision from Real-World Outcomes
Outcome-based Reinforcement Learning to Predict the Future
Lightning Rod Labs
Train with Timestamps, Not Labels.
Lightning Rod Labs automatically generates high-quality training data from your documents or public sources — no labeling or extraction required. Define your criteria in Python, and our SDK treats real-world outcomes as the label, producing high-signal supervision at scale. Models learn causal factors, not just tokens. Raw data to deployable specialized models in hours.
How It Works
We generate grounded, model-ready training data from documents or public sources (Google News, SEC filings, market data). You define your criteria in Python, and our SDK uses the future as the label — turning messy, timestamped history into training signal automatically. No labeling pipelines, no extraction, no human annotation.
This approach has been used to beat frontier AIs 100x larger on prediction-market benchmarks, and has demonstrated success in financial forecasting, risk estimation, and policy prediction.
Research & Results
- SEC Risk Prediction: Foresight learning on raw SEC filings trains a 32B model to outperform GPT-5 at predicting public company risks.
- Future-as-Label: AI learns directly from raw chronological news data at unlimited scale, no human annotation.
- Outcome-based RL (TMLR): Using RL to improve LLM forecasting ability from real-world outcomes.
- Foresight-32B vs. Frontier LLMs: Live demonstration beating frontier models on Polymarket predictions.
Foresight-32B is consistently top-ranked on ForecastBench and ProphetArena Sports.
Models & Datasets
| Resource | Description |
|---|---|
| Trump-Forecaster | RL-tuned gpt-oss-120b LoRA adapter for predicting Trump administration actions. Beats GPT-5 (Brier 0.194 vs 0.200). |
| Golf-Forecaster | RL-tuned gpt-oss-120b LoRA adapter for predicting professional golf outcomes. Beats GPT-5.1 (Brier 0.207 vs 0.218). |
| WWTD-2025 | 2,790 binary forecasting questions about U.S. policy under the Trump administration, with news context and ground-truth resolutions. |
| GolfForecasting | 4,033 binary forecasting questions about professional golf across PGA Tour, LIV Golf, LPGA, and majors. |