# Introduction

![Fully Automated. Zero Manual Labels. — Configure, Generate Training Data, Use](/files/TWz3ock8Nf7nzT43CaGZ)

The Lightning Rod SDK provides a simple, but powerful end-to-end API for generating custom synthetic datasets and fine-tuning LLMs.

Transform news articles, documents, and other real-world data into high-quality training samples automatically.

### How It Works

The SDK follows a pipeline-based workflow:

1. **Seeds** — Raw data from news articles, documents, or custom sources
2. **Questions** — AI-generated forecasting questions from seeds
3. **Context** — Optional enrichment with relevant news or RAG-retrieved documents
4. **Labels** — Ground truth answers resolved via web search
5. **Dataset** — Training-ready samples in a format you can use immediately
6. **Train** — Fine-tune models on your datasets *(early access)*
7. **Eval** — Run evals against the test dataset *(early access)*
8. **Inference** — Run predictions with your fine-tuned model *(early access)*

You configure a pipeline, run it, and receive a labeled dataset. No manual question writing or labeling required. With early access, you can also train, evaluate, and run inference with models end-to-end.

### Research Foundation

Lightning Rod is based on our research: [Future-as-Label: Scalable Supervision from Real-World Outcomes](https://arxiv.org/abs/2601.06336). We use this approach to generate the [Future-as-Label training dataset](https://huggingface.co/datasets/LightningRodLabs/future-as-label-paper-training-dataset) for our paper.

### What's Next

* [Quickstart](/python-sdk/getting-started/quickstart.md) — Install the SDK and generate your first dataset in minutes
* [Examples](/python-sdk/getting-started/examples.md) — Run tutorials and end-to-end notebooks in Google Colab
* [Dataset Generation](/python-sdk/dataset-generation/overview.md) — Deep dive into pipelines, seed generators, and question types
* [Forecasting](/python-sdk/forecasting/overview.md) — Get probability estimates with foresight-v3 forecasting model
* [Fine Tuning](/python-sdk/fine-tuning-beta/overview.md) — Fine-tune models on your generated datasets (early access)


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.lightningrod.ai/python-sdk/getting-started/introduction.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
