Consume
Once traces are on Hugging Face Hub, you can read them back as files or through the datasets library.
File-Oriented Access
hf-mount exposes a dataset as a virtual filesystem. That works well for agents that prefer normal file operations.
Install:
curl -fsSL https://raw.githubusercontent.com/huggingface/hf-mount/main/install.sh | sh
Mount and inspect:
hf-mount start repo datasets/your-org/agent-traces /mnt/traces
ls /mnt/traces/data/
head -n 1 /mnt/traces/data/traces_*.jsonl
For private or gated datasets, authenticate first:
hf auth login
Unmount when done:
hf-mount stop /mnt/traces
Structured Access
Use Hugging Face datasets for notebooks, analysis, or training pipelines.
from datasets import load_dataset
ds = load_dataset("your-org/agent-traces")
print(ds["train"][0]["trace_id"])
For streaming:
from datasets import load_dataset
ds = load_dataset("your-org/agent-traces", streaming=True)
for trace in ds["train"]:
print(trace["trace_id"])
Record Shape
Each JSONL line is a TraceRecord. A representative subset looks like:
{
"schema_version": "0.3.0",
"trace_id": "tr_01abc...",
"agent": {
"name": "claude-code",
"model": "..."
},
"task": {
"description": "Fix failing tests in auth module"
},
"metrics": {
"total_steps": 14,
"estimated_cost_usd": 0.031
},
"steps": ["..."]
}
See the schema overview for the full contract.
Local Lookup: Traces, Commits, And Lines
Once you install the git correlator with opentraces setup git, local commands can resolve code history back to traces.
Group traces by commit
opentraces list --by-commit
opentraces --json list --by-commit
Resolve a commit back to traces
opentraces blame abc1234
opentraces blame abc1234 src/auth.py
opentraces blame abc1234 src/auth.py --lines
opentraces --json blame abc1234
blame takes a commit SHA (bare or c:<sha>) and an optional path to scope output to one file. Use --lines for git-blame-style per-line output. This is useful for provenance, code archaeology, and dataset filtering by evidence quality.
Choosing An Access Pattern
- Use
hf-mountwhen the consumer wants to browse files or let an agent inspect shards directly - Use
datasetsfor notebooks, analysis jobs, and training pipelines - Use local
list --by-commitandblamefor repo-specific provenance work