evalstate/transformers-pr-api-data / snapshots /hf-evalstate--transformers-pr--main
872 MB
330 files
Updated about 4 hours ago
NameSize
analysis-state
analysis
snapshots
state
README.md1.92 kB
xet
analysis-report.json46.4 kB
xet
comments.parquet2.35 MB
xet
events.parquet730 kB
xet
issues.parquet997 kB
xet
links.parquet43.1 kB
xet
manifest.json34.7 kB
xet
new-contributors-report.json873 kB
xet
new-contributors-report.md528 kB
xet
new_contributors.parquet62.4 kB
xet
pr-scope-clusters.json137 kB
xet
pr_diffs.parquet47 MB
xet
pr_files.parquet39.9 MB
xet
pull_requests.parquet1.88 MB
xet
review_comments.parquet2.49 MB
xet
reviews.parquet767 kB
xet
README.md

Transformers PR Dataset

Normalized snapshots of issues, pull requests, comments, reviews, and linkage data from huggingface/transformers.

Files:

  • issues.parquet
  • pull_requests.parquet
  • comments.parquet
  • issue_comments.parquet (derived view of issue discussion comments)
  • pr_comments.parquet (derived view of pull request discussion comments)
  • reviews.parquet
  • pr_files.parquet
  • pr_diffs.parquet
  • review_comments.parquet
  • links.parquet
  • events.parquet
  • new_contributors.parquet
  • new-contributors-report.json
  • new-contributors-report.md

Use:

  • duplicate PR and issue analysis
  • triage and ranking experiments
  • eval set creation

Notes:

  • latest snapshot: 20260516T000042Z
  • raw data only; no labels or moderation decisions
  • PR metadata, file-level patch hunks, and full unified diffs are included
  • full file contents for changed files are not included
Total size
872 MB
Files
330
Last updated
May 16
Pre-warmed CDN
US EU US EU

Contributors