Buckets:
872 MB
330 files
Updated about 4 hours ago
| Name | Size | Uploaded | Xet hash |
|---|---|---|---|
| analysis-state | 2 items | | |
| analysis | 3 items | | |
| snapshots | 307 items | | |
| state | 1 item | | |
| README.md | 1.92 kB | | a94c646b |
| analysis-report.json | 46.4 kB | | 21064965 |
| comments.parquet | 2.35 MB | | 475bbd9f |
| events.parquet | 730 kB | | 673d973c |
| issues.parquet | 997 kB | | da7fea48 |
| links.parquet | 43.1 kB | | 86bc8c54 |
| manifest.json | 34.7 kB | | 1ca10cec |
| new-contributors-report.json | 873 kB | | acfa5edb |
| new-contributors-report.md | 528 kB | | f146fbd5 |
| new_contributors.parquet | 62.4 kB | | 21337a68 |
| pr-scope-clusters.json | 137 kB | | 916b2ea9 |
| pr_diffs.parquet | 47 MB | | 8114ee97 |
| pr_files.parquet | 39.9 MB | | 9462165d |
| pull_requests.parquet | 1.88 MB | | 51a1eca8 |
| review_comments.parquet | 2.49 MB | | 8a06f32f |
| reviews.parquet | 767 kB | | c66e9995 |
Transformers PR Dataset
Normalized snapshots of issues, pull requests, comments, reviews, and linkage data from huggingface/transformers.
Files:
- issues.parquet
- pull_requests.parquet
- comments.parquet
- issue_comments.parquet (derived view of issue discussion comments)
- pr_comments.parquet (derived view of pull request discussion comments)
- reviews.parquet
- pr_files.parquet
- pr_diffs.parquet
- review_comments.parquet
- links.parquet
- events.parquet
- new_contributors.parquet
- new-contributors-report.json
- new-contributors-report.md
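A minimal sketch of loading one of the Parquet tables listed above, assuming the files have already been downloaded into a local directory. The helper names `table_path` and `load_table` and the directory layout are hypothetical; `pandas.read_parquet` is the only library call used, and it needs a Parquet engine such as pyarrow installed. The column schema is not documented here, so the sketch does not assume any column names.

```python
from pathlib import Path

# Table names taken from the file list above (without the .parquet suffix).
PARQUET_TABLES = (
    "issues", "pull_requests", "comments", "issue_comments", "pr_comments",
    "reviews", "pr_files", "pr_diffs", "review_comments", "links", "events",
    "new_contributors",
)


def table_path(root, table):
    """Return the expected local path of a table under `root` (hypothetical layout)."""
    if table not in PARQUET_TABLES:
        raise KeyError(f"unknown table: {table}")
    return Path(root) / f"{table}.parquet"


def load_table(root, table, columns=None):
    """Load one table as a DataFrame; `columns` limits what is read from disk."""
    import pandas as pd  # imported lazily; requires pandas plus pyarrow or fastparquet

    path = table_path(root, table)
    if not path.exists():
        # Not every listed file is guaranteed to be present in a given download.
        raise FileNotFoundError(path)
    return pd.read_parquet(path, columns=columns)
```

Calling `load_table(root, "pull_requests").dtypes` is one way to inspect the actual schema before relying on any column. For the larger tables (pr_diffs.parquet and pr_files.parquet are tens of megabytes), passing `columns` keeps memory use down.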
Use:
- duplicate PR and issue analysis
- triage and ranking experiments
- eval set creation
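As an illustration of the duplicate-analysis use case, here is a rough sketch of near-duplicate detection on titles using only the standard library. It is one simple baseline, not the method used to produce any file in this dataset. The `number` and `title` keys and the sample rows are made up for the example; check the real column names in issues.parquet or pull_requests.parquet before adapting it.

```python
from difflib import SequenceMatcher
from itertools import combinations


def near_duplicate_titles(rows, threshold=0.85):
    """Return (number_a, number_b, score) for title pairs scoring >= threshold.

    Compares every pair, so cost grows quadratically with the number of rows.
    """
    pairs = []
    for a, b in combinations(rows, 2):
        score = SequenceMatcher(None, a["title"].lower(), b["title"].lower()).ratio()
        if score >= threshold:
            pairs.append((a["number"], b["number"], round(score, 3)))
    # Highest-scoring pairs first.
    return sorted(pairs, key=lambda p: -p[2])


# Made-up rows, for illustration only.
rows = [
    {"number": 1, "title": "Fix typo in README"},
    {"number": 2, "title": "Fix typo in readme"},
    {"number": 3, "title": "Add support for new tokenizer"},
]
print(near_duplicate_titles(rows))  # [(1, 2, 1.0)]
```

Pairwise comparison is fine for a few thousand rows; for larger sets, blocking on shared tokens or comparing embeddings would likely scale better.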
Notes:
- latest snapshot: 20260516T000042Z
- raw data only; no labels or moderation decisions
- PR metadata, file-level patch hunks, and full unified diffs are included
- full file contents for changed files are not included
- Last updated: May 16
- Pre-warmed CDN: US, EU
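The snapshot identifier in the notes (20260516T000042Z) looks like a compact UTC timestamp. The sketch below parses that form and picks the most recent of several identifiers. That every entry under snapshots uses this exact format is an assumption, and `parse_snapshot_id` and `latest_snapshot` are hypothetical helper names.

```python
from datetime import datetime, timezone


def parse_snapshot_id(snapshot_id: str) -> datetime:
    """Parse an identifier like 20260516T000042Z into a timezone-aware UTC datetime."""
    parsed = datetime.strptime(snapshot_id, "%Y%m%dT%H%M%SZ")
    return parsed.replace(tzinfo=timezone.utc)


def latest_snapshot(snapshot_ids):
    """Return the identifier with the most recent timestamp."""
    return max(snapshot_ids, key=parse_snapshot_id)


print(parse_snapshot_id("20260516T000042Z").isoformat())  # 2026-05-16T00:00:42+00:00
```

Because the format is fixed-width with the most significant field first, plain string comparison would give the same ordering; parsing is mainly useful for date arithmetic and for rejecting malformed names.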