Unitxt

community
https://www.unitxt.ai

AI & ML interests

IBM Research

Elron Bandel
authored 7 papers 2 months ago

AlephBERT: A Hebrew Large Pre-Trained Language Model to Start-off your Hebrew NLP Application With

Paper • 2104.04052 • Published Apr 8, 2021

Efficient Benchmarking (of Language Models)

Paper • 2308.11696 • Published Aug 22, 2023

Quality Controlled Paraphrase Generation

Paper • 2203.10940 • Published Mar 21, 2022

Lexical Generalization Improves with Larger Models and Longer Training

Paper • 2210.12673 • Published Oct 23, 2022

Benchmark Agreement Testing Done Right: A Guide for LLM Benchmark Evaluation

Paper • 2407.13696 • Published Jul 18, 2024 • 5

DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation

Paper • 2503.01622 • Published Mar 3, 2025

General Agent Evaluation

Paper • 2602.22953 • Published Feb 26 • 11
Elron 
updated a Space 4 months ago
Running

Unitxt Metric

📈

Evaluate AI model performance on diverse tasks and benchmarks

Elron 
updated a dataset 4 months ago

unitxt/data

Updated Jan 13 • 5.11k
Elron 
authored a paper over 2 years ago

Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI

Paper • 2401.14019 • Published Jan 25, 2024 • 23