Pruna OSS is turning 1! To mark this milestone, we're launching the First Prune initiative.
What's First Prune: If you contribute to open issues at our GitHub repo, you earn Pruna Inference API credits.
How you can participate: β’ Pick an open issue labelled "first-prune" and assign it to you β’ Submit your PR and mark it ready for review by April 30 β’ Find out more in the PR template when you open a PR
More OSS than ever with the latest pruna 0.3.2 release. It extends existing algorithm families, such as compilers, kernels, and pruners, and adds new ones, including decoders, distillers, enhancers, and recoverers. But it's not only a collection of algorithms; instead, you can easily combine them to get the biggest efficiency win.
Announcing RealPerformance, a dataset of functional issues of language models that mirrors failure patterns identified through rigorous testing in real LLM agents