compare knowledge cutoffs across different Llama versions
generates LinkedIn posts from free-text entries
compare the moral compasses of different models
MVP demo of multilingual LLM performance eval space
LLM calculator with Llama backend
proof of concept with a severely limited context window
compare responses from non-RAG and RAG models
simulates the RLHF training step of an LLM
neutral SD Gradio dev space
be polite and rude to Llama
try to get Llama to talk about milk
measures CO2e generated from a single query to an LLM