MVP demo of multilingual LLM performance eval space
llm calculator with llama backend
try and get llama to talk about milk
severely limited context window proof of concept
generates linkedin posts from freetext entries
be polite and rude to llama
compare different llama versions for knowledge cutoff
compare responses between non-RAG and RAG model
compare different models and their moral compass
simulates the RLHF training step of an LLM
neutral sd gradio dev space
measures CO2e generated from a single query to an LLM