Add comprehensive LLM finetuning analysis with 7 visualization graphs 9d429ce havinashpatil committed 7 days ago
Final hackathon submission: polished README + detailed blog writeup 90be6c7 havinashpatil committed 7 days ago
Fix builtin syntax fixer for inline one-line headers and validate TGI fixes 0c0a8ff havinashpatil committed 7 days ago
Fix HF Spaces TGI launch entrypoint to avoid unexpected argument error 27a8fcf havinashpatil committed 7 days ago
Fix TGI endpoint plumbing and use full TGI runtime image in Dockerfile a8bc575 havinashpatil committed 7 days ago
Add AI coding system with local Hugging Face LLM integration 271cc02 havinashpatil committed 7 days ago
Update Dockerfile to build frontend for Spaces deployment da3a03d havinashpatil committed 7 days ago
Serve React frontend from FastAPI backend for one-click access 62c7e95 havinashpatil committed 7 days ago
Finalizing CodeArena RL Benchmark: frontend improvements, GRPO training scripts, and cleaned environment 03a7eb9 havinashpatil committed 7 days ago
fix: clamp reward to [0.01,0.99] so .2f never rounds to 0.00 or 1.00 59fd9d3 havinashpatil committed 8 days ago
fix: removed invalid openenv-py package from notebook install cell 82e39c9 havinashpatil committed 8 days ago
chore: update dependencies and include training results for README 8599a81 havinashpatil committed 8 days ago
Complete all tasks: Adaptive curriculum, GRPO, React frontend, LLM-as-a-judge a448db8 havinashpatil committed 8 days ago
fix: reset task_id parsing, grader tuple crash fallback, and inference score output 646409d adityanaikhpt committed 25 days ago
fix: exclusive score clamping (0.001, 0.999) for evaluator 2d8d38c adityanaikhpt committed 25 days ago
Rewrite inference.py for strict OpenEnv parsing + add httpx eb60bd2 adityanaikhpt committed 25 days ago
fix: add safe_reward() clamp at every reward return point f14f8d9 adityanaikhpt committed 25 days ago
Minimal patch: standalone proxy ping + reward clamped to (0,1) 74bfde0 adityanaikhpt committed 25 days ago
Add normalize_reward(); enforce strict (0,1) on all paths b53855f adityanaikhpt committed 25 days ago
fix: use API_BASE_URL/API_KEY for LiteLLM proxy - always make API call (Phase 2) 51fdbe8 adityanaikhpt committed 25 days ago
fix: make inference.py crash-proof when OPENAI_API_KEY is missing (Phase 2) 1fe26af adityanaikhpt committed 25 days ago
fix: OpenEnv multi-mode compliance - add main() entrypoint and uv.lock e92bfc1 adityanaikhpt committed 25 days ago
Production-ready: add server/app.py with fallback-safe /reset, fix Dockerfile, add HF metadata, add task JSON files dcc8fa3 adityanaikhpt committed 25 days ago
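
Several commits above (59fd9d3, f14f8d9, 74bfde0, b53855f, 2d8d38c) deal with keeping the reward strictly inside (0, 1). The following is a minimal sketch of that idea, not the repository's actual code: the name safe_reward is borrowed from commit f14f8d9, but its signature, the default bounds (taken from commit 59fd9d3), and the NaN fallback are assumptions made for illustration.

```python
def safe_reward(raw: float, low: float = 0.01, high: float = 0.99) -> float:
    """Clamp a raw reward into [low, high].

    Hypothetical helper: the bounds follow commit 59fd9d3, so that
    formatting with ".2f" can never print 0.00 or 1.00.
    """
    if raw != raw:  # NaN is the only float that is not equal to itself
        return low  # assumption: treat an undefined reward as the minimum
    return max(low, min(high, raw))


if __name__ == "__main__":
    # Values outside the bounds are pulled to the nearest bound;
    # values inside the bounds pass through unchanged.
    for raw in (-0.5, 0.0, 0.004, 0.5, 0.996, 1.0, 2.0):
        print(f"{raw!r:>6} -> {safe_reward(raw):.2f}")
```

Calling one helper like this at every point where a reward is returned, as commit f14f8d9 describes, means a downstream parser that expects a value strictly between 0 and 1 never sees a boundary value, even after two-decimal rounding.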