Uncertainty is Fragile: Manipulating Uncertainty in Large Language Models Paper • 2407.11282 • Published Jul 15, 2024
CUDAHercules: Benchmarking Hardware-Aware Expert-level CUDA Optimization for LLMs Paper • 2605.08467 • Published 20 days ago