Seoul National University VLSI Lab

university

Efficient AI

jiwonsong authored a paper about 1 month ago

dongwonjo authored a paper about 1 month ago

jiwonsong submitted a paper about 1 month ago

CompactAttention: Accelerating Chunked Prefill with Block-Union KV Selection

RelayGen: Intra-Generation Model Switching for Efficient Reasoning

