daje kang
daje
AI & ML interests
None yet
Recent Activity
liked a dataset 7 days ago: nvidia/Nemotron-Personas-Korea
updated a dataset 26 days ago: daje/korean-tts-training
published a dataset 26 days ago: daje/korean-tts-training
Organizations
Paper
Cached Transformers: Improving Transformers with Differentiable Memory Cache
Paper • 2312.12742 • Published • 13
ProTIP: Progressive Tool Retrieval Improves Planning
Paper • 2312.10332 • Published • 8
Paloma: A Benchmark for Evaluating Language Model Fit
Paper • 2312.10523 • Published • 13
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
Paper • 2406.17557 • Published • 102
models 41
daje/whisper-v3-turbo-address
Automatic Speech Recognition • 0.8B • Updated • 2
daje/Qwen2-VL-7B-Instruct-fashion-product-images-small
8B • Updated • 3
daje/Meta-Llama-3.1-8B-Instruct-de-identification
8B • Updated • 1
daje/Qwen2.5-14B-Instruct-tools
Text Generation • 15B • Updated • 2
daje/model_0.0002_alpha-32_r-64
Updated • 111
daje/model_0.0002_alpha-8_r-16
Updated • 123
daje/model_5e-05_alpha-128_r-256
Updated • 361
daje/model_2e-4_alpha-8_r-16
Updated • 364
daje/model_Lora
Updated • 6
daje/model_2e-4
Updated • 240
datasets 20
daje/korean-tts-training
Viewer • Updated • 120 • 637 • 1
daje/korean-address-voice-v2
Viewer • Updated • 3.74k • 12
daje/korean-address-voice
Viewer • Updated • 118 • 8
daje/synthetic-ko-sql-hard-add-llm-result
Viewer • Updated • 1.68k • 11
daje/synthetic-ko-sql-hard
Viewer • Updated • 1.68k • 10 • 1
daje/kotext-to-sql-v1-hard
Viewer • Updated • 2k • 13
daje/kaggle-image-datasets
Viewer • Updated • 44.4k • 17
daje/de-identify-chat-ko
Viewer • Updated • 9.92k • 14
daje/ko-hatefulmemes_train_8500
Viewer • Updated • 8.2k • 54
daje/ko-hatefulmemes_train_8500_kmhas
Viewer • Updated • 95.3k • 16