AI & ML interests
None yet
theblackcat102/qwen_image_2stage_2k_2k
0.9B • Updated • 28
theblackcat102/Qwen3-1.7B-Base-nq-baseline-scratch
Text Generation
• 2B • Updated • 3
theblackcat102/Qwen3-1.7B-nq-baseline-finetune-5epochs
Text Generation
• 2B • Updated • 3
theblackcat102/qwen3-4b-retool-sft
theblackcat102/amazon-Qwen3-1.7B-Base-v1-dpo-conv_title_5k-checkpoint-1300
Text Generation
• 2B • Updated • 3
theblackcat102/amazon-Qwen3-1.7B-Base-v1-dpo-conv_title_5k_step2-checkpoint-300
Text Generation
• 2B • Updated • 2
theblackcat102/amazon-Qwen3-4B-Base-v1-semantic-id-14k-checkpoint-6600
Text Generation
• 4B • Updated • 1
theblackcat102/llama-3.2-1b-instruct-allenai_wildguard_safety
Text Generation
• 1B • Updated • 6
theblackcat102/llama-3.2-1b-instruct-dart-math-uniform-conversations
Text Generation
• 1B • Updated • 3
theblackcat102/llama-3.2-1b-instruct-aya_dataset
Text Generation
• 1B • Updated • 4
theblackcat102/llama-3.2-1b-instruct-tulu-3-sft-personas-instruction-following
Text Generation
• 1B • Updated • 3
theblackcat102/llama-3.2-1b-instruct-Magicoder-Evol-Instruct-110K-multi
Text Generation
• 1B • Updated • 2
theblackcat102/amazon-Smollm3-3B-Base-v1-semantic-id-checkpoint-1300
Text Generation
• 3B • Updated • 1
theblackcat102/amazon-Qwen3-4B-Base-v1-semantic-id-checkpoint-1300
Text Generation
• 4B • Updated • 1
theblackcat102/amazon-Qwen3-1.7B-stage2-v2-mix-semantic-id-trial-2-no-high-level-checkpoint-4200
Text Generation
• 2B • Updated • 3
theblackcat102/amazon-Qwen3-1.7B-stage2-v2-mix-semantic-id-checkpoint-2800
Text Generation
• 2B • Updated • 2
theblackcat102/amazon-Qwen3-1.7B-stage2-v2-mix-semantic-id-trial-2-no-abner-checkpoint-3000
Text Generation
• 2B • Updated • 2
theblackcat102/amazon-gr-semantic-v2-iter-3300
Text Generation
• 2B • Updated • 3
theblackcat102/amazon-gr-chat-v1-iter-1600
Text Generation
• 2B • Updated • 4
theblackcat102/amazon-Qwen3-1.7B-Base-v0
Text Generation
• 2B • Updated • 4
theblackcat102/step_segmentation
Token Classification
• 0.1B • Updated • 2
theblackcat102/dense-whale-v3-stage1
37B • Updated • 1
theblackcat102/whale-v3-base-merged
Text Generation
• 37B • Updated • 7
theblackcat102/whale-v3-base-lora-1200
theblackcat102/whale-v3-base-lora-1500
theblackcat102/failed-llama-MoD-upcycling
1B • Updated • 7
theblackcat102/llama-vision-yt-11b-extraction-lora
theblackcat102/Nous-Hermes-2-Mixtral-8x7B-18m-DPO-raw-q4
Text Generation
• 30B • Updated • 5
theblackcat102/Nous-Hermes-2-Mixtral-8x7B-18m-DPO-raw
Text Generation
• 29B • Updated • 6
theblackcat102/Nous-Hermes-2-Mixtral-8x7B-20m-DPO-raw
Text Generation
• 32B • Updated • 4