2 7

Sebastian Nagy

Snagy22000

AI & ML interests

None yet

Recent Activity

upvoted an article 13 days ago

Transformers.js v4 Preview: Now Available on NPM!

reacted to MonsterMMORPG's post with 🚀 16 days ago

Qwen Image Models Training - 0 to Hero Level Tutorial - LoRA & Fine Tuning - Base & Edit Model - https://youtu.be/DPX3eBTuO_Y This is a full comprehensive step-by-step tutorial for how to train Qwen Image models. This tutorial covers how to do LoRA training and full Fine-Tuning / DreamBooth training on Qwen Image models. It covers both the Qwen Image base model and the Qwen Image Edit Plus 2509 model. This tutorial is the product of 21 days of full R&D, costing over $800 in cloud services to find the best configurations for training. Furthermore, we have developed an amazing, ultra-easy-to-use Gradio app to use the legendary Kohya Musubi Tuner trainer with ease. You will be able to train locally on your Windows computer with GPUs with as little as 6 GB of VRAM for both LoRA and Fine-Tuning. Furthermore, I have shown how to train a character (person), a product (perfume) and a style (GTA5 artworks). Tutorial Link : https://youtu.be/DPX3eBTuO_Y

reacted to alvarobartt's post with 🔥 16 days ago

💥 `hf-mem` v0.4.1 now also estimates KV cache memory requirements for any context length and batch size with the `--experimental` flag! `uvx hf-mem --model-id ... --experimental` will automatically pull the required information from the Hugging Face Hub to include the KV cache estimation, when applicable. 💡 Alternatively, you can also set the `--max-model-len`, `--batch-size` and `--kv-cache-dtype` arguments (à la vLLM) manually if preferred.

View all activity

Organizations

upvoted an article 13 days ago

Article

Transformers.js v4 Preview: Now Available on NPM!

15 days ago

•

reacted to MonsterMMORPG's post with 🚀 16 days ago

Post

2734

Qwen Image Models Training - 0 to Hero Level Tutorial - LoRA & Fine Tuning - Base & Edit Model - https://youtu.be/DPX3eBTuO_Y

This is a full comprehensive step-by-step tutorial for how to train Qwen Image models. This tutorial covers how to do LoRA training and full Fine-Tuning / DreamBooth training on Qwen Image models. It covers both the Qwen Image base model and the Qwen Image Edit Plus 2509 model. This tutorial is the product of 21 days of full R&D, costing over $800 in cloud services to find the best configurations for training. Furthermore, we have developed an amazing, ultra-easy-to-use Gradio app to use the legendary Kohya Musubi Tuner trainer with ease. You will be able to train locally on your Windows computer with GPUs with as little as 6 GB of VRAM for both LoRA and Fine-Tuning. Furthermore, I have shown how to train a character (person), a product (perfume) and a style (GTA5 artworks).

Tutorial Link : https://youtu.be/DPX3eBTuO_Y

1 reply

reacted to alvarobartt's post with 🔥 16 days ago

Post

3040

💥 hf-mem v0.4.1 now also estimates KV cache memory requirements for any context length and batch size with the --experimental flag!

uvx hf-mem --model-id ... --experimental will automatically pull the required information from the Hugging Face Hub to include the KV cache estimation, when applicable.

💡 Alternatively, you can also set the --max-model-len, --batch-size and --kv-cache-dtype arguments (à la vLLM) manually if preferred.

1 reply

liked a model about 2 months ago

SebastianBodza/Kartoffelbox-v0.1

Text-to-Speech • Updated Jul 31, 2025 • 15 • 64

published a model 5 months ago

Snagy22000/distilbert-base-uncased-finetuned-emotion

Updated Sep 22, 2025

liked a dataset 7 months ago

mahmudulhasan01/baby_crying_sound

Viewer • Updated Jun 25, 2025 • 1.31k • 58 • 9

liked a model 8 months ago

Menlo/Jan-nano-128k

Text Generation • Updated Jul 1, 2025 • 2.4k • 221

liked 2 models 10 months ago

foduucom/baby-cry-classification

Audio Classification • Updated Jul 23, 2024 • 19

diarizers-community/speaker-segmentation-fine-tuned-callhome-deu

1.47M • Updated Apr 25, 2024 • 214 • 6

reacted to Xenova's post with 🔥 about 1 year ago

Post

8772

First project of 2025: Vision Transformer Explorer

I built a web app to interactively explore the self-attention maps produced by ViTs. This explains what the model is focusing on when making predictions, and provides insights into its inner workings! 🤯

Try it out yourself! 👇
webml-community/attention-visualization

Source code: https://github.com/huggingface/transformers.js-examples/tree/main/attention-visualization

reacted to Xenova's post with 🔥 over 1 year ago

Post

15225

I'm excited to announce that Transformers.js V3 is finally available on NPM! 🔥 State-of-the-art Machine Learning for the web, now with WebGPU support! 🤯⚡️

Install it from NPM with:
𝚗𝚙𝚖 𝚒 @𝚑𝚞𝚐𝚐𝚒𝚗𝚐𝚏𝚊𝚌𝚎/𝚝𝚛𝚊𝚗𝚜𝚏𝚘𝚛𝚖𝚎𝚛𝚜

or via CDN, for example: https://v2.scrimba.com/s0lmm0qh1q

Segment Anything demo: webml-community/segment-anything-webgpu