James Wright PRO

jsfs11

jsfs11

AI & ML interests

ML engineering, SOTA techniques, LLM fine-tuning and merging

Recent Activity

liked a model 1 day ago

numind/NuExtract3

liked a model about 1 month ago

deepseek-ai/DeepSeek-V4-Flash

liked a model about 1 month ago

deepseek-ai/DeepSeek-V4-Pro

View all activity

Organizations

None yet

upvoted 3 articles about 1 month ago

Article

🪆 Introduction to Matryoshka Embedding Models

tomaarsen, Xenova, osanseviero

•

Feb 23, 2024

• 208

Article

Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 9

• 60

Article

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 16

• 71

upvoted a paper about 2 months ago

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

Paper • 2604.05091 • Published Apr 6 • 47

upvoted an article 2 months ago

Article

Build a Domain-Specific Embedding Model in Under a Day

nvidia

•

Mar 20

• 73

upvoted an article 4 months ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

burtenshaw, evalstate, merve, pcuenq

•

Jan 28

• 156

upvoted an article 7 months ago

Article

Australian-made LLM beats OpenAI and Google at legal retrieval

isaacus

•

Oct 23, 2025

• 28

upvoted 2 articles 8 months ago

Article

Introducing RTEB: A New Standard for Retrieval Evaluation

fzliu, KennethEnevoldsen, Samoed, isaacchung, tomaarsen, fzoll

•

Oct 1, 2025

• 144

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

tomaarsen, Xenova, alvarobartt, ariG23498, pcuenq, sergiopaniego

•

Sep 4, 2025

• 275

upvoted a paper 9 months ago

Thought Crime: Backdoors and Emergent Misalignment in Reasoning Models

Paper • 2506.13206 • Published Jun 16, 2025 • 1

upvoted 2 articles 11 months ago

Article

Sensitivity Aware Mixed Precision Quantization V1

badaoui

•

Jun 13, 2025

• 26

Article

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

derekl35, marcsun13, sayakpaul, merve, linoyts

•

Jun 19, 2025

• 106

upvoted 2 papers 12 months ago

RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination

Paper • 2505.21925 • Published May 28, 2025 • 37

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published May 22, 2025 • 37

upvoted a paper about 1 year ago

Mixture of Quantized Experts (MoQE): Complementary Effect of Low-bit Quantization and Robustness

Paper • 2310.02410 • Published Oct 3, 2023 • 3

upvoted a collection about 1 year ago

Llama 4

Collection

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated Apr 22 • 57

upvoted 2 papers over 1 year ago

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14, 2025 • 128

MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design

Paper • 2412.14590 • Published Dec 19, 2024 • 15

upvoted a collection over 1 year ago

Falcon3

Collection

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated Nov 6, 2025 • 93

upvoted a paper over 1 year ago

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 115

James Wright PRO

AI & ML interests

Recent Activity

Organizations

jsfs11's activity

🪆 Introduction to Matryoshka Embedding Models

Multimodal Embedding & Reranker Models with Sentence Transformers

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Build a Domain-Specific Embedding Model in Under a Day

We Got Claude to Build CUDA Kernels and teach open models!

Australian-made LLM beats OpenAI and Google at legal retrieval

Introducing RTEB: A New Standard for Retrieval Evaluation

Welcome EmbeddingGemma, Google's new efficient embedding model

Sensitivity Aware Mixed Precision Quantization V1

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware