Oleg Lavrovsky's picture

Oleg Lavrovsky PRO

loleg

·

AI & ML interests

Supporting Apertus team / Organizing hackathons / Engaged for open data

Recent Activity

liked a model about 5 hours ago

mistralai/Mistral-Medium-3.5-128B

upvoted an article about 18 hours ago

Announcing MamayLM, an efficient state-of-the-art Ukrainian LLM

reacted to hannayukhymenko's post with 🔥 about 18 hours ago

🚀 We are delighted to announce MamayLM, a new state-of-the-art efficient Ukrainian LLM! 📈 MamayLM surpasses similar-sized models in both English and Ukrainian, while matching or overtaking up to 10x larger models. 📊 MamayLM is a 9B model that can run on a single GPU, enabling cost-efficient AI autonomy and adoption across sectors in Ukraine such as education, legal, healthcare, public services and others (e.g., by specializing it to particular use cases). MalayLM is also attractive for organizations wishing to preserve data privacy as it s efficiency allows it to run on a local machine. 🧠 MamayLM is trained on high-quality Ukrainian data and understands Ukrainian language, culture, and history. It is built on top of Google’s Gemma 2 9B model, but uses a number of new advances stemming from INSAIT’s experience in creating BgGPT, a Bulgarian LLM we released last year, now adopted nationwide and profiled several times by Google as a worldwide success case. 🤝 MamayLM is developed in a collaboration between researchers at INSAIT and ETH Zürich and is trained entirely via donations to INSAIT for AI compute resources. 📥 MamayLM is now freely available to download on INSAIT’s HuggingFace in both full and quantized versions. We also publicly release all Ukrainian benchmarks we evaluated on. 📝 Further, we release blog posts in both English and Ukrainian, sharing our approach to creating MamayLM, hoping to drive further improvements by the community. 🌎 The release of LLMs for various languages is part of INSAIT’s mission in ensuring countries can achieve AI autonomy in a cost-efficient, controlled, safe and predictable manner. MamayLM model and benchmarks: https://huggingface.co/INSAIT-Institute Blog (EN): https://huggingface.co/blog/INSAIT-Institute/mamaylm Blog (UKR): https://huggingface.co/blog/INSAIT-Institute/mamaylm-ukr

View all activity

Organizations

New activity in swiss-ai/apertus-pretrain-swiss 15 days ago

[bot] Conversion to Parquet

#1 opened 10 months ago by

parquet-converter

New activity in swiss-ai/Apertus-8B-Instruct-2509 17 days ago

not compatible to newest LM-Studio eg llama.cpp 1.52

#20 opened 7 months ago by

Unexpected response with Llama.cpp + Mrader's quant

#23 opened 4 months ago by

Apertus tool parser

#18 opened 8 months ago by

New activity in swiss-ai/Apertus-70B-Instruct-2509 17 days ago

Fix to accept both tool format chat/completion and responses

#11 opened 7 months ago by

New activity in swiss-ai/Apertus-8B-Instruct-2509 17 days ago

My private, local home assistant tells my son a story

#24 opened 3 months ago by

iPhone App?

#25 opened about 2 months ago by

Apertus Tool Calling: Practical Notes

#26 opened about 1 month ago by

New activity in swiss-ai/Apertus-70B-Instruct-2509 17 days ago

Support for AMD GPU

#12 opened 6 months ago by

GPU-Setup

#14 opened 4 months ago by

Quantizing with Bits and Bytes Config?

#15 opened 4 months ago by

Request: DOI

#16 opened 2 months ago by deleted

New activity in swiss-ai/Apertus-70B-2509 17 days ago

Error: fatal: expected 'packfile'

#7 opened about 2 months ago by

Good project -- just don't compare to proprietary Llama

#8 opened about 2 months ago by

New activity in swiss-ai/Apertus-8B-2509 17 days ago

LM Studio compatibility?

#2 opened 8 months ago by

Request: DOI

#3 opened 8 months ago by

Chat template missing in README example

#4 opened 5 months ago by

Correcting the link for the report. The previous one is obsolete.

#6 opened 18 days ago by

New activity in swiss-ai/Apertus-8B-Instruct-2509 17 days ago

Correcting the link for the report. The previous one is obsolete.

#27 opened 18 days ago by

New activity in swiss-ai/Apertus-70B-2509 17 days ago

Correcting the link for the report. The previous one is obsolete.

#9 opened 18 days ago by