MaLA Corpus for Massive Language Adaptation of Large Language Models https://mala-lm.github.io
MaLA-LM
AI & ML interests: NLP & LLM
Organization Card
Welcome to MaLA-LM (Massive Language Adaptation of Large Language Models)! 🌍
MaLA-LM focuses on adapting large language models to support hundreds of languages, including many underrepresented ones. Our models are multilingual, scalable, and optimized for diverse linguistic tasks.
Featured 🗣️
Check out our multilingual LLM collections, featuring models trained to handle 500+ languages, ideal for global, multilingual applications.
Dive into the collections: EMMA-500 | MaLA corpus | MaLA-500
Join our Discord server 👋: https://discord.com/invite/F5mEb7U6we
Happy building! 🚀
Models (59)
MaLA-LM/lucky52-bloom-7b1-no-32 • Text Generation • 8B • 3
MaLA-LM/emma-500-llama3.1-8b-bi • Text Generation • 8B • 3.74k
MaLA-LM/emma-500-llama3-8b-bi • Text Generation • 8B • 8
MaLA-LM/emma-500-llama3-8b-mono • Text Generation • 8B
MaLA-LM/emma-500-llama3.1-8b-mono • Text Generation • 8B • 28
MaLA-LM/lucky52-bloom-7b1-no-3 • Text Generation • 8B • 1
MaLA-LM/lucky52-bloom-7b1-no-2 • Text Generation • 8B • 1
MaLA-LM/lucky52-bloom-7b1-no-4 • Text Generation • 8B • 2
MaLA-LM/lucky52-bloom-7b1-no-5 • Text Generation • 8B
MaLA-LM/lucky52-bloom-7b1-no-6 • Text Generation • 8B • 1
Datasets (13)
MaLA-LM/mala-opus-dedup-2410 • Viewer • 19M • 474 • 3
MaLA-LM/mala-bilingual-translation-corpus • Viewer • 16.5B • 1.01k • 6
MaLA-LM/mala-monolingual-integration • Viewer • 2.14B • 634 • 2
MaLA-LM/mala-monolingual-split • Viewer • 825M • 925 • 2
MaLA-LM/mala-monolingual-filter • Viewer • 1.42B • 7.74k • 2
MaLA-LM/mala-monolingual-dedup • Viewer • 969M • 1.38k • 2
MaLA-LM/mala-opus-dedup-2410-reLID • Viewer • 62.5B • 105
MaLA-LM/mala-opus-dedup-2410-sample • Viewer • 9.5B • 828
MaLA-LM/mala-code-reasoning-v2 • Viewer • 89.7M • 65 • 5
MaLA-LM/mala-code-reasoning • Viewer • 44.9M • 52 • 4