Grassroots Science

community

https://grassroots.science

GrassrootsSci

grassroots-science

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

ljvmiranda921 authored a paper 1 day ago

FilBench: Can LLMs Understand and Generate Filipino?

afaji authored a paper 2 months ago

Predicting the Order of Upcoming Tokens Improves Language Modeling

gentaiscool authored a paper 4 months ago

Language Surgery in Multilingual Large Language Models

View all activity

ljvmiranda921

authored a paper 1 day ago

FilBench: Can LLMs Understand and Generate Filipino?

Paper • 2508.03523 • Published Aug 5

afaji

authored a paper 2 months ago

Predicting the Order of Upcoming Tokens Improves Language Modeling

Paper • 2508.19228 • Published Aug 26 • 22

gentaiscool

authored a paper 4 months ago

Language Surgery in Multilingual Large Language Models

Paper • 2506.12450 • Published Jun 14 • 16

ljvmiranda921

authored a paper 5 months ago

R3: Robust Rubric-Agnostic Reward Models

Paper • 2505.13388 • Published May 19 • 11

afaji

authored a paper 6 months ago

Crosslingual Reasoning through Test-Time Scaling

Paper • 2505.05408 • Published May 8 • 8

gentaiscool

authored a paper 6 months ago

Crosslingual Reasoning through Test-Time Scaling

Paper • 2505.05408 • Published May 8 • 8

yongzx

authored 4 papers 6 months ago

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

Paper • 2406.10118 • Published Jun 14, 2024 • 32

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 76

Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks

Paper • 2410.18210 • Published Oct 23, 2024

Crosslingual Reasoning through Test-Time Scaling

Paper • 2505.05408 • Published May 8 • 8

afaji

authored a paper 6 months ago

Softpick: No Attention Sink, No Massive Activations with Rectified Softmax

Paper • 2504.20966 • Published Apr 29 • 33

ljvmiranda921

authored 2 papers 8 months ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 41

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published Mar 10 • 101

afaji

authored a paper 8 months ago

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published Mar 10 • 101

yongzx

authored a paper 8 months ago

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18 • 19

ljvmiranda921

authored 2 papers 10 months ago

Bridging the Data Provenance Gap Across Text, Speech and Video

Paper • 2412.17847 • Published Dec 19, 2024 • 10

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 22

gentaiscool

authored 3 papers 11 months ago

Attention-Based LSTM for Psychological Stress Detection from Spoken Language Using Distant Supervision

Paper • 1805.12307 • Published May 31, 2018 • 1

NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages

Paper • 2309.10661 • Published Sep 19, 2023 • 1

XPersona: Evaluating Multilingual Personalized Chatbot

Paper • 2003.07568 • Published Mar 17, 2020

AI & ML interests

Recent Activity

Team members 6

grassroots-science's activity