🚀 Project · 🌐 remote
Open-Source Multilingual Evaluation Benchmark
Building a community benchmark suite that evaluates LLMs across 8 languages, including Hindi, Arabic, Indonesian, and Bengali, which are largely underrepresented in current benchmarks. Looking for NLP researchers and native speakers to contribute test sets.
open-source · benchmark · multilingual · research
expires 2026-09-20