Le Lézard
Classified in: Science and technology

Announcing the Multilingual Conversational Speech Language Model (MLC-SLM) Challenge


MONROVIA, Calif., March 19, 2025 /PRNewswire/ -- Nexdata, a leading global provider of AI data services today announces the start of The Multilingual Conversational Speech LLM (MLC-SLM) Challenge, an officially approved satellite event of Interspeech 2025.

This challenge, hosted by Meta, Google, Samsung, Naver, China Mobile, Northwestern Polytechnical University and Nexdata, aims to advance multilingual conversational speech AI by providing a real-world dataset and encouraging innovation in speech language models.

The challenge consists of two tasks, both of which require participants to explore the development of speech language models (SLMs):

Task I: Multilingual Conversational Speech Recognition

Objective: Develop a multilingual LLM-based ASR model. Participants will be provided with oracle segmentation and speaker labels for each conversation.

Task II: Multilingual Conversational Speech Diarization and Recognition

Objective: Develop a system for both speaker diarization (identifying who is speaking when), and recognition (transcribing speech to text). No prior or oracle information will be provided during evaluation (e.g., no pre-segmented utterances or speaker labels). Both pipeline-based and end-to-end systems are encouraged, providing flexibility in system design and implementation.

The training set (Train) comprises approximately 11 languages: English (en), French (fr), German (de), Italian (it), Portuguese (pt), Spanish (es), Japanese (jp), Korean (ko), Russian (ru), Thai (th), Vietnamese (vi). It's designed to provide a rich resource for training and evaluating multilingual conversational speech language models (MLC-SLM), addressing the challenges of linguistic diversity, speaker variability, and contextual understanding.

Important Dates (AOT Time)

March 10, 2025: Registration opens
March 15, 2025: Training data release
March 20, 2025: Development set and baseline system release
May 15, 2025: Evaluation set release and Leaderboard open
May 30, 2025: Leaderboard freeze and paper submission portal opens (CMT system)
June 15, 2025: Paper submission deadline
July 1, 2025: Notification of acceptance
August 18, 2025: Workshop date

We have set a prize pool of $20,000 for the winners. Based on performance, the top three teams in each track will be awarded:
1st Prize: $5,000
2nd Prize: $3,000
3rd Prize: $2,000

For more details, please check out the challenge website: https://www.nexdata.ai/competition/mlc-slm 

Participate here: https://docs.google.com/forms/d/e/1FAIpQLSftZCRQQWvO5NZd-bPo1VT2Xsaieu_ZYCklw6MhW6LqjWnuYQ/viewform?usp=send_form 

For inquiries: [email protected] 

Join us in shaping the future of multilingual conversational AI and be part of this groundbreaking challenge!

About Nexdata

Nexdata provides top-notch training data solutions and serves as your reliable partner. With an extensive array of off-the-shelf datasets and flexible data collection and annotation services, our mission revolves around unleashing AI's full potential and expediting the AI industry's growth.

SOURCE Nexdata


These press releases may also interest you

at 14:49
The Research Institute for Fragrance Materials (RIFM) congratulates Safety Assessment Team Researcher Marissa Guttenberg, PhD, on receiving the Society of Toxicology (SOT) 2025 Best Paper of the Year Award in the Immunotoxicology Specialty Section....

at 14:46
The Midwest's premier supply chain event, the Third-Annual Jarrett Supply Chain Summit, returns on Thursday, Aug. 7, at the Kent State University-Stark Conference Center in Canton, Ohio. Kicking off the event, attendees are invited to an exclusive...

at 14:41
As crypto mining continues to grow worldwide, Nigeria is emerging as one of the most strategic and cost-efficient locations for large-scale mining operations. With 2GW of energy allocated by the Nigerian government for crypto mining projects,...

at 14:38
According to the latest study from BCC Research, "Global Chiplets Market" will reach $42.8 billion by the end of 2029, growing at a CAGR of 41.9% from 2024 to 2029. This report focuses on five processor segments: CPUs, GPUs, FPGAs, AI-ASIC...

at 14:35
The sixth annual Bioelectronic Medicine Summit, hosted by Northwell Health's Feinstein Institutes for Medical Research, brought together leading scientists, engineers, clinicians and innovators in the fields of translational medicine, neuromodulation...

at 14:35
Justice Design News (JDN) is now online and instantly stands as the industry's most robust web platform solely dedicated to the design and construction of justice facilities in North America. "This vital new industry resource aims to be an...



News published on and distributed by: