MLCommons
MLCommons is an AI engineering consortium, built on a philosophy of open collaboration to improve AI systems. Through our collective engineering efforts, we continually measure and improve AI technologies' accuracy, safety, speed, and efficiency.
- Real Shopify data in MLPerf v6.0. Typos, multilingual text, embedded HTML—all the complexity production AI faces at scale. No synthetic data. Submit Feb 13 bit.ly/4k9F5YS
- The AI Economy: Flywheels for Agentic Value Creation Luis Oala on the economics that matter: → Data flywheels → Token efficiency → Compounding advantages → Sustainable growth Watch: youtu.be/a8psNW72l3Q #Endpoints2025 #AIEconomy
- 1/4 🧵 First Qwen model in MLPerf. 40M products daily. Real production data from Shopify's e-commerce infrastructure. Submit by Feb 13, 2026 👇 #MLPerf #Shopify #VLM #MLCommons
- 3/4 "...We're contributing these foundations to the ecosystem so the next generation of AI is ready to power commerce at global scale." — Kshetrajna Raghavan, Principal Engineer ML
- 4/4 Multimodal AI → $10.89B by 2030 🛒 Retail CAGR: 34.6% ⚡ Production VLM benchmark for 2026 Hardware vendors, cloud providers: prove your stack. ⏰ Feb 13 deadline → https://mlcommons.org/2026/02/vlm-inference-shopify/
- 2/4 "Open infrastructure is essential to the future of agentic commerce and Shopify is building the systems to power it, from our open-source Standard Product Taxonomy to the Catalog benchmark..." — Kshetrajna Raghavan, Principal Engineer ML
- 🎥 The Reliability Basket: What is Risk Coverage? Panel on measuring what can go wrong with AI systems. Beyond accuracy → comprehensive risk assessment. Watch: youtu.be/UgETTwnz6kY #Endpoints2025 #AIReliability
- Processing 40 million products daily with 78.24% accuracy on noisy, multilingual catalog data. Not a lab benchmark—Shopify's actual production reality. Submit your VLM stack by Feb 13 → mlcommons.org/2026/02/vlm-inferen… #AIBenchmark
- 🚀 NEW: MLPerf Inference v6.0 debuts Qwen3-VL + Shopify Product Catalog benchmark 40M products daily. Real production data. First Qwen model in MLPerf. Submit by Feb 13, 2026 → bit.ly/4k9F5YS #MLPerf #VLM #Shopify #MLCommons