Description | Performance of AI models on various benchmarks from 1998 to 2024.png | English: Figure 2. Performance of AI models on various benchmarks from 1998 to 2024, including computer vision (MNIST, ImageNet), speech recognition (Switchboard), natural language understanding (SQuAD 1.1, MMLU, GLUE), general language model evaluation (MMLU, Big-Bench, and GPQA), and mathematical reasoning (MATH). Many models surpass human-level performance (black solid line) by 2024, demonstrating significant advances in AI capabilities across different domains over the past two decades. Data for MNIST, Switchboard, ImageNet, SQuAD 1.1, 2 and GLUE are from (94). Data for MMLU, Big-Bench, and GPQA are from the relevant papers (95, 96, 97). |
Author | CHAIR: Prof. Yoshua Bengio, Université de Montréal / Mila - Quebec AI Institute. EXPERT ADVISORY PANEL: Prof. Bronwyn Fox, The Commonwealth Scientific and Industrial Research Organisation (CSIRO) (Australia); André Carlos Ponce de Leon Ferreira de Carvalho, Institute of Mathematics and Computer Sciences, University of São Paulo (Brazil); Dr. Mona Nemer, Chief Science Advisor of Canada (Canada); Raquel Pezoa Rivera, Federico Santa María Technical University (Chile); Dr. Yi Zeng, Institute of Automation, Chinese Academy of Sciences (China); Juha Heikkilä, DG Connect (European Union); Guillaume Avrin, General Directorate of Enterprises (France); Prof. Antonio Krüger, German Research Center for Artificial Intelligence (Germany); Prof. Balaraman Ravindran, Indian Institute of Technology, Madras (India); Prof. Hammam Riza, KORIKA (Indonesia); Dr. Ciarán Seoighe, Science Foundation Ireland (Ireland); Dr. Ziv Katzir, Israel Innovation Authority (Israel); Dr. Andrea Monti, University of Chieti-Pescara (Italy); Dr. Hiroaki Kitano, Sony Group (Japan); [Interim] Mary Kerema, Ministry of Information Communications Technology and Digital Economy (Kenya); Dr. José Ramón López Portillo, Q Element (Mexico); Prof. Haroon Sheikh, Netherlands’ Scientific Council for Government Policy (Netherlands); Dr. Gill Jolly, Ministry of Business, Innovation and Employment (New Zealand); Dr. Olubunmi Ajala, Innovation and Digital Economy (Nigeria); Dominic Ligot, CirroLytix (Philippines); Prof. Kyoung Mu Lee, Department of Electrical and Computer Engineering, Seoul National University (Republic of Korea); Ahmet Halit Hatip, Turkish Ministry of Industry and Technology (Republic of Turkey); Crystal Rugege, National Center for AI and Innovation Policy (Rwanda); Dr. Fahad Albalawi, Saudi Authority for Data and Artificial Intelligence (Kingdom of Saudi Arabia); Denise Wong, Data Innovation and Protection Group, Infocomm Media Development Authority (IMDA) (Singapore); Dr. Nuria Oliver, ELLIS Alicante (Spain); Dr. Christian Busch, Federal Department of Economic Affairs, Education and Research (Switzerland); Oleksii Molchanovskyi, Expert Committee on the Development of Artificial intelligence in Ukraine (Ukraine); Marwan Alserkal, Ministry of Cabinet Affairs, Prime Minister’s Office (United Arab Emirates); Saif M. Khan, U.S. Department of Commerce (United States); Dame Angela McLean, Government Chief Scientific Adviser (United Kingdom); Amandeep Gill, UN Tech Envoy (United Nations) |