摘要【翻译】通用人工智能的花火——GPT-4的早期研究 Sparks of Arti cial General Intelligence:Early experiments with GPT-4

摘要

  • Sébastien Bubeck,  Varun Chandrasekaran, Ronen Eldan,   
  • Johannes Gehrke,
  • Eric Horvitz,  
  • Ece Kamar,
  • Peter Lee,
  • Yin Tat Lee,
  • Yuanzhi Li,
  • Scott Lundberg,
  • Harsha Nori,
  • Hamid Palangi,
  • Marco Tulio Ribeiro, 
  • Yi Zhang

March 2023

Download BibTex

Artificial intelligence (AI) researchers have been developing and refining large language models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. The latest model developed by OpenAI, GPT-4, was trained using an unprecedented scale of compute and data. In this paper, we report on our investigation of an early version of GPT-4, when it was still in active development by OpenAI. We contend that (this early version of) GPT-4 is part of a new cohort of LLMs (along with ChatGPT and Google’s PaLM for example) that exhibit more general intelligence than previous AI models.

人工智能(AI)研究人员一直在开发和完善大型语言模型(LLM),这些模型在各种领域和任务中表现出非凡的能力,挑战我们对学习和认知的理解。OpenAI 开发的最新模型 GPT-4 使用前所未有的计算和数据规模进行了训练。在本文中,我们报告了我们对 GPT-4 早期版本的调查,当时 OpenAI 仍在积极开发中。我们认为(这个早期版本)GPT-4是新的LLM队列的一部分(例如ChatGPT和Google的PaLM),它们比以前的AI模型表现出更多的通用智能。

We discuss the rising capabilities and implications of these models. We demonstrate that, beyond its mastery of language, GPT-4 can solve novel and difficult tasks that span mathematics, coding, vision, medicine, law, psychology and more, without needing any special prompting. Moreover, in all of these tasks, GPT-4’s performance is strikingly close to human-level performance, and often vastly surpasses prior models such as ChatGPT.

我们将讨论这些模型不断增强的功能和影响。我们证明,除了掌握语言之外,GPT-4 还可以解决跨越数学、编码、视觉、医学、法律、心理学等的新颖而困难的任务,而无需任何特殊提示。此外,在所有这些任务中,GPT-4的性能惊人地接近人类水平的性能,并且经常大大超过ChatGPT等以前的模型。

Given the breadth and depth of GPT-4’s capabilities, we believe that it could reasonably be viewed as an early (yet still incomplete) version of an artificial general intelligence (AGI) system. In our exploration of GPT-4, we put special emphasis on discovering its limitations, and we discuss the challenges ahead for advancing towards deeper and more comprehensive versions of AGI, including the possible need for pursuing a new paradigm that moves beyond next-word prediction. We conclude with reflections on societal influences of the recent technological leap and future research directions.

鉴于 GPT-4 功能的广度和深度,我们认为它可以合理地被视为通用人工智能 (AGI) 系统的早期(但仍然不完整)版本。在探索 GPT-4 时,我们特别强调发现其局限性,并讨论了迈向更深入、更全面的 AGI 版本所面临的挑战,包括可能需要追求超越下一个词预测的新范式。最后,我们反思了近期技术飞跃的社会影响和未来的研究方向。

你可能感兴趣的:(人工智能)