Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Basic information

DOI: 10.48550/arxiv.2403.05530
Published: 2024
Journal: ArXiv
Impact factor: --
Corresponding author: O. Vinyals
CAS journal ranking:
Document type: --
Authors: Machel Reid;Nikolay Savinov;Denis Teplyashin;Dmitry Lepikhin;T. Lillicrap;Jean;Radu Soricut;Angeliki Lazaridou;Orhan Firat;Julian Schrittwieser;Ioannis Antonoglou;Rohan Anil;Sebastian Borgeaud;Andrew M. Dai;Katie Millican;Ethan Dyer;Mia Glaese;Thibault Sottiaux;Benjamin Lee;Fabio Viola;Malcolm Reynolds;Yuanzhong Xu;James Molloy;Jilin Chen;M. Isard;Paul Barham;Tom Hennigan;Ross McIlroy;Melvin Johnson;J. Schalkwyk;Eli Collins;Eliza Rutherford;Erica Moreira;Kareem W. Ayoub;Megha Goel;Clemens Meyer;Gregory Thornton;Zhen Yang;H. Michalewski;Zaheer Abbas;Nathan Schucher;Ankesh Anand;Richard Ives;James Keeling;Karel Lenc;Salem Haykal;Siamak Shakeri;Pranav Shyam;Aakanksha Chowdhery;Roman Ring;Stephen Spencer;Eren Sezener;Luke Vilnis;Oscar Chang;Nobuyuki Morioka;George Tucker;Ce Zheng;Oliver Woodman;Nithya Attaluri;Tomás Kociský;Evgenii Eltyshev;Xi Chen;Timothy Chung;Vittorio Selo;Siddhartha Brahma;Petko Georgiev;Ambrose Slone;Zhenkai Zhu;James Lottes;Siyuan Qiao;Ben Caine;Sebastian Riedel;Alex Tomala;Martin Chadwick;J Christopher Love;Peter Choy;Sid Mittal;N. Houlsby;Yunhao Tang;Matthew Lamm;Libin Bai;Qiao Zhang;Luheng He;Yong Cheng;Peter Humphreys;Yujia Li;Sergey Brin;Albin Cassirer;Ying;Lukás Zilka;Taylor Tobin;Kelvin Xu;Lev Proleev;Daniel Sohn;Alberto Magni;Lisa Anne Hendricks;Isabel Gao;Santiago Ontañón;Oskar Bunyan;Nathan Byrd;Abhanshu Sharma;Biao Zhang;Mario Pinto;Rishika Sinha;Harsh Mehta;Dawei Jia;Sergi Caelles;Albert Webson;Alex Morris;Becca Roelofs;Yifan Ding;Robin Strudel;Xuehan Xiong;Marvin Ritter;Mostafa Dehghani;Rahma Chaabouni;Abhijit Karmarkar;Guangda Lai;Fabian Mentzer;Bibo Xu;YaGuang Li;Yujing Zhang;T. Paine;Alex Goldin;Behnam Neyshabur;Kate Baumli;Anselm Levskaya;Michael Laskin;Wenhao Jia;Jack W. Rae;Kefan Xiao;Antoine He;Skye Giordano;Lakshman Yagati;Jean;Paul Natsev;Sanjay Ganapathy;Fangyu Liu;Danilo Martins;Nanxin Chen;Yunhan Xu;Megan Barnes;Rhys May;Arpi Vezer;Junhyuk Oh;Ken Franko;Sophie Bridgers;Ruizhe Zhao;Boxi Wu;Basil Mustafa;Sean Sechrist;Emilio Parisotto;Thanumalayan Sankaranarayana Pillai;Chris Larkin;Chenjie Gu;Christina Sorokin;M. Krikun;Alexey Guseynov;Jessica Landon;Romina Datta;A. Pritzel;Phoebe Thacker;Fan Yang;Kevin Hui;A.E. Hauth;Chih;David Barker;J. Mao;Sophia Austin;Hannah Sheahan;Parker Schuh;James Svensson;Rohan Jain;V. Ramasesh;Anton Briukhov;Da;Tamara von Glehn;Christina Butterfield;Priya Jhakra;Matt Wiethoff;Justin Frye;Jordan Grimstad;Beer Changpinyo;Charline Le Lan;Anna Bortsova;Yonghui Wu;P. Voigtlaender;Tara N. Sainath;Charlotte Smith;Will Hawkins;Kris Cao;James Besley;S. Srinivasan;Mark Omernick;Colin Gaffney;G. Surita;Ryan Burnell;Bogdan Damoc;Junwhan Ahn;Andrew Brock;Mantas Pajarskas;Anastasia Petrushkina;Seb Noury;Lorenzo Blanco;Kevin Swersky;Arun Ahuja;Thi Avrahami;Vedant Misra;Raoul de Liedekerke;Mariko Iinuma;A. Polozov;Sarah York;George van den Driessche;Paul Michel;Justin Chiu;Rory Blevins;Zach Gleicher;Adrià Recasens;Alban Rrustemi;E. Gribovskaya;Aurko Roy;Wiktor Gworek;Sébastien M. R. Arnold;Lisa Lee;James Lee;Marcello Maggioni;Enrique Piqueras;Kartikeya Badola;S. Vikram;Lucas Gonzalez;Anirudh Baddepudi;Evan Senter;J. Devlin;James Qin;Michael Azzam;Maja Trebacz;M. Polacek;Kashyap Krishnakumar;Shuo;Matthew Tung;Ivo Penchev;Rishabh Joshi;Kate Olszewska;Carrie Muir;Mateo Wirth;Ale Jakse Hartman;Joshua Newlan;S. Kashem;Vijay Bolina;Elahe Dabir;Joost R. van Amersfoort;Zafarali Ahmed;James Cobon;Aishwarya B Kamath;A. M. Hrafnkelsson;Le Hou;Ian Mackinnon;Alexandre Frechette;Eric Noland;Xiance Si;E. Taropa;Dong Li;Phil Crone;Anmol Gulati;Sébastien Cevey;Jonas Adler;Ada Ma;David Silver;Simon Tokumine;Richard Powell;Stephan Lee;Michael B. Chang;Samer Hassan;Diana Mincu;Antoine Yang;Nir Levine;Jenny Brennan;Mingqiu Wang;Sarah Hodkinson;Jeffrey Zhao;Josh Lipschultz;Aedan Pope;Michael B. Chang;Cheng Li;Laurent El Shafey;M. Paganini;Sholto Douglas;Bernd Bohnet;Fabio Pardo;Seth Odoom;Mihaela Rosca;Cicero Nogueira dos Santos;Kedar Soparkar;A. Guez;Tom Hudson;Steven Hansen;Chulayuth Asawaroengchai;Ravichandra Addanki;Tianhe Yu;Wojciech Stokowiec;Mina Khan;Justin Gilmer;Jaehoon Lee;Carrie Grimes Bostock;Keran Rong;Jonathan Caton;Pedram Pejman;Filip Pavetic;Geoff Brown;Vivek Sharma;Mario Lučić;Rajkumar Samuel;Josip Djolonga;Amol Mandhane;Lars Lowe Sjosund;Elena Buchatskaya;Elspeth White;Natalie Clay;Jiepu Jiang;Hyeontaek Lim;Ross Hemsley;Jane Labanowski;Nicola De Cao;David Steiner;Sayed Hadi Hashemi;Jacob Austin;Anita Gergely;Tim Blyth;Joe Stanton;K. Shivakumar;Aditya Siddhant;Anders Andreassen;Carlos L. Araya;Nikhil Sethi;Rakesh Shivanna;Steven Hand;Ankur Bapna;A. Khodaei;Antoine Miech;Garrett Tanzer;Andy Swing;S. Thakoor;Zhufeng Pan;Zachary Nado;Stephanie Winkler;Dian Yu;Mohammad Saleh;Lorenzo Maggiore;Iain Barr;Minh Giang;Thais Kagohara;Ivo Danihelka;Amit Marathe;Vladimir Feinberg;Mohamed Elhawaty;Nimesh Ghelani;Dan Horgan;Helen Miller;Lexi Walker;Richard Tanburn;Mukarram Tariq;Disha Shrivastava;Fei Xia;Chung;Zoe C. Ashwood;Khuslen Baatarsukh;Sina Samangooei;Fred Alcober;Axel Stjerngren;P. Komarek;Katerina Tsihlas;Anudhyan Boral;Ramona Comanescu;Jeremy Chen;Ruibo Liu;Dawn Bloxwich;Charlie Chen;Yanhua Sun;Fangxiaoyu Feng;M. Mauger;Xerxes Dotiwalla;V. Hellendoorn;Michael Sharman;Ivy Zheng;Krishna Haridasan;Gabriel Barth;Craig Swanson;Dominika Rogozińska;Alek Andreev;P. Rubenstein;Ruoxin Sang;Dan Hurt;Gamaleldin Elsayed;Ren;Dave Lacey;Anastasija Ilić;Yao Zhao;Lora Aroyo;Chimezie Iwuanyanwu;Vitaly Nikolaev;Balaji Lakshminarayanan;Sadegh Jazayeri;Raphael Lopez Kaufman;Mani Varadarajan;Chetan Tekur;Doug Fritz;Misha Khalman;David Reitter;Kingshuk Dasgupta;Shourya Sarcar;T. Ornduff;Javier Snaider;Fantine Huot;Johnson Jia;Rupert Kemp;Nejc Trdin;Anitha Vijayakumar;Lucy Kim;Christof Angermueller;Li Lao;Tianqi Liu;Haibin Zhang;David Engel;Somer Greene;Anais White;Jessica Austin;Lilly Taylor;Shereen Ashraf;Dangyi Liu;Maria Georgaki;Irene Cai;Yana Kulizhskaya;Sonam Goenka;Brennan Saeta;Kiran Vodrahalli;Christian Frank;D. Cesare;Brona Robenek;Harry Richardson;Mahmoud Alnahlawi;Christopher Yew;Priya Ponnapalli;M. Tagliasacchi;Alex Korchemniy;Yelin Kim;Dinghua Li;B. Rosgen;Kyle Levin;Jeremy Wiesner;Praseem Banzal;Praveen Srinivasan;Hongkun Yu;Çağlar Ünlü;David Reid;Zora Tung;D. Finchelstein;Ravin Kumar;A. Elisseeff;Jin Huang;Ming Zhang;Rui Zhu;Ricardo Aguilar;Mai Giménez;Jiawei Xia;Olivier Dousse;W. Gierke;S. Yeganeh;Damion Yates;Komal Jalan;Lu Li;Eri Latorre;Duc Dung Nguyen;Ken Durden;Praveen Kallakuri;Yaxin Liu;Matthew Johnson;Tomy Tsai;Alice Talbert;Jasmine Liu;Alexander Neitz;Chen Elkind;Marco Selvi;Mimi Jasarevic;Livio Baldini Soares;Albert Cui;Pidong Wang;Alek Wenjiao Wang;Xinyu Ye;Krystal Kallarackal;Lucia Loher;Hoi Lam;Josef Broder;D. Holtmann;Nina Martin;Bramandia Ramadhana;Daniel Toyama;Mrinal Shukla;Sujoy Basu;Abhi Mohan;Nicholas Fernando;Noah Fiedel;Kim Paterson;Hui Li;Ankush Garg;Jane Park;Donghyun Choi;Diane Wu;Sankalp Singh;Zhishuai Zhang;Amir Globerson;Lily Yu;John Carpenter;F. D. C. Quitry;Carey Radebaugh;Chu;Alex Tudor;Prakash Shroff;Drew Garmon;Dayou Du;Neera Vats;Han Lu;Shariq Iqbal;A. Yakubovich;Nilesh Tripuraneni;James Manyika;Haroon Qureshi;Nan Hua;Christel Ngani;Maria Abi Raad;Hannah Forbes;Anna Bulanova;Jeff Stanway;Mukund Sundararajan;Victor Ungureanu;Colton Bishop;Yunjie Li;Balaji Venkatraman;Bo Li;Chloe Thornton;Salvatore Scellato;Nishesh Gupta;Yicheng Wang;Ian Tenney;Xihui Wu;Ashish Shenoy;Gabriel Carvajal;Diana Gage Wright;Ben Bariach;Zhuyun Xiao;Peter Hawkins;Sid Dalmia;Clément Farabet;Pedro Valenzuela;Quan Yuan;Christoper A. Welty;Ananth Agarwal;Mianna Chen;Wooyeol Kim;Brice Hulse;Nandita Dukkipati;Adam Paszke;Andrew Bolt;Elnaz Davoodi;Kiam Choo;Jennifer Beattie;J. Prendki;Harsha Vashisht;Rebeca Santamaria;Luis C. Cobo;Jarek Wilkiewicz;David Madras;Ali Elqursh;Grant Uy;Kevin Ramirez;Matt Harvey;Tyler Liechty;H. Zen;Jeff Seibert;Clara Huiyi Hu;A. Ya. Khorlin;Maigo Le;A. Aharoni;Megan Li;Lily Wang;Sandeep Kumar;Alejandro Lince;Norman Casagrande;Jay Hoover;Dalia El Badawy;David Soergel;Denis Vnukov;Matt Miecnikowski;Jiří Šimša;Anna Koop;Praveen Kumar;Thibault Sellam;Daniel Vlasic;Samira Daruki;Nir Shabat;John Zhang;Guolong Su;Kalpesh Krishna;Jiageng Zhang;Jeremiah Liu;Yi Sun;Evan Palmer;Alireza Ghaffarkhah;Xi Xiong;Victor Cotruta;Michael Fink;Lucas Dixon;Ashwin Sreevatsa;Adrian Goedeckemeyer;Alek Dimitriev;Mohsen Jafari;Remi Crocker;Nicholas A Fitzgerald;Aviral Kumar;Sanjay Ghemawat;Ivan Philips;Frederick Liu;Yannie Liang;Rachel Sterneck;Alena Repina;Marcus Wu;Laura Knight;Marin Georgiev;Hyo Lee;Harry Askham;A. Chakladar;Annie Louis;C. Crous;Hardie Cate;Dessie Petrova;Michael Quinn;Denese Owusu;Achintya Singhal;Nan Wei;Solomon Kim;Damien Vincent;Milad Nasr;Christopher A. Choquette;Reiko Tojo;Shawn Lu;Diego de Las Casas;Yuchung Cheng;Tolga Bolukbasi;Katherine Lee;Saaber Fatehi;R. Ananthanarayanan;Miteyan Patel;C. Kaed;Jing Li;Jakub Sygnowski;S. Belle;Zhe Chen;Jaclyn Konzelmann;Siim Poder;Roopal Garg;Vinod Koverkathu;Adam Brown;Chris Dyer;Rosanne Liu;Azade Nova;Jun Xu;Slav Petrov;D. Hassabis;K. Kavukcuoglu;Jeffrey Dean;O. Vinyals
Research area: --
MeSH terms: --
Keywords: --

Abstract

In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content.
References (0)
Cited by (198)