Google: 2025 Gemma 3 Technical Report (English edition) (25 pages).pdf


2025-03-12

Gemma 3 Technical Report

Gemma Team, Google DeepMind

We introduce Gemma 3, a multimodal addition to the Gemma family of lightweight open models, ranging in scale from 1 to 27 billion parameters. This version introduces vision understanding abilities, a wider coverage of languages and longer context, at least 128K tokens. We also change the architecture of the model to reduce the KV-cache memory that tends to explode with long context. This is achieved by increasing the ratio of local to global attention layers, and keeping the span on local attention short. The Gemma 3 models are trained with distillation and achieve superior performance to Gemma 2 for both pre-trained and instruction finetuned versions. In particular, our novel post-training recipe significantly improves the math, chat, instruction-following and multilingual abilities, making Gemma3-4B-IT competitive with Gemma2-27B-IT and Gemma3-27B-IT comparable to Gemini-1.5-Pro across benchmarks. We release all our models to the community.

1. Introduction

We present the newest version of Gemma open language models (Gemma Team, 2024a), co-designed with the family of Gemini frontier models (Gemini Team, 2023). This new version comes in sizes comparable to Gemma 2 (Gemma Team, 2024b), with the addition of a 1B model. These models are designed to run on standard consumer-grade hardware such as phones, laptops, and high-end GPUs. This version comes with several new abilities to the Gemma family; namely, multimodality, long context, and multilinguality, while preserving or surpassing the performance of prior versions.

In terms of multimodality, most Gemma 3 models are compatible with a tailored version of the SigLIP vision encoder (Zhai et al., 2023). The language models treat images as a sequence of soft tokens encoded by SigLIP. We reduce the inference cost of im…
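The abstract's claim about KV-cache memory can be made concrete with a rough back-of-the-envelope sketch. The idea is that a global attention layer must cache keys and values for every position in the context, while a local (sliding-window) layer only needs the most recent positions inside its span. The layer count, local-to-global ratio, window size, head count and head dimension below are hypothetical values chosen for illustration; none of them come from the excerpt above, and the function names are made up for this example.

```python
def kv_cache_positions(context_len, num_layers, local_per_global, window):
    """Count cached token positions summed over all layers.

    Assumes layers repeat in groups of `local_per_global` local layers
    followed by one global layer (an illustrative layout, not taken from
    the report excerpt). Leftover layers are treated as local.
    """
    group = local_per_global + 1
    num_global = num_layers // group
    num_local = num_layers - num_global
    # Global layers cache the full context; local layers cache at most `window`.
    return num_global * context_len + num_local * min(context_len, window)


def kv_cache_bytes(positions, num_kv_heads, head_dim, bytes_per_value=2):
    """Convert cached positions to bytes: one key and one value vector each."""
    return positions * 2 * num_kv_heads * head_dim * bytes_per_value


if __name__ == "__main__":
    # Hypothetical configuration, for illustration only.
    context_len, num_layers, window = 128 * 1024, 48, 1024
    num_kv_heads, head_dim = 8, 128

    for ratio in (0, 1, 5):  # local layers per global layer
        positions = kv_cache_positions(context_len, num_layers, ratio, window)
        gib = kv_cache_bytes(positions, num_kv_heads, head_dim) / 2**30
        print(f"{ratio}:1 local-to-global -> {gib:.2f} GiB")
```

With these assumed numbers, raising the share of local layers shrinks the cache because only the remaining global layers scale with the full context length, and a shorter window reduces the contribution of each local layer further.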
