Google: 2025 Gemma 3 Technical Report (English version) (25 pages).pdf


2025-03-12

Gemma 3 Technical Report

Gemma Team, Google DeepMind

We introduce Gemma 3, a multimodal addition to the Gemma family of lightweight open models, ranging in scale from 1 to 27 billion parameters. This version introduces vision understanding abilities, a wider coverage of languages and longer context (at least 128K tokens). We also change the architecture of the model to reduce the KV-cache memory that tends to explode with long context. This is achieved by increasing the ratio of local to global attention layers, and keeping the span on local attention short. The Gemma 3 models are trained with distillation and achieve superior performance to Gemma 2 for both pre-trained and instruction finetuned versions. In particular, our novel post-training recipe significantly improves the math, chat, instruction-following and multilingual abilities, making Gemma3-4B-IT competitive with Gemma2-27B-IT and Gemma3-27B-IT comparable to Gemini-1.5-Pro across benchmarks. We release all our models to the community.

1. Introduction

We present the newest version of Gemma open language models (Gemma Team, 2024a), co-designed with the family of Gemini frontier models (Gemini Team, 2023). This new version comes in sizes comparable to Gemma 2 (Gemma Team, 2024b), with the addition of a 1B model. These models are designed to run on standard consumer-grade hardware such as phones, laptops, and high-end GPUs. This version comes with several new abilities to the Gemma family; namely, multimodality, long context, and multilinguality, while preserving or surpassing the performance of prior versions.

In terms of multimodality, most Gemma 3 models are compatible with a tailored version of the SigLIP vision encoder (Zhai et al., 2023). The language models treat images as a sequence of soft tokens encoded by SigLIP. We reduce the inference cost of im
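The abstract's point about KV-cache memory can be illustrated with a back-of-the-envelope estimate. The sketch below assumes a simple cost model in which a global attention layer caches keys and values for every token in the context, while a local (sliding-window) layer caches only the most recent `window` tokens. The function name and every numeric value (layer count, KV heads, head dimension, window size, local-to-global ratio) are hypothetical and chosen for illustration; none of them are taken from this excerpt.

```python
def kv_cache_bytes(context_len, num_layers, local_per_global, window,
                   num_kv_heads, head_dim, bytes_per_value=2):
    """Rough KV-cache size in bytes for one sequence (illustrative model only).

    local_per_global = 0 means every layer uses global attention.
    """
    # Assume layers repeat in blocks of `local_per_global` local layers
    # followed by one global layer.
    num_global = num_layers // (local_per_global + 1)
    num_local = num_layers - num_global

    # Keys and values: 2 tensors per layer, each num_kv_heads * head_dim per token.
    per_token_per_layer = 2 * num_kv_heads * head_dim * bytes_per_value

    global_tokens = context_len               # global layers keep the full context
    local_tokens = min(context_len, window)   # local layers keep only the window

    cached_tokens = num_global * global_tokens + num_local * local_tokens
    return per_token_per_layer * cached_tokens


# Hypothetical configuration, not the published Gemma 3 settings.
cfg = dict(context_len=131072, num_layers=48, window=1024,
           num_kv_heads=8, head_dim=128)

all_global = kv_cache_bytes(local_per_global=0, **cfg)
interleaved = kv_cache_bytes(local_per_global=5, **cfg)

print(f"all global : {all_global / 2**30:.2f} GiB")
print(f"interleaved: {interleaved / 2**30:.2f} GiB")
```

Under this model the cache of the global layers grows linearly with context length while the cache of the local layers is capped by the window, so raising the share of local layers and keeping their span short both shrink the long-context footprint.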
