We present Imagen, a text-to-image diffusion model with an unprecedented degree of photorealism and a deep level of language understanding. Imagen builds on the power of large transformer language models in understanding text and hinges on the strength of diffusion models in high-fidelity image generation. Our key discovery is that generic large language models (e.g. T5), pretrained on text-only corpora, are surprisingly effective at encoding text for image synthesis: increasing the size of the language model in Imagen boosts both sample fidelity and image-text alignment much more than increasing the size of the image diffusion model. Imagen achieves a new state-of-the-art FID score of 7.27 on the COCO dataset, without ever training on COCO, and human raters find Imagen samples to be on par with the COCO data itself in image-text alignment. To assess text-to-image models in greater depth, we introduce DrawBench, a comprehensive and challenging benchmark for text-to-image models. With DrawBench, we compare Imagen with recent methods including VQ-GAN+CLIP, Latent Diffusion Models, and DALL-E 2, and find that human raters prefer Imagen over other models in side-by-side comparisons, both in terms of sample quality and image-text alignment.