CARVIEW |
Select Language
HTTP/2 200
date: Tue, 22 Jul 2025 03:26:46 GMT
content-type: text/html; charset=utf-8
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
etag: W/"5aa002c9d30bf44a593110edf5f1080b"
cache-control: max-age=0, private, must-revalidate
strict-transport-security: max-age=31536000; includeSubdomains; preload
x-frame-options: deny
x-content-type-options: nosniff
x-xss-protection: 0
referrer-policy: no-referrer-when-downgrade
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
server: github.com
content-encoding: gzip
accept-ranges: bytes
set-cookie: _gh_sess=grxFMKiPA1rBoQ7ggP8957qfedjrENPViQ2pOirQWadJSoD4KVHjmlhcFjS0chaAraNRoX4XAkhltG%2FitCXZPbsr%2FH%2BYLnlj6N6kfub0LOoOAoK1fgnZuDoCtbDKsI3pTAz%2BAUVM9%2FajmrZi8jQerozBy03j8sAhbuTYDWnL9iVIwTxRTt6Df1MMfxtjdnkvDOi1VMB6YqhfW%2B%2F7VkXJwQv1emNU1n7aSKLV3w7IVDgmlQE5lG8C1OsIpYOc3GWT2HtxP9021L3wJEtVGRQq6Q%3D%3D--oWI5%2F5Fuuefi%2BW26--uExE8AjN9Fznhbe1eErXqg%3D%3D; Path=/; HttpOnly; Secure; SameSite=Lax
set-cookie: _octo=GH1.1.658334540.1753154806; Path=/; Domain=github.com; Expires=Wed, 22 Jul 2026 03:26:46 GMT; Secure; SameSite=Lax
set-cookie: logged_in=no; Path=/; Domain=github.com; Expires=Wed, 22 Jul 2026 03:26:46 GMT; HttpOnly; Secure; SameSite=Lax
x-github-request-id: 8C00:3044:D1634:12DBD9:687F04F6
Tags · PaddlePaddle/PaddleMIX · GitHub
Toggle v3.0.0-beta's commit message
Toggle v2.0.0's commit message
Skip to content
Navigation Menu
{{ message }}
-
Notifications
You must be signed in to change notification settings - Fork 217
Tags: PaddlePaddle/PaddleMIX
Tags
v3.0.0-beta
2025.05.09 发布PaddleMIX 3.0.0-beta 多模态理解 - 新增模型:Qwen2VL/Qwen2.5VL系列,DeepSeek-VL2, miniCPM-V 2.6, Janus系列,LLaVA-Critic, LLaVA-DenseConnector, LLaVA-OneVision, GOT-OCR2.0, mPLUG-Owl3 - PP系列模型:发布自研PP-DocBee文档理解多模态大模型,在学术界权威的英文文档理解评测榜单上达到同参数量级别模型SOTA - 工具链升级:完善高性能推理部署,新增支持Qwen2.5VL系列,A800推理性能较vllm领先11.5%。LLaVA、InternVL2模型训练和推理适配昇腾910B 多模态生成 - 新增模型:Open-MAGVIT2,文生视频模型CogVideoX, HunyuanVideo - PP系列模型:发布自研可控视频模型PP-VCtrl,支持在多种控制条件下的视频生成 - 工具链升级:发布ppdiffusers 0.29.1版本,新增对SD3 ControlNet和SD3.5的支持。SD3高性能推理性能打平TensorRT。SD3、SDXL模型LoRA训练和推理适配昇腾910B
v2.0.0
2.0(07/26/2024) 多模态理解 1. 新增模型:LLaVA: v1.5-7b, v1.5-13b, v1,6-7b,CogAgent, CogVLM, Qwen-VL, InternLM-XComposer2 2. 数据集增强:新增chatml_dataset图文对话数据读取方案,可自定义chat_template文件适配,支持混合数据集 3. 工具链升级:新增Auto模块,统一SFT训练流程,兼容全参数、lora训练。新增mixtoken训练策略,SFT吞吐量提升5.6倍。支持Qwen-VL,LLaVA推理部署,较torch推理性能提升2.38倍 多模态生成 1. 视频生成能力:支持Sora相关技术,支持DiT、SiT、UViT训练推理,新增NaViT、MAGVIT-v2模型; 新增视频生成模型SVD、Open Sora,支持模型微调和推理; 新增姿态可控视频生成模型AnimateAnyone、即插即用视频生成模型AnimateDiff、GIF视频生成模型Hotshot-XL; 2. 文生图模型库:新增高速推理文图生成模型LCM,适配SD/SDXL训练和推理; 3. 工具链升级:发布ppdiffusers 0.24.1版本,新增peft,accelerate后端; 权重加载/保存全面升级,支持分布式、模型切片、safetensors等场景。 4. 生态兼容:提供基于ppdiffusers开发的ComfyUI插件,支持了常见的模型加载转换、文生图、图生图、图像局部修改等任务。新增Stable Diffusion 1.5系列节点;新增Stable Diffusion XL系列节点。新增4个图像生成的workflow案例。 DataCopilot(多模态数据处理工具箱) 1. 多模态数据集类型MMDataset,支持加载和导出Json、H5、Jsonl等多种数据存储格式,内置并发(map, filter)数据处理接口等 2. 多模态数据格式工具,支持自定义数据结构,数据转换,离线格式检查 3. 多模态数据分析工具,支持基本的统计信息,数据可视化功能,以及注册自定义功能
You can’t perform that action at this time.