Posted 2026-02-03Updated 2026-04-16Review15 minutes read (About 2285 words)UniDiffuser论文链接 | GitHub#CVResearch-paperMulti-modalTransformerImage2TextDiffusionModelImgGen
Posted 2026-02-03Updated 2026-04-16Review18 minutes read (About 2709 words)Scalable Diffusion Models with TransformersScalable Diffusion Models with Transformers | ICCV 2023#CVResearch-paperTransformerDiffusionModelImgGenScalability
Posted 2026-02-02Updated 2026-04-16Review28 minutes read (About 4167 words)GR00T N1 An Open Foundation Model for Generalist Humanoid Robots论文链接 | NVIDIA, 2025#Research-paperMulti-modalVLMTransformerFoundationModelRoboticsDiffusionModelImitationLearningRobotLearningVLAHumanoidRobot
Posted 2025-03-11Updated 2026-04-16Reviewa few seconds read (About 3 words)PHYSCENE- Physically Interactable 3D Scene Synthesis for Embodied AI#Research-paperDiffusionModel3D-SceneEmbodied-AIScene-synthesisPhysical-Scene