Posted 2026-02-03Updated 2026-03-01Review15 minutes read (About 2285 words)UniDiffuser论文链接 | GitHub#Research-paperCVMulti-modalTransformerImage2TextDiffusionModelImgGen
Posted 2026-02-03Updated 2026-03-01Review18 minutes read (About 2709 words)Scalable Diffusion Models with TransformersScalable Diffusion Models with Transformers | ICCV 2023#Research-paperCVTransformerDiffusionModelImgGenScalability
Posted 2026-02-02Updated 2026-03-01Review28 minutes read (About 4167 words)GR00T N1 An Open Foundation Model for Generalist Humanoid Robots论文链接 | NVIDIA, 2025#RoboticsResearch-paperMulti-modalVLMTransformerFoundationModelDiffusionModelVLARobotLearningImitationLearningHumanoidRobot
Posted 2025-03-11Updated 2026-03-01Reviewa few seconds read (About 3 words)PHYSCENE- Physically Interactable 3D Scene Synthesis for Embodied AI#Research-paperDiffusionModel3D-SceneEmbodied-AIScene-synthesisPhysical-Scene