🔥 Diffusion Model <-> Depth 🔥👉ETH & CMU on how to turn a single-image latent diffusion model (LDM) into the SOTA video depth estimator: video depth without video models. Repo released under Apache 2.0 and HF demo available💙👉Review https://t.ly/sP9ma👉Paper arxiv.org/pdf/2411.19189👉Project rollingdepth.github.io/👉Repo github.com/prs-eth/rollingdepth🤗Demo huggingface.co/spaces/prs-eth/rollingdepthhttps://t.ly/sP9ma

🔥 S3MOT: SOTA 3D MOT 🔥👉S3MOT: Selective-State-Space model-based MOT that efficiently infers 3D motion and object associations from 2D images through three core components. New SOTA on KITTI with 76.86 HOTA at 31 FPS! Code & Weights to be released under MIT license💙👉Review https://t.ly/H_JPv👉Paper https://arxiv.org/pdf/2504.18068👉Repo https://github.com/bytepioneerX/s3mot

🌼SOTA Textured 3D-Guided VTON🌼👉#ALIBABA unveils 3DV-TON, a novel diffusion model for HQ and temporally consistent video. Generating animatable textured 3D meshes as explicit frame-level guidance, alleviating the issue of models over-focusing on appearance fidelity at the expanse of motion coherence. Code & benchmark to be released💙👉Review https://t.ly/0tjdC👉Paper https://lnkd.in/dFseYSXz👉Project https://lnkd.in/djtqzrzs👉Repo TBA

📍Moving Points -> Depth📍👉KAIST & Adobe propose Seurat, a novel method that infers relative depth by examining the spatial relationships and temporal evolution of a set of tracked 2D trajectories (via off-the-shelf point tracking models). Repo & Demo to be released💙👉Review https://t.ly/qA2P5👉Paper https://lnkd.in/dpXDaQtM👉Project https://lnkd.in/d9qWYsjP👉Repo https://lnkd.in/dZEMDiJh

🦧 #Nvidia Describe Anything 🦧👉Nvidia unveils Describe Anything Model (DAM) the new SOTA in generating detailed descriptions for user-specified regions in images/videos, marked by points, boxes, scribbles, or masks. Repo under Apache, Dataset available and live demo on 🤗👉Review https://t.ly/la4JD👉Paper https://lnkd.in/dZh82xtV👉Project https://lnkd.in/dcv9V2ZF👉Repo https://lnkd.in/dJB9Ehtb🤗Demo https://lnkd.in/dXDb2MWU

🧊TAP in Persistent 3D Geometry🧊👉TAPIP3D is the novel SOTA for long-term 3D point tracking in mono-RGB/RGB-D. Videos as camera-stabilized spatio-temporal feature clouds, leveraging depth & motion to lift 2D video feats into a 3D world space where camera motion is effectively canceled. Code under Apache💙👉Review https://t.ly/oooMy👉Paper https://lnkd.in/d8uqjdE4👉Project https://tapip3d.github.io/👉Repo https://lnkd.in/dsvHP_8u

🌳MSVA Zero-Shot Multi-View🌳👉Niantic unveils MVSA, novel Multi-View Stereo Architecture to work anywhere by generalizing across diverse domains & depth ranges. Highly accurate & 3D-consistent depths. Code & models announced💙👉Review https://t.ly/LvuTh 👉Paper https://arxiv.org/pdf/2503.22430👉Project https://nianticlabs.github.io/mvsanywhere/👉Repo https://lnkd.in/ddQz9eps

🏓LATTE-MV: #3D Table Tennis🏓👉UC Berkeley unveils at #CVPR2025 a novel system for reconstructing monocular video of table tennis in 3D with uncertainty-aware controller that anticipates opponent actions. Code & Dataset announced, to be released💙👉Review https://t.ly/qPMOU👉Paper arxiv.org/pdf/2503.20936👉Project sastry-group.github.io/LATTE-MV/👉Repo github.com/sastry-group/LATTE-MV

About Slogin

Slogin is a product Joomline is based on the development of the same name the authorization component for Joomla. The service is designed to simplify configuring the authorization through social networks.