| Record Type: |
Electronic resources
: Monograph/item
|
| Title/Author: |
Computer vision - ECCV 2024/ edited by Aleš Leonardis ... [et al.]. |
| Reminder of title: |
18th European Conference, Milan, Italy, September 29-October 4, 2024 : proceedings. |
| remainder title: |
ECCV 2024 |
| other author: |
Leonardis, Aleš. |
| corporate name: |
European Conference on Computer Vision |
| Published: |
Cham :Springer Nature Switzerland : : 2025., |
| Description: |
lxxxv, 486 p. :ill., digital ;24 cm. |
| [NT 15003449]: |
SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions -- InterFusion: Text-Driven Generation of 3D Human-Object Interaction -- GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval -- DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving -- Flow-Assisted Motion Learning Network for Weakly-Supervised Group Activity Recognition -- NeRF-XL: NeRF at Any Scale with Multi-GPU -- CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems -- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? -- Compositional Substitutivity of Visual Reasoning for Visual Question Answering -- LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models -- DNI: Dilutional Noise Initialization for Diffusion Video Editing -- Two-Stage Video Shadow Detection via Temporal-Spatial Adaption -- Towards Physical World Backdoor Attacks against Skeleton Action Recognition -- SAM-guided Graph Cut for 3D Instance Segmentation -- Fully Authentic Visual Question Answering Dataset from Online Communities -- Active Generation for Image Classification -- FuseTeacher: Modality-fused Encoders are Strong Vision Supervisors -- Learning Local Pattern Modularization for Point Cloud Reconstruction from Unseen Classes -- Understanding Multi-compositional learning in Vision and Language models via Category Theory -- FedRA: A Random Allocation Strategy for Federated Tuning to Unleash the Power of Heterogeneous Clients -- Panel-Specific Degradation Representation for Raw Under-Display Camera Image Restoration -- Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image -- Diffusion-Guided Weakly Supervised Semantic Segmentation -- Weakly-Supervised Spatio-Temporal Video Grounding with Variational Cross-Modal Alignment -- When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset -- NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image -- Segment and Recognize Anything at Any Granularity. |
| Contained By: |
Springer Nature eBook |
| Subject: |
Computer vision - Congresses. - |
| Online resource: |
https://doi.org/10.1007/978-3-031-73195-2 |
| ISBN: |
9783031731952 |