| Record Type: |
Electronic resources : Monograph/item |
| Title/Author: |
Computer vision - ECCV 2024 / edited by Aleš Leonardis ... [et al.]. |
| Remainder of title: |
18th European Conference, Milan, Italy, September 29-October 4, 2024 : proceedings. |
| Remainder title: |
ECCV 2024 |
| Other author: |
Leonardis, Aleš. |
| Corporate name: |
European Conference on Computer Vision |
| Published: |
Cham : Springer Nature Switzerland, 2025. |
| Description: |
lxxxv, 499 p. : ill. (chiefly color), digital ; 24 cm. |
| Contents: |
HowToCaption: Prompting LLMs to Transform Video Annotations at Scale -- LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection -- Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction -- On Pretraining Data Diversity for Self-Supervised Learning -- Look Around and Learn: Self-Training Object Detection by Exploration -- Bayesian Self-Training for Semi-Supervised 3D Segmentation -- Motion and Structure from Event-based Normal Flow -- ParCo: Part-Coordinating Text-to-Motion Synthesis -- Learning to Complement and to Defer to Multiple Users -- Tiny Models are the Computational Saver for Large Models -- DragVideo: Interactive Drag-style Video Editing -- Multi-Sentence Grounding for Long-term Instructional Video -- Do Generalised Classifiers really work on Human Drawn Sketches? -- KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding -- Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360° -- MotionDirector: Motion Customization of Text-to-Video Diffusion Models -- Text2LiDAR: Text-guided LiDAR Point Clouds Generation via Equirectangular Transformer -- Enhanced Motion Forecasting with Visual Relation Reasoning -- Rate-Distortion-Cognition Controllable Versatile Neural Image Compression -- Temporal As a Plugin: Unsupervised Video Denoising with Pre-Trained Image Denoisers -- LiDAR-based All-weather 3D Object Detection via Prompting and Distilling 4D Radar -- MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models -- Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models -- Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer -- Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors -- Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation -- 
StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion. |
| Contained By: |
Springer Nature eBook |
| Subject: |
Computer vision - Congresses. |
| Online resource: |
https://doi.org/10.1007/978-3-031-72992-8 |
| ISBN: |
9783031729928 |