| Record type: |
Bibliographic - electronic resource
: Monograph/item
|
| Title proper/Author: |
Computer vision - ECCV 2024/ edited by Aleš Leonardis ... [et al.]. |
| Other title: |
18th European Conference, Milan, Italy, September 29-October 4, 2024 : proceedings. |
| Other title: |
ECCV 2024 |
| Other author: |
Leonardis, Aleš. |
| Corporate author: |
European Conference on Computer Vision |
| Publisher: |
Cham : Springer Nature Switzerland, 2024. |
| Physical description: |
lxxxv, 481 p. : ill. (some col.), digital ; 24 cm. |
| Contents note: |
CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion -- SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers -- Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM -- Forecasting Future Videos from Novel Views via Disentangled 3D Scene Representation -- GMM-IKRS: Gaussian Mixture Models for Interpretable Keypoint Refinement and Scoring -- Get Your Embedding Space in Order: Domain-Adaptive Regression for Forest Monitoring -- ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion -- CoDA: Instructive Chain-of-Domain Adaptation with Severity-Aware Visual Prompt Tuning -- Curved Diffusion: A Generative Model With Optical Geometry Control -- Mini-Splatting: Representing Scenes with a Constrained Number of Gaussians -- MeshSegmenter: Zero-Shot Mesh Segmentation via Texture Synthesis -- OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation -- Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular Structures -- Conceptual Codebook Learning for Vision-Language Models -- LingoQA: Video Question Answering for Autonomous Driving -- AnimateMe: 4D Facial Expressions via Diffusion Models -- HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning -- LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis -- PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors -- Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention -- iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning -- Context Diffusion: In-Context Aware Image Generation -- Pose Guided Fine-Grained Sign Language Video Generation -- RAP: Retrieval-Augmented Planner for Adaptive Procedure Planning in Instructional Videos -- Certifiably Robust Image Watermark -- Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery -- Online Zero-Shot Classification with CLIP. |
| Contained By: |
Springer Nature eBook |
| Subject: |
Computer vision - Congresses. |
| Electronic resource: |
https://doi.org/10.1007/978-3-031-72980-5 |
| ISBN: |
9783031729805 |