東華大學圖書館 |

Computer vision - ECCV 2024 = 18th European Conference, Milan, Italy, September 29-October 4, 2024 : proceedings.. Part LVI /

紀錄類型:	書目-電子資源 : Monograph/item
正題名/作者:	Computer vision - ECCV 2024/ edited by Aleš Leonardis ... [et al.].
其他題名:	18th European Conference, Milan, Italy, September 29-October 4, 2024 : proceedings.
其他題名:	ECCV 2024
其他作者:	Leonardis, Aleš.
團體作者:	European Conference on Computer Vision
出版者:	Cham :Springer Nature Switzerland : : 2025.,
面頁冊數:	lxxxv, 499 p. :ill. (chiefly color), digital ;24 cm.
內容註:	HowToCaption: Prompting LLMs to Transform Video Annotations at Scale -- LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection -- Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction -- On Pretraining Data Diversity for Self-Supervised Learning -- Look Around and Learn: Self-Training Object Detection by Exploration -- Bayesian Self-Training for Semi-Supervised 3D Segmentation -- Motion and Structure from Event-based Normal Flow -- ParCo: Part-Coordinating Text-to-Motion Synthesis -- Learning to Complement and to Defer to Multiple Users -- Tiny Models are the Computational Saver for Large Models -- DragVideo: Interactive Drag-style Video Editing -- Multi-Sentence Grounding for Long-term Instructional Video -- Do Generalised Classifiers really work on Human Drawn Sketches? -- KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding -- Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360° -- MotionDirector: Motion Customization of Text-to-Video Diffusion Models -- Text2LiDAR: Text-guided LiDAR Point Clouds Generation via Equirectangular Transformer -- Enhanced Motion Forecasting with Visual Relation Reasoning -- Rate-Distortion-Cognition Controllable Versatile Neural Image Compression -- Temporal As a Plugin: Unsupervised Video Denoising with Pre-Trained Image Denoisers -- LiDAR-based All-weather 3D Object Detection via Prompting and Distilling 4D Radar -- MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models -- Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models -- Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer -- Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors -- Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation -- StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion.
Contained By:	Springer Nature eBook
標題:	Computer vision - Congresses. -
電子資源:	https://doi.org/10.1007/978-3-031-72992-8
ISBN:	9783031729928

Computer vision - ECCV 2024 = 18th European Conference, Milan, Italy, September 29-October 4, 2024 : proceedings.. Part LVI /
Computer vision - ECCV 202418th European Conference, Milan, Italy, September 29-October 4, 2024 : proceedings.Part LVI /[electronic resource] :ECCV 2024edited by Aleš Leonardis ... [et al.]. - Cham :Springer Nature Switzerland :2025. - lxxxv, 499 p. :ill. (chiefly color), digital ;24 cm. - Lecture notes in computer science,151141611-3349 ;. - Lecture notes in computer science ;15114..

HowToCaption: Prompting LLMs to Transform Video Annotations at Scale -- LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection -- Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction -- On Pretraining Data Diversity for Self-Supervised Learning -- Look Around and Learn: Self-Training Object Detection by Exploration -- Bayesian Self-Training for Semi-Supervised 3D Segmentation -- Motion and Structure from Event-based Normal Flow -- ParCo: Part-Coordinating Text-to-Motion Synthesis -- Learning to Complement and to Defer to Multiple Users -- Tiny Models are the Computational Saver for Large Models -- DragVideo: Interactive Drag-style Video Editing -- Multi-Sentence Grounding for Long-term Instructional Video -- Do Generalised Classifiers really work on Human Drawn Sketches? -- KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding -- Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360° -- MotionDirector: Motion Customization of Text-to-Video Diffusion Models -- Text2LiDAR: Text-guided LiDAR Point Clouds Generation via Equirectangular Transformer -- Enhanced Motion Forecasting with Visual Relation Reasoning -- Rate-Distortion-Cognition Controllable Versatile Neural Image Compression -- Temporal As a Plugin: Unsupervised Video Denoising with Pre-Trained Image Denoisers -- LiDAR-based All-weather 3D Object Detection via Prompting and Distilling 4D Radar -- MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models -- Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models -- Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer -- Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors -- Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation -- StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion.

The multi-volume set of LNCS books with volume numbers 15059 up to 15147 constitutes the refereed proceedings of the 18th European Conference on Computer Vision, ECCV 2024, held in Milan, Italy, during September 29-October 4, 2024. The 2387 papers presented in these proceedings were carefully reviewed and selected from a total of 8585 submissions. They deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; motion estimation.

ISBN: 9783031729928

Standard No.: 10.1007/978-3-031-72992-8doiSubjects--Topical Terms:

570734
Computer vision
--Congresses.

LC Class. No.: TA1634

Dewey Class. No.: 006.37

Computer vision - ECCV 2024 = 18th European Conference, Milan, Italy, September 29-October 4, 2024 : proceedings.. Part LVI /
LDR:03974nmm a2200349 a 4500 001 2407900
003 DE-He213
005 20241029115633.0
006 m d
007 cr nn 008maaau
008 260204s2025 sz s 0 eng d
020 $a 9783031729928 $q (electronic bk.)
020 $a 9783031729911 $q (paper)
024 7 $a 10.1007/978-3-031-72992-8 $2 doi
035 $a 978-3-031-72992-8
040 $a GP $c GP
041 0 $a eng
050 4 $a TA1634
072 7 $a UYT $2 bicssc
072 7 $a COM016000 $2 bisacsh
072 7 $a UYT $2 thema
082 0 4 $a 006.37 $2 23
090 $a TA1634 $b .E89 2024
111 2 $a European Conference on Computer Vision $n (18th : $d 2024 : $c Milan, Italy) $3 3733323
245 1 0 $a Computer vision - ECCV 2024 $h [electronic resource] : $b 18th European Conference, Milan, Italy, September 29-October 4, 2024 : proceedings. $n Part LVI / $c edited by Aleš Leonardis ... [et al.].
246 3 $a ECCV 2024
260 $a Cham : $b Springer Nature Switzerland : $b Imprint: Springer, $c 2025.
300 $a lxxxv, 499 p. : $b ill. (chiefly color), digital ; $c 24 cm.
490 1 $a Lecture notes in computer science, $x 1611-3349 ; $v 15114
505 0 $a HowToCaption: Prompting LLMs to Transform Video Annotations at Scale -- LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection -- Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction -- On Pretraining Data Diversity for Self-Supervised Learning -- Look Around and Learn: Self-Training Object Detection by Exploration -- Bayesian Self-Training for Semi-Supervised 3D Segmentation -- Motion and Structure from Event-based Normal Flow -- ParCo: Part-Coordinating Text-to-Motion Synthesis -- Learning to Complement and to Defer to Multiple Users -- Tiny Models are the Computational Saver for Large Models -- DragVideo: Interactive Drag-style Video Editing -- Multi-Sentence Grounding for Long-term Instructional Video -- Do Generalised Classifiers really work on Human Drawn Sketches? -- KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding -- Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360° -- MotionDirector: Motion Customization of Text-to-Video Diffusion Models -- Text2LiDAR: Text-guided LiDAR Point Clouds Generation via Equirectangular Transformer -- Enhanced Motion Forecasting with Visual Relation Reasoning -- Rate-Distortion-Cognition Controllable Versatile Neural Image Compression -- Temporal As a Plugin: Unsupervised Video Denoising with Pre-Trained Image Denoisers -- LiDAR-based All-weather 3D Object Detection via Prompting and Distilling 4D Radar -- MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models -- Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models -- Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer -- Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors -- Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation -- StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion.
520 $a The multi-volume set of LNCS books with volume numbers 15059 up to 15147 constitutes the refereed proceedings of the 18th European Conference on Computer Vision, ECCV 2024, held in Milan, Italy, during September 29-October 4, 2024. The 2387 papers presented in these proceedings were carefully reviewed and selected from a total of 8585 submissions. They deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; motion estimation.
650 0 $a Computer vision $x Congresses. $3 570734
650 0 $a Pattern recognition systems $v Congresses. $3 563039
650 1 4 $a Computer Imaging, Vision, Pattern Recognition and Graphics. $3 890871
650 2 4 $a Image Processing. $3 891209
650 2 4 $a Computer Communication Networks. $3 775497
650 2 4 $a Machine Learning. $3 3382522
650 2 4 $a Special Purpose and Application-Based Systems. $3 892492
650 2 4 $a User Interfaces and Human Computer Interaction. $3 892554
700 1 $a Leonardis, Aleš. $3 3733324
710 2 $a SpringerLink (Online service) $3 836513
773 0 $t Springer Nature eBook
830 0 $a Lecture notes in computer science ; $v 15114. $3 3780060
856 4 0 $u https://doi.org/10.1007/978-3-031-72992-8
950 $a Computer Science (SpringerNature-11645)