東華大學圖書館

Computational visual media = 13th International Conference, CVM 2025, Hong Kong SAR, China, April 19-21, 2025 : proceedings. Part III

Record Type:	Electronic resources : Monograph/item
Title/Author:	Computational visual media / edited by Piotr Didyk, Junhui Hou.
Remainder of title:	13th International Conference, CVM 2025, Hong Kong SAR, China, April 19-21, 2025 : proceedings.
Variant title:	CVM 2025
Other author:	Didyk, Piotr.
Corporate name:	CVM (Conference)
Published:	Singapore : Springer Nature Singapore : Imprint: Springer, 2025.
Description:	xvii, 458 p. : ill. (some col.), digital ; 24 cm.
Contents:	Image and Video Analysis -- DepthFisheye: Efficient Fine-Tuning of Depth Estimation Models for Fisheye Cameras -- DIMATrack: Dimension Aware Data Association for Multi-Object Tracking -- Efficient Transformer Network for Visible and Ultraviolet Object Tracking -- LightGR-Transformer: Light Grouped Residual Transformer for Multispectral Object Detection -- ADMMOA: Attribute-Driven Multimodal Optimization for Face Recognition Adversarial Attacks -- Training-Free Language-Guided Video Summarization via Multi-Grained Saliency Scoring -- Multimodal Learning Reinforced Label Denoising for Weakly-Supervised Audio-Visual Video Parsing -- Bridging the Modality Gap: Advancing Multimodal Human Pose Estimation with Modality-Adaptive Pose Estimator and Novel Benchmark Datasets -- Momentum-Based Uni-Modal Soft-Label Alignment and Multi-Modal Latent Projection Networks for Optimizing Image-Text Retrieval -- Multi-Granularity and Multi-Modal Prompt Learning for Person Re-Identification -- Local and Global Feature Cross-attention Multimodal Place Recognition -- IML-CMM - A Multimodal Sentiment Analysis Framework Integrating Intra-Modal Learning and Cross-Modal Mixup Enhancement -- Geometrical Processing -- MCFG with GUMAP: A Simple and Effective Clustering Framework on Grassmann Manifold -- Joint UMAP for Visualization of Time-Dependent Data -- Unsupervised Domain Adaptation on Point Cloud Classification via Imposing Structural Manifolds into Representation Space -- Applications -- Learning Adaptive Basis Fonts to Fuse Content Features for Few-shot Font Generation -- TaiCrowd: A High-Performance Simulation Framework for Massive Crowd -- Feature Disentanglement and Fusion Model for Multi-Source Domain Adaptation with Domain-Specific Features -- A Trademark Retrieval Method Based on Self-Supervised Learning -- Weaken Noisy Feature: Boosting Semi-Supervised Learning by Noise Estimation -- Multi-Dimension Full Scene Integrated Visual Emotion Analysis Network -- Gap-KD: Bridging the Significant Capacity Gap Between Teacher and Student Model.
Contained By:	Springer Nature eBook
Subject:	Image processing -- Congresses.
Online resource:	https://doi.org/10.1007/978-981-96-5815-2
ISBN:	9789819658152

Computational visual media [electronic resource] : 13th International Conference, CVM 2025, Hong Kong SAR, China, April 19-21, 2025 : proceedings. Part III / edited by Piotr Didyk, Junhui Hou. - Singapore : Springer Nature Singapore : Imprint: Springer, 2025. - xvii, 458 p. : ill. (some col.), digital ; 24 cm. - (Lecture notes in computer science, 1611-3349 ; 15665).


This book constitutes the refereed proceedings of CVM 2025, the 13th International Conference on Computational Visual Media, held in Hong Kong SAR, China, in April 2025. The 67 full papers were carefully reviewed and selected from 335 submissions. The papers are organized in topical sections as follows. Part I: Medical Image Analysis; Detection and Recognition; Image Enhancement and Generation; Vision Modeling in Complex Scenarios. Part II: 3D Geometry and Rendering; Generation and Editing; Image Processing and Optimization. Part III: Image and Video Analysis; Multimodal Learning; Geometrical Processing; Applications.


Standard No.: 10.1007/978-981-96-5815-2 (doi)

Subjects--Topical Terms: Image processing -- Congresses.

LC Class. No.: TA1637 / .C66 2025

Dewey Class. No.: 006.42

LDR 03849nmm a2200349 a 4500
001 2409819
003 DE-He213
005 20250426124718.0
006 m d
007 cr nn 008maaau
008 260204s2025 si s 0 eng d
020 $a 9789819658152 $q (electronic bk.)
020 $a 9789819658145 $q (paper)
024 7 $a 10.1007/978-981-96-5815-2 $2 doi
035 $a 978-981-96-5815-2
040 $a GP $c GP
041 0 $a eng
050 4 $a TA1637 $b .C66 2025
072 7 $a UYQV $2 bicssc
072 7 $a COM016000 $2 bisacsh
072 7 $a UYQV $2 thema
082 0 4 $a 006.42 $2 23
090 $a TA1637 $b .C993 2025
111 2 $a CVM (Conference) $n (13th : $d 2025 : $c Hong Kong, China) $3 3783288
245 1 0 $a Computational visual media $h [electronic resource] : $b 13th International Conference, CVM 2025, Hong Kong SAR, China, April 19-21, 2025 : proceedings. $n Part III / $c edited by Piotr Didyk, Junhui Hou.
246 3 $a CVM 2025
260 $a Singapore : $b Springer Nature Singapore : $b Imprint: Springer, $c 2025.
300 $a xvii, 458 p. : $b ill. (some col.), digital ; $c 24 cm.
490 1 $a Lecture notes in computer science, $x 1611-3349 ; $v 15665
505 0 $a Image and Video Analysis -- DepthFisheye: Efficient Fine-Tuning of Depth Estimation Models for Fisheye Cameras -- DIMATrack: Dimension Aware Data Association for Multi-Object Tracking -- Efficient Transformer Network for Visible and Ultraviolet Object Tracking -- LightGR-Transformer: Light Grouped Residual Transformer for Multispectral Object Detection -- ADMMOA: Attribute-Driven Multimodal Optimization for Face Recognition Adversarial Attacks -- Training-Free Language-Guided Video Summarization via Multi-Grained Saliency Scoring -- Multimodal Learning Reinforced Label Denoising for Weakly-Supervised Audio-Visual Video Parsing -- Bridging the Modality Gap: Advancing Multimodal Human Pose Estimation with Modality-Adaptive Pose Estimator and Novel Benchmark Datasets -- Momentum-Based Uni-Modal Soft-Label Alignment and Multi-Modal Latent Projection Networks for Optimizing Image-Text Retrieval -- Multi-Granularity and Multi-Modal Prompt Learning for Person Re-Identification -- Local and Global Feature Cross-attention Multimodal Place Recognition -- IML-CMM - A Multimodal Sentiment Analysis Framework Integrating Intra-Modal Learning and Cross-Modal Mixup Enhancement -- Geometrical Processing -- MCFG with GUMAP: A Simple and Effective Clustering Framework on Grassmann Manifold -- Joint UMAP for Visualization of Time-Dependent Data -- Unsupervised Domain Adaptation on Point Cloud Classification via Imposing Structural Manifolds into Representation Space -- Applications -- Learning Adaptive Basis Fonts to Fuse Content Features for Few-shot Font Generation -- TaiCrowd: A High-Performance Simulation Framework for Massive Crowd -- Feature Disentanglement and Fusion Model for Multi-Source Domain Adaptation with Domain-Specific Features -- A Trademark Retrieval Method Based on Self-Supervised Learning -- Weaken Noisy Feature: Boosting Semi-Supervised Learning by Noise Estimation -- Multi-Dimension Full Scene Integrated Visual Emotion Analysis Network -- Gap-KD: Bridging the Significant Capacity Gap Between Teacher and Student Model.
520 $a This book constitutes the refereed proceedings of CVM 2025, the 13th International Conference on Computational Visual Media, held in Hong Kong SAR, China, in April 2025. The 67 full papers were carefully reviewed and selected from 335 submissions. The papers are organized in topical sections as follows: Part I: Medical Image Analysis, Detection and Recognition, Image Enhancement and Generation, Vision Modeling in Complex Scenarios Part II: 3D Geometry and Rendering, Generation and Editing, Image Processing and Optimization Part III: Image and Video Analysis, Multimodal Learning, Geometrical Processing, Applications.
650 0 $a Image processing $x Congresses. $3 623655
650 0 $a Computer graphics $x Congresses. $3 659671
650 0 $a Computer vision $x Congresses. $3 570734
650 1 4 $a Computer Vision. $3 3538524
650 2 4 $a Automated Pattern Recognition. $3 3538549
650 2 4 $a Computer and Information Systems Applications. $3 3538505
650 2 4 $a Computer Graphics. $3 892532
650 2 4 $a Artificial Intelligence. $3 769149
650 2 4 $a Algorithms. $3 536374
700 1 $a Didyk, Piotr. $3 3783289
700 1 $a Hou, Junhui. $3 3783290
710 2 $a SpringerLink (Online service) $3 836513
773 0 $t Springer Nature eBook
830 0 $a Lecture notes in computer science ; $v 15665. $3 3783293
856 4 0 $u https://doi.org/10.1007/978-981-96-5815-2
950 $a Computer Science (SpringerNature-11645)