東華大學圖書館

Multimedia modeling = 31st International Conference on Multimedia Modeling, MMM 2025, Nara, Japan, January 8-10, 2025 : proceedings. Part V

Record Type:	Electronic resources : Monograph/item
Title/Author:	Multimedia modeling / edited by Ichiro Ide ... [et al.].
Remainder of title:	31st International Conference on Multimedia Modeling, MMM 2025, Nara, Japan, January 8-10, 2025 : proceedings.
Variant title:	MMM 2025
Other author:	Ide, Ichiro.
Corporate name:	International Multimedia Modelling Conference
Published:	Singapore : Springer Nature Singapore : Imprint: Springer, 2025.
Description:	xxi, 387 p. : ill., digital ; 24 cm.
Contents Note:	Special Session on Multimedia Research in Robotics -- Multimodal Engagement Prediction in Human-Robot Interaction using Transformer Neural Networks -- What Should Autonomous Robots Verbalize and What Should They Not? -- Special Session: SpIMA: Special Session on Spatial Intelligence in Multimedia Analytics -- Counting Unique Objects in Geo-Tagged Street Images: A Case Study Of Homeless Encampments in Los Angeles -- Special Session on Simulating Edge Computing and Multimodal AI: A Benchmark for Real-World Applications -- Correlation-Based Weighted Federated Learning with Multimodal Sensing and Knowledge Distillation: An Application on a Real-World Benchmark Dataset -- Leveraging Pruning, Quantization and Multi-Objective Optimization for an Efficient Deployment of Multi-modal Models -- Demo Papers -- A User Identification and Reading Style Detection System Based on Eye Movement Patterns During Reading -- AMDA: Advancing Multimedia Data Annotation for Human-centric Situations -- An Implementation of Networked JamSketch -- Badminton Footwork Practice via an Immersive Virtual Reality System -- Better Image Segmentation with Classification: Guiding Zero-Shot Models Using Class Activation Maps -- CleverFox: Integrating Visual Mnemonics with AI for Enhanced Language Learning -- Enhancing User Control in AI-Based Video Summarization for Social Media -- FencBuddy: Action-aware Depth Perception Training for Fencing Attacks -- Fingering Prediction for Classical Guitar: Dataset Creation and Model Development -- KuzushijiFontDiff: Diffusion Model for Japanese Kuzushiji Font Generation -- Leveraging Latent Diffusion in 3D Gaussian Splatting for Novel View Synthesis -- Movie Retrieval Systems Using Genre-guided Multimodal Learning Techniques -- Multi-Dimensional Exploration of Media Collection Metadata -- Multimodal Interoperability with the CLAMS Platform -- Real-time Visualizer for Turntablist Performance -- RoboDJ: Live Commentary Robots System Driven by Physical- and Cyber-world Observations -- SceneTextStyler: Editing Text with Style Transformation -- SelectSum: Topic-Based Selective Summarization of Speech-Based Videos -- Smart Driving Assistance with Real-time Risk Assessment and Personalized Driving Coaching to Enhance Road Safety -- System Demo of Modeling Smart University Campus Virtual Environments -- Training a Segmentation-based Visual Anonymization Service for Street Scenes -- Transformer-Based Audio Generation Conditioned by 2D Latent Maps: A Demonstration -- Using Language Models to Generate and Forget the Narrative Memories of an Assistive Robot -- WaveFontStyler: Font Style Transfer Based on Sound -- Video Browser Showdown -- diveXplore at the Video Browser Showdown 2025 -- Exquisitor at the Video Browser Showdown 2025: Unifying Conversational Search and User Relevance Feedback -- Feature-driven Video Segmentation and Advanced Querying with vitrivr-engine -- FUSIONISTA: Fusion of 3-D Information of Video in Retrieval System -- HORUS: Multimodal Large Language Models Framework for Video Retrieval at VBS 2025 -- IMSearch 2.0: Toward User-centric and Efficient Interactive Multimedia Retrieval System -- Interactive Video Search with Multi-modal LLM Video Captioning -- MediaMix: Multimedia Retrieval in Mixed Reality -- NII-UIT at VBS2025: Multimodal Video Retrieval with LLM Integration and Dynamic Temporal Search -- PraK Tool V3: Enhancing Video Item Search Using Localized Text and Texture Queries -- Simplified Video Retrieval in Virtual Reality with vitrivr-VR -- SnapSeek 2.0 at Video Browser Showdown 2025 -- VEAGLE: Eye Gaze-Assisted Guidance for Video Browser Showdown -- VERGE in VBS 2025 -- VideoEase at VBS2025: An Interactive Video Retrieval System -- ViewsInsight2.0: Enhancing Video Retrieval for VBS 2025 with an Automatic Query Generator Powered by Large Language Models -- ViFi: A Video Finding System at Video Browser Showdown 2025.
Contained By:	Springer Nature eBook
Subject:	Multimedia systems--Congresses.
Online resource:	https://doi.org/10.1007/978-981-96-2074-6
ISBN:	9789819620746

Multimedia modeling [electronic resource] : 31st International Conference on Multimedia Modeling, MMM 2025, Nara, Japan, January 8-10, 2025 : proceedings. Part V / edited by Ichiro Ide ... [et al.]. - Singapore : Springer Nature Singapore : Imprint: Springer, 2025. - xxi, 387 p. : ill., digital ; 24 cm. - (Lecture notes in computer science, 0302-9743 ; 15524).

This five-volume set LNCS 15520-15524 constitutes the proceedings of the 31st International Conference on Multimedia Modeling, MMM 2025, held in Nara, Japan, January 8-10, 2025. The 135 full papers and 41 short papers presented in these proceedings were carefully reviewed and selected from 348 submissions. The MMM conference was organized in topics related to multimedia modelling, particularly: audio, image, video processing, coding and compression; multimodal analysis for retrieval applications, and multimedia fusion methods.

ISBN: 9789819620746

Standard No.: 10.1007/978-981-96-2074-6 (doi)

Subjects--Topical Terms:
Multimedia systems--Congresses.

LC Class. No.: QA76.575 / .I58 2025

Dewey Class. No.: 006.7

LDR 05619nmm a2200349 a 4500
001 2408148
003 DE-He213
005 20250101115250.0
006 m o d
007 cr nn 008maaau
008 260204s2025 si s 0 eng d
020 $a 9789819620746 $q (electronic bk.)
020 $a 9789819620739 $q (paper)
024 7 $a 10.1007/978-981-96-2074-6 $2 doi
035 $a 978-981-96-2074-6
040 $a GP $c GP
041 0 $a eng
050 4 $a QA76.575 $b .I58 2025
072 7 $a UYQV $2 bicssc
072 7 $a COM016000 $2 bisacsh
072 7 $a UYQV $2 thema
082 0 4 $a 006.7 $2 23
090 $a QA76.575 $b .I61 2025
111 2 $a International Multimedia Modelling Conference $n (31st : $d 2025 : $c Nara-Shi, Japan) $3 3780421
245 1 0 $a Multimedia modeling $h [electronic resource] : $b 31st International Conference on Multimedia Modeling, MMM 2025, Nara, Japan, January 8-10, 2025 : proceedings. $n Part V / $c edited by Ichiro Ide ... [et al.].
246 3 $a MMM 2025
260 $a Singapore : $b Springer Nature Singapore : $b Imprint: Springer, $c 2025.
300 $a xxi, 387 p. : $b ill., digital ; $c 24 cm.
490 1 $a Lecture notes in computer science, $x 0302-9743 ; $v 15524
505 0 $a Special Session on Multimedia Research in Robotics -- Multimodal Engagement Prediction in Human-Robot Interaction using Transformer Neural Networks -- What Should Autonomous Robots Verbalize and What Should They Not? -- Special Session: SpIMA: Special Session on Spatial Intelligence in Multimedia Analytics -- Counting Unique Objects in Geo-Tagged Street Images: A Case Study Of Homeless Encampments in Los Angeles -- Special Session on Simulating Edge Computing and Multimodal AI: A Benchmark for Real-World Applications -- Correlation-Based Weighted Federated Learning with Multimodal Sensing and Knowledge Distillation: An Application on a Real-World Benchmark Dataset -- Leveraging Pruning, Quantization and Multi-Objective Optimization for an Efficient Deployment of Multi-modal Models -- Demo Papers -- A User Identification and Reading Style Detection System Based on Eye Movement Patterns During Reading -- AMDA: Advancing Multimedia Data Annotation for Human-centric Situations -- An Implementation of Networked JamSketch -- Badminton Footwork Practice via an Immersive Virtual Reality System -- Better Image Segmentation with Classification: Guiding Zero-Shot Models Using Class Activation Maps -- CleverFox: Integrating Visual Mnemonics with AI for Enhanced Language Learning -- Enhancing User Control in AI-Based Video Summarization for Social Media -- FencBuddy: Action-aware Depth Perception Training for Fencing Attacks -- Fingering Prediction for Classical Guitar: Dataset Creation and Model Development -- KuzushijiFontDiff: Diffusion Model for Japanese Kuzushiji Font Generation -- Leveraging Latent Diffusion in 3D Gaussian Splatting for Novel View Synthesis -- Movie Retrieval Systems Using Genre-guided Multimodal Learning Techniques -- Multi-Dimensional Exploration of Media Collection Metadata -- Multimodal Interoperability with the CLAMS Platform -- Real-time Visualizer for Turntablist Performance -- RoboDJ: Live Commentary Robots System Driven by Physical- and 
Cyber-world Observations -- SceneTextStyler: Editing Text with Style Transformation -- SelectSum: Topic-Based Selective Summarization of Speech-Based Videos -- Smart Driving Assistance with Real-time Risk Assessment and Personalized Driving Coaching to Enhance Road Safety -- System Demo of Modeling Smart University Campus Virtual Environments -- Training a Segmentation-based Visual Anonymization Service for Street Scenes -- Transformer-Based Audio Generation Conditioned by 2D Latent Maps: A Demonstration -- Using Language Models to Generate and Forget the Narrative Memories of an Assistive Robot -- WaveFontStyler: Font Style Transfer Based on Sound -- Video Browser Showdown -- diveXplore at the Video Browser Showdown 2025 -- Exquisitor at the Video Browser Showdown 2025: Unifying Conversational Search and User Relevance Feedback -- Feature-driven Video Segmentation and Advanced Querying with vitrivr-engine -- FUSIONISTA: Fusion of 3-D Information of Video in Retrieval System -- HORUS: Multimodal Large Language Models Framework for Video Retrieval at VBS 2025 -- IMSearch 2.0: Toward User-centric and Efficient Interactive Multimedia Retrieval System -- Interactive Video Search with Multi-modal LLM Video Captioning -- MediaMix: Multimedia Retrieval in Mixed Reality -- NII-UIT at VBS2025: Multimodal Video Retrieval with LLM Integration and Dynamic Temporal Search -- PraK Tool V3: Enhancing Video Item Search Using Localized Text and Texture Queries -- Simplified Video Retrieval in Virtual Reality with vitrivr-VR -- SnapSeek 2.0 at Video Browser Showdown 2025 -- VEAGLE: Eye Gaze-Assisted Guidance for Video Browser Showdown -- VERGE in VBS 2025 -- VideoEase at VBS2025: An Interactive Video Retrieval System -- ViewsInsight2.0: Enhancing Video Retrieval for VBS 2025 with an Automatic Query Generator Powered by Large Language Models -- ViFi: A Video Finding System at Video Browser Showdown 2025.
520 $a This five-volume set LNCS 15520-15524 constitutes the proceedings of the 31st International Conference on Multimedia Modeling, MMM 2025, held in Nara, Japan, January 8-10, 2025. The 135 full papers and 41 short papers presented in these proceedings were carefully reviewed and selected from 348 submissions. The MMM conference was organized in topics related to multimedia modelling, particularly: audio, image, video processing, coding and compression; multimodal analysis for retrieval applications, and multimedia fusion methods.
650 0 $a Multimedia systems $v Congresses. $3 622937
650 0 $a Computer graphics $x Congresses. $3 659671
650 1 4 $a Computer Vision. $3 3538524
650 2 4 $a Computer Imaging, Vision, Pattern Recognition and Graphics. $3 890871
650 2 4 $a Signal, Speech and Image Processing. $3 3592727
650 2 4 $a Automated Pattern Recognition. $3 3538549
650 2 4 $a Computer and Information Systems Applications. $3 3538505
650 2 4 $a Information Storage and Retrieval. $3 761906
700 1 $a Ide, Ichiro. $3 3495429
710 2 $a SpringerLink (Online service) $3 836513
773 0 $t Springer Nature eBook
830 0 $a Lecture notes in computer science ; $v 15524. $3 3780436
856 4 0 $u https://doi.org/10.1007/978-981-96-2074-6
950 $a Computer Science (SpringerNature-11645)