Multimodal collaboration and human-computer interaction
Microsoft Research, Redmond, WA, USA
IEEE International Conference on Multimedia and Expo, 2009. ICME 2009
@conference{zhang2009multimodal,
title={Multimodal collaboration and human-computer interaction},
author={Zhang, Z.},
booktitle={IEEE International Conference on Multimedia and Expo (ICME 2009)},
pages={1596--1599},
issn={1945-7871},
year={2009},
organization={IEEE}
}
The research effort at Microsoft Research on multimodal collaboration and human-computer interaction aims at developing tools that allow people across geographically distributed sites to interact collaboratively with an immersive experience. Our prototype systems consist of cameras, displays, speakers, microphones, computer-controllable lights, and/or input devices such as touch-sensitive surfaces, styluses, keyboards, and mice. They require real-time processing of huge amounts of data, including foreground-background subtraction, region-of-interest extraction, color estimation and correction, speaker detection, stereo matching, and 3D reconstruction and rendering, not to mention audio and video encoding and decoding that may involve multiple microphones and cameras. Some of this processing is easily parallelized through general-purpose computation on graphics processing units (GPGPU) or on a multi-core machine, while other parts are less trivial. In this extended summary, the author describes two projects: visual echo cancellation in a shared tele-collaborative space, and a distributed meeting capture and broadcasting system. During the talk, the author will also present two recent projects: a personal telepresence station and situated interaction.
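To illustrate why a step like foreground-background subtraction parallelizes so easily, the sketch below classifies every pixel independently against a static background model. This is a minimal illustration under assumed simple per-pixel thresholding, not the actual pipeline described in the paper; the function name and threshold are hypothetical.

```python
import numpy as np

def foreground_mask(frame, background, threshold=30):
    """Return a boolean mask of foreground pixels.

    Each pixel is classified independently by comparing the current
    frame against a background model, which is why this step maps
    cleanly onto a GPU or a multi-core machine: every pixel is an
    independent work item.
    """
    # Widen to int16 so the subtraction cannot wrap around uint8.
    diff = np.abs(frame.astype(np.int16) - background.astype(np.int16))
    # A pixel is foreground if any channel deviates by more than the threshold.
    return np.any(diff > threshold, axis=-1)

# Example: a 4x4 RGB frame where a single pixel differs from the background.
background = np.zeros((4, 4, 3), dtype=np.uint8)
frame = background.copy()
frame[1, 2] = [200, 200, 200]
mask = foreground_mask(frame, background)
print(mask.sum())  # 1 foreground pixel detected
```

Steps such as stereo matching are harder to parallelize this way because each output depends on a neighborhood search rather than a single independent pixel.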
April 23, 2011 by hgpu