HOME

TheInfoList



OR:

Free viewpoint television (FTV) is a system for viewing natural video, allowing the user to interactively control the viewpoint and generate new views of a dynamic scene from any 3D position. The equivalent system for computer-simulated video is known as
virtual reality Virtual reality (VR) is a Simulation, simulated experience that employs 3D near-eye displays and pose tracking to give the user an immersive feel of a virtual world. Applications of virtual reality include entertainment (particularly video gam ...
. With FTV, the focus of attention can be controlled by the viewers rather than a director, meaning that each viewer may be observing a unique viewpoint. It remains to be seen how FTV will affect television watching as a group activity.


History

Systems for rendering arbitrary views of natural scenes have been well known in the
computer vision Computer vision tasks include methods for image sensor, acquiring, Image processing, processing, Image analysis, analyzing, and understanding digital images, and extraction of high-dimensional data from the real world in order to produce numerical ...
community for a long time but only in recent years has the speed and quality reached levels that are suitable for serious consideration as an end-user system. Professor Masayuki Tanimoto from
Nagoya University , abbreviated to or NU, is a Japanese national research university located in Chikusa-ku, Nagoya. It was established in 1939 as the last of the nine Imperial Universities in the then Empire of Japan, and is now a Designated National Universit ...
(Japan) has done much to promote the use of the term "free viewpoint television" and has published many papers on the ray space representation, although other techniques can be, and are used for FTV. QuickTime VR might be considered a predecessor to FTV.


Capture and display

In order to acquire the views necessary to allow a high-quality rendering of the scene from any angle, several cameras are placed around the scene; either in a studio environment or an outdoor venue, such as a sporting arena for example. The output Multiview Video (MVV) must then be packaged suitably so that the data may be compressed and also so that the users' viewing device may easily access the relevant views to interpolate new views. It is not enough to simply place cameras around the scene to be captured. The geometry of the camera set up must be measured by a process known in computer vision as "camera calibration." Manual alignment would be too cumbersome so typically a "best effort" alignment is performed prior to capturing a test pattern that is used to generate calibration parameters. Restricted free viewpoint television views for large environments can be captured from a single location camera system mounted on a moving platform. Depth data must also be captured, which is necessary to generate the free viewpoint. The Google Street View capture system is an example with limited functionality. The first full commercial implementation, iFlex, was delivered in 2009 by Real Time Race. Multiview video capture varies from partial (usually about 30 degrees) to complete (360 degrees) coverage of the scene. Therefore, it is possible to output stereoscopic views suitable for viewing with a 3D display or other 3D methods. Systems with more physical cameras can capture images with more coverage of the viewable scene, however, it is likely that certain regions will always be occluded from any viewpoint. A larger number of cameras should make it possible to obtain high quality output because less interpolation is needed. More cameras mean that efficient coding of the Multiview Video is required. This may not be such a big disadvantage as there are representations that can remove the redundancy in MVV; such as inter view coding using
MPEG-4 MPEG-4 is a group of international standards for the compression of digital audio and visual data, multimedia systems, and file storage formats. It was originally introduced in late 1998 as a group of audio and video coding formats and related ...
or Multiview Video Coding, the ray space representation, geometry videos, etc. In terms of hardware, the user requires a viewing device that can decode MVV and synthesize new viewpoints, and a 2D or 3D display.


Standardization

The
Moving Picture Experts Group The Moving Picture Experts Group (MPEG) is an alliance of working groups established jointly by ISO and IEC that sets standards for media coding, including compression coding of audio, video, graphics, and genomic data; and transmission and ...
(MPEG) has normalized Annex H of MPEG-4 AVC in March 2009 called Multiview Video Coding after the work of a group called '3DAV' (3D Audio and Visual) headed by Aljoscha Smolic at the Heinrich-Hertz Institute.


See also

* 3D reconstruction *
Rendering (computer graphics) Rendering is the process of generating a physically-based rendering, photorealistic or Non-photorealistic rendering, non-photorealistic image from input data such as 3D models. The word "rendering" (in one of its senses) originally meant the ...
* Volumetric video


References


Bibliography

*


External links


Canon announced development of a Free Viewpoint TV system
on 21 September 2017, to be showcased at Inter BEE 2017.
iview
is a British DTI project between
BBC The British Broadcasting Corporation (BBC) is a British public service broadcaster headquartered at Broadcasting House in London, England. Originally established in 1922 as the British Broadcasting Company, it evolved into its current sta ...
, Snell & Wilcox and
University of Surrey The University of Surrey is a public research university in Guildford, Surrey, England. The university received its Royal Charter, royal charter in 1966, along with a Plate glass university, number of other institutions following recommendations ...
to develop an FTV system.
Eye Vision
is a system developed by Professor Takeo Kanade at CMU for CBS's coverage of Super Bowl XXXV. The user is not able to change viewpoint but the camera operator is able to choose any virtual viewpoint by synthesizing images from an active vision system. * created the first-ever live on-air 3D reconstruction during the London 2012 Olympic Games; their website now seems to point t
Intel freeD 360 Replay
{{Computer vision Year of introduction missing Television technology Applications of computer vision 3D imaging Motion capture