Detection and Correction of Lip-Sync Errors Using Audio and Video Fingerprints
A method for measuring and maintaining time synchronization between an audio and video stream is described. Audio and video fingerprints are used to create a combined audio/video synchronization signature (A/V Sync Signature) at a known reference point. This signature is used at later points to measure audio/video timing synchronization relative to the reference point. This method may be used, for example, to automatically detect and correct audio/video synchronization (i.e. lip-sync) errors in broadcast systems and other applications. — Advantages of the method described over other existing methods include that it does not require modification of the audio or video signals, it can respond to dynamically changing synchronization errors, and it is designed to be robust to modifications of the audio/video signals. — While the system requires data to be conveyed to the detection point, this data does not need to be synchronized with, or directly attached to, the audio or video streams. As this method uses fingerprints it also enables other fingerprinting applications within systems, such as content identification and verification. In addition, it may be used to maintain synchronization of other metadata associated with audio/video streams.
- Published
- 2009-10
- Content type
- Original Research
- DOI
- 10.5594/M001302
- ISBN
- 978-1-61482-943-0