This project was started in 2016 by Yannis M. Assael, Brendan Shillingford, Shimon Whiteson, and Nando de Freitas at Oxford University, in collaboration with Google DeepMind. Lip-reading is the task of decoding text from the movement of a speaker’s mouth. Traditional approaches separated the problem into two stages: designing or learning visual features, and prediction. More recent deep lip-reading approaches are end-to-end trainable (Wand et al., 2016; Chung & Zisserman, 2016a). However, existing work on models trained end-to-end performs only word classification, rather than sentence-level sequence prediction.