Deep Photo Rally: Let’s Gather Conversational Pictures
Abstract
In this paper, we propose an anthropomorphic approach to generate speech sentences of a specific object according to surrounding circumstances using the recent Deep Neural Networks technology. In the proposal approach, the user can have pseudo communication with the object by photographing the object with a mobile terminal. We introduce some examples of application of the proposal approach to entertainment products, and show that this is an anthropomorphic approach capable of interacting with the environment.
Domains
Computer Science [cs]Origin | Files produced by the author(s) |
---|
Loading...