Context Navigation

ProjectWork

Timestamp:: Sep 21, 2020, 12:05:54 PM (5 years ago)
Author:: Ales Horak
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

en/ProgrammingRobotsCourse/ProjectWork

-                      v15
+                      v16
 . **Dialog about learning human faces:** see [http://doc.aldebaran.com/2-5/naoqi/peopleperception/alfacedetection.html ALFaceDetection] and [http://doc.aldebaran.com/2-5/naoqi/core/alusersession.html ALUserSession]
 . **Dialog about learning object images:** see [http://doc.aldebaran.com/2-5/naoqi/vision/alvisionrecognition.html ALVisionRecognition]
+. **Dialog about object recognition from cloud:** various [https://github.com/tensorflow/models/tree/master/research/object_detection models] for general object detection are available to install and run on a GPU-available computer, i.e. not on the robot. In this dialog, the robot sends [http://doc.aldebaran.com/2-5/naoqi/vision/alvideodevice.html image from camera] to a selected object recognition installed on a computer in local network and tells the results.
 . **Game 20 questions**:  see [https://en.wikipedia.org/wiki/Twenty_Questions Wikipedia], in Czech the game of "myslím si zvíře".
 . **Game Guess movie/song by sample:** play a short sample of a (Czech) movie or a (Czech) song and ask people to guess the name or interpreter
 …
 . **Dialog about robot animations:**  Pepper has about [http://doc.aldebaran.com/2-5/naoqi/motion/alanimationplayer-advanced.html#animationplayer-list-behaviors-pepper 400 animations installed] which express various situations, emotions etc. In the dialog, the user should be able to ask Pepper to show any of them. Possibly can include adding special animations such as offering a hand or standing in posture for a selfie.
 . **Pepper presentator**: robot obtains presentation slides with texts and presents them to the public. Robot accompanies the presentation with gestures and allows interruptions and continuation.
 . **Dialog with free recognition**: the standard [http://doc.aldebaran.com/2-5/naoqi/interaction/dialog/aldialog.html ALDialog] allows recognition of only predefined words and word combinations. [https://cloud.google.com/speech-to-text/ Google Speech API] (or an [https://www.g2.com/products/google-cloud-speech-to-text/competitors/alternatives alternative]) can recognize any text, but a) it is free only for short samples, b) it imposes a delay between sending the audio over network to the service. The task here is to combine these two approaches in one application seamlessly.
+. **Dialog with free speech recognition**: the standard [http://doc.aldebaran.com/2-5/naoqi/interaction/dialog/aldialog.html ALDialog] allows recognition of only predefined words and word combinations. [https://cloud.google.com/speech-to-text/ Google Speech API] (or an [https://www.g2.com/products/google-cloud-speech-to-text/competitors/alternatives alternative]) can recognize any text, but a) it is free only for short samples, b) it imposes a delay between sending the audio over network to the service. The task here is to combine these two approaches in one application seamlessly.
 . **Dialog about celebtrities**: robot should download information from [https://autority.nkp.cz/ Czech National Authorities] portal (accessible also as  [https://blog.wikimedia.cz/2020/01/773-571-osob-pod-licenci-cc-0-databaze-jmennych-autorit-narodni-knihovny-byla-integrovana-s-wikidaty/ linked data] at https://autority.wikimedia.cz), process the most frequent names and be able to discuss the information about the respective celebrities.