I did something last week I hadn’t done for quite some time: I spoke at a conference in front of a live audience. I started speaking at conferences in 2002 after I had created the SwiXml open source project: Graphical User Interfaces were described in XML documents, parsed at runtime, and rendered into Java objects – Android […]
speech recognition
A Universal Voice Browser

On September 25, Amazon released 80 new devices, some of which can be found here. Even for a company of its size, that is an impressive number and an even more impressive line-up. However, I think something much more profound happened a day earlier. On Tuesday, September 24, 2019, at 11:04 AM EDT, in a […]
Custom wakeup-words for an Android app

With most modern Android phones, just saying the phrase “OK Google” will launch the Google assistant app, which is capable of answering simple questions, or functioning as a app launcher. Following up with “open g mail” will launch the Gmail app on your phone, or saying “navigate home”, will open the Google Maps app, with […]
How to get conversational UI right

Cross-Post from my article on VentureBeat.com With the rise of AI, voice, and more generally language-driven technologies — like chatbots, Siri, and Amazon Echo — conversational user interfaces (CUI) have a chance of becoming the next major technology platform after mobile. The field of conversational UI holds a lot of promise in terms of how […]
SpeechTEK 2017, Washington, District Of Columbia

— The Einstein Memorial – National Academy of Sciences, 2101 Constitution Ave NW, Washington, DC 20418 “New uses of speech technologies are changing the way people interact with companies, devices, and each other. Speech frees users from keyboards and tiny screens and enables valuable, effective interactions in a variety of contexts.” Clearly focused, SpeechTek 2017 was intended […]
The Path to the CUI is Heavily Mined and Booby-Trapped

Cross-Post from my article in Chatbots Magazine The concept of the Conversational User Interface (CUI) is not really new. Wolfgang Wahlster of the German Research Center for AI, DFKI, wrote 12 years ago in his paper on Conversational User Interfaces: “Conversational user interfaces allow various natural communication modes like speech, gestures and facial expressions for […]
Conversational Interaction Conference

The CI-Conference is the successor of the Mobile Voice Conference, and like its predecessor, organized by Bill Meisel and AVIOS (Applied Voice Input Output Society). The two day conference (1/30-31) ran like clockwork at the Westin in San Jose, had a keynote, two keynote panels, and 26 sessions. What makes this conference unique, is how it balances academia and […]
Raspberry Pi – Translator

Recently, I described how to perform speech recognition on a Raspberry Pi, using the on device sphinxbase / pocketsphinx open source speech recognition toolkit. This approach works reasonably well, but with high accuracy, only for a relatively small dictionary of words. Like the article showed, pocketsphinx works great on a Raspberry Pi to do keyword […]
Mobile Voice Conference 2015

The Applied Voice Input Output Society (AVIOS) and TMA Associates organize the annual Mobile Voice Conference, which this year took place at the Sainte Claire Hotel in San Jose, California on April 20 and 21. The Mobile Voice Conference examines the current state of speech recognition, speech synthesis, and natural language understanding technology, what it […]
E*Trade Mobile – Voice Commands

ETrade provides a great mobile app experience on iPhone, iPad, Android, Windows Phone, and Blackberry. I think it’s almost expected that the feature-set provided by the dedicated native mobile applications are not quite the same. The Windows version and especially the one for Blackberry fall far behind what ETrade has to offer on Android and […]