Inspire and innovate, emphasizing voice user interfaces, speech recognition and synthesis, NLU, and AIML.
I focus on embedded, mobile, and open source technologies and help accelerate the discovery and adoption of emerging mobile technologies.
I created the Java-based open source XUL Engine SwixML, which Sun’s CTO called “The strongest straightforward design of declarative UI implementations”.
SwixML represents ideas that today are heavily reused in Google’s Android SDK. (Graphical user interfaces are described declaratively in XML documents that are parsed and rendered into UI widget hierarchies at runtime.)
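To make the idea of a declarative UI concrete, here is a minimal sketch in plain Java and Swing. It is not SwixML's actual API or tag set; the three tags (panel, label, button), the text attribute, and the class name DeclarativeUiSketch are assumptions made up for this illustration. It only shows the general principle: an XML document is parsed at runtime and each element is turned into a widget, with nested elements becoming child widgets.

```java
import java.io.StringReader;
import javax.swing.JButton;
import javax.swing.JComponent;
import javax.swing.JLabel;
import javax.swing.JPanel;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class DeclarativeUiSketch {

    // Maps one XML element to one Swing widget, then recurses into child elements.
    // The tag vocabulary (panel, label, button) is hypothetical, not SwixML's.
    static JComponent build(Element element) {
        JComponent widget;
        switch (element.getTagName()) {
            case "panel":
                widget = new JPanel();
                break;
            case "label":
                widget = new JLabel(element.getAttribute("text"));
                break;
            case "button":
                widget = new JButton(element.getAttribute("text"));
                break;
            default:
                throw new IllegalArgumentException("Unknown tag: " + element.getTagName());
        }
        NodeList children = element.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            Node child = children.item(i);
            if (child instanceof Element) { // skip text and comment nodes
                widget.add(build((Element) child));
            }
        }
        return widget;
    }

    // Parses the XML description and returns the root of the widget hierarchy.
    static JComponent parse(String xml) throws Exception {
        Element root = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)))
                .getDocumentElement();
        return build(root);
    }

    public static void main(String[] args) throws Exception {
        String xml = "<panel><label text=\"Name\"/><button text=\"OK\"/></panel>";
        JComponent root = parse(xml);
        System.out.println(root.getClass().getSimpleName()
                + " with " + root.getComponentCount() + " children");
    }
}
```

A real engine would of course also handle layout managers, attributes beyond text, and event wiring; the sketch leaves all of that out on purpose.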
But I have created much more software that I’m extremely proud of.
A lot of my work revolves around early technology prototyping. Still, I’m trying to put some ideas into real-world mobile applications.
Take a look at Artist on Android, the Horsemen of Speech Recognition, or other apps that I have published under the Techcasita Productions brand in Google’s Play Store.
I have been appointed to the advisory committee for the Mobile App Development Certificate at the University of California, Irvine, and occasionally speak at conferences and user groups on topics ranging from embedded technology to declarative programming, with an emphasis on UI generation at runtime and, of course, everything related to voice user interfaces.
Have a look at some slides from my most recent talks.
Many new concepts that I implement in mobile applications are communicated best through video clips or short films, and I’m not talking about simple screen grabs.
Take a look at some high-quality short HD films that I have created over the last few months and years.
Amateur professionalism, a concept used since 2004, describes an emerging sociological and economic trend of people pursuing amateur activities to professional standards. Well … that pretty much describes how I look at my photography work today. If you like, take a look at some of my photos and the stories behind them at http://ramonaphoto.com
The main idea for this blog post was to use the Mac for quickly developing and deploying a web app in Java. I’m using RESTEasy on top of Tomcat as my server platform and IntelliJ IDEA as my preferred IDE. Once the web app has been built and tested locally, IDEA is also used for archiving the web app and remotely deploying it to a Linux box.
RESTEasy is used to build a simple wrapper that exposes the implemented functionality as a web service without having to modify the core implementation, keeping it independent of all web-service-related resources.
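The wrapper idea can be sketched roughly as follows. To keep the example self-contained, it uses the HTTP server that ships with the JDK (com.sun.net.httpserver) instead of RESTEasy on Tomcat, so it is a stand-in for the approach, not the setup from the post. With RESTEasy, the wrapper would typically be a JAX-RS annotated resource class instead. The Greeter and GreeterResource classes, the /greet path, and the name parameter are all invented for this illustration. The point is only that the core class knows nothing about HTTP, and the wrapper is the single place that does.

```java
import com.sun.net.httpserver.HttpServer;
import java.io.OutputStream;
import java.net.InetSocketAddress;
import java.nio.charset.StandardCharsets;

public class WrapperSketch {

    // Core implementation (hypothetical): plain Java, no web-service imports or annotations.
    static class Greeter {
        String greet(String name) {
            return "Hello, " + name + "!";
        }
    }

    // Thin wrapper (hypothetical): turns a request's query string into a core call.
    static class GreeterResource {
        private final Greeter core = new Greeter();

        String handle(String query) {
            // Deliberately naive parsing: expects exactly "name=<value>", else a default.
            String name = (query != null && query.startsWith("name="))
                    ? query.substring("name=".length())
                    : "world";
            return core.greet(name);
        }
    }

    // Exposes the wrapper over HTTP using the JDK's built-in server (stand-in for RESTEasy/Tomcat).
    static HttpServer start(int port) throws Exception {
        GreeterResource resource = new GreeterResource();
        HttpServer server = HttpServer.create(new InetSocketAddress(port), 0);
        server.createContext("/greet", exchange -> {
            byte[] body = resource.handle(exchange.getRequestURI().getQuery())
                    .getBytes(StandardCharsets.UTF_8);
            exchange.sendResponseHeaders(200, body.length);
            try (OutputStream out = exchange.getResponseBody()) {
                out.write(body);
            }
        });
        server.start();
        return server;
    }

    public static void main(String[] args) throws Exception {
        HttpServer server = start(0); // port 0 lets the OS pick a free port
        System.out.println("Listening on port " + server.getAddress().getPort());
        server.stop(0);
    }
}
```

Because Greeter has no dependency on the wrapper, it can be unit-tested or reused outside a web container without change, which is the benefit described above.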
Like most of us who still maintain their own web presence instead of totally giving in to Facebook, or maybe posting exclusively on Google+, I’m never really satisfied with the layout and look and feel of my site.
I don’t remember the exact date, but I published my first HTML page on a server running at the University of Marburg, Germany, in fall 1995. I still remember the URL, which appears on the Wayback Machine as early as May 2nd, 1997.
Yesterday was a big, maybe even a huge, day for voice user interfaces.
Microsoft introduced us to Cortana, and Amazon introduced the Fire TV box, which includes a remote control that supports voice input. Considering that neither is first to market (Siri, Comcast’s Xfinity X1), their entry further validates VUIs.
Fire TV’s voice recognition seems so good that when Jon Fortt was demonstrating it today on live television (CNBC), he dared to ask it for “Pawn Stars” http://video.cnbc.com/gallery/?video=3000264102
Wedbush analyst Shyam Patil wrote that Nuance Communications was likely powering Amazon.com’s Fire TV voice search, while reiterating a neutral rating and $15.00 price target on Nuance. (Nuance traded today at $17.59.)
The consolidation in speech recognition and synthesis seems to continue. Apple acquired speech recognition pioneer Novauris last year, but this was not announced until today. One of the biggest differentiators of Novauris in the competitive landscape is that they operated in both the embedded (i.e., on-device, like OpenEars, PocketSphinx) and server spaces (like LumenVox, Nuance), and they also owned the core engine.
“NovaSearch doesn’t carry out recognition at the word or sequence-of-words level, but rather identifies complete phrases from start to finish by matching them against a potentially huge inventory of possible utterances. This enables it to assemble information about what has been spoken over utterances of virtually any length and take near-optimal decisions.”