
Wolf Paulus' Journal

Inspire • Innovate • Invent

This is my blog, where I share quick thoughts, links, and ideas.


Voice User Interfaces

A more diversified mobile device landscape requires us to rethink established UX patterns. For next-generation mobile devices, voice input and output will be significant. Traditional UI widgets simply don’t work on wearables and in-car systems.


The Highway Platform

An aggregator that analyzes, summarizes, synthesizes, and augments information found in various data feeds. Information, including text, images, GPS, and seismic data, is then pushed into a highly customized CMS.


About Me

My name is Wolf Paulus. I am a photographer, hiker, hacker, and technologist based in Ramona, California.

I aim to inspire and innovate, with an emphasis on voice user interfaces, speech recognition and synthesis, NLU, and AIML.

I focus on embedded, mobile, and open source technologies and help accelerate the discovery and adoption of emerging mobile technologies.

Software

I created the Java-based open source XUL Engine SwixML, which Sun’s CTO called “The strongest straightforward design of declarative UI implementations”.

SwixML represents ideas that are heavily reused today in Google’s Android SDK: graphical user interfaces are described declaratively in XML documents, which are parsed and rendered into UI widget hierarchies at runtime.
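To make the idea concrete, here is a minimal sketch of runtime UI generation from XML. It is not SwixML's or Android's actual API: the `Widget` class, the `inflate` method, the tag names, and the `text` attribute are all hypothetical, and a toy widget tree stands in for real Swing or Android components. Only the JDK's built-in XML parser is used.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class DeclarativeUiSketch {

    /** Hypothetical toy widget: a type name, an optional text, and child widgets. */
    static final class Widget {
        final String type;
        final String text;
        final List<Widget> children = new ArrayList<>();

        Widget(String type, String text) {
            this.type = type;
            this.text = text;
        }

        /** Renders the hierarchy as an indented outline, one widget per line. */
        String outline(String indent) {
            StringBuilder sb = new StringBuilder(indent).append(type);
            if (!text.isEmpty()) {
                sb.append(" \"").append(text).append('"');
            }
            sb.append('\n');
            for (Widget child : children) {
                sb.append(child.outline(indent + "  "));
            }
            return sb.toString();
        }
    }

    /** Parses an XML UI description and builds the widget hierarchy at runtime. */
    static Widget inflate(String xml) throws Exception {
        Element root = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)))
                .getDocumentElement();
        return build(root);
    }

    /** Maps one XML element to one widget, then recurses into child elements. */
    private static Widget build(Element element) {
        // getAttribute returns an empty string when the attribute is absent.
        Widget widget = new Widget(element.getTagName(), element.getAttribute("text"));
        NodeList nodes = element.getChildNodes();
        for (int i = 0; i < nodes.getLength(); i++) {
            Node node = nodes.item(i);
            if (node instanceof Element) { // skip whitespace and other non-element nodes
                widget.children.add(build((Element) node));
            }
        }
        return widget;
    }

    public static void main(String[] args) throws Exception {
        // Assumed example document; the tags are illustrative only.
        String xml = "<panel><label text=\"Name\"/><button text=\"OK\"/></panel>";
        System.out.print(inflate(xml).outline(""));
    }
}
```

Running `main` prints a three-line outline: `panel`, followed by the indented children `label "Name"` and `button "OK"`. A real engine would map each tag to a concrete widget class and apply attributes as properties, but the parse-then-build recursion would likely look similar.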

But I have created much more software that I’m extremely proud of.


Mobile

A lot of my work revolves around early technology prototyping. Still, I try to turn some ideas into real-world mobile applications.

Take a look at Artist on Android, the Horsemen of Speech Recognition, or other apps that I have published under the Techcasita Productions brand in Google’s Play Store.

Cloud Services

Most mobile applications consume some sort of cloud service. Speed is extremely important for Voice User Interfaces to work well, which means you want to do as much as possible on-device. However, speech recognition accuracy and speech synthesis quality often require implementing these services in the cloud. Related services that I have recently implemented as cloud services include aggregation, text summarization, and sentiment analysis.

PRESENTATIONS

I have been appointed to the advisory committee for the Mobile App Development Certificate at the University of California, Irvine, and I occasionally speak at conferences and user groups on topics ranging from Embedded Technology to Declarative Programming, emphasizing UI Generation at Runtime, and, of course, everything related to Voice User Interfaces.

Have a look at some slides from my most recent talks.


INSPIRING VIDEOS

Many new concepts that I implement in mobile applications are best communicated through video clips or short films, and I’m not talking about simple screen grabs.

Take a look at some high-quality short HD films that I have created over the last few months and years.


STUNNING PHOTOS

Amateur professionalism, a concept used since 2004, describes an emerging sociological and economic trend of people pursuing amateur activities to professional standards. Well … that pretty much describes how I look at my photography work today. If you like, take a look at some of my photos and the stories behind them at http://ramonaphoto.com


RECENT POSTS

Java 8, Tomcat 8, and RESTEasy on Mac OS X

The main idea for this blog post is to use the Mac for quickly developing and deploying a web app in Java. I’m using RESTEasy on top of Tomcat as my server platform and IntelliJ IDEA as my preferred IDE. Once the web app has been built and tested locally, IDEA is also used for archiving the web app and remotely deploying it to a Linux box.

RESTEasy is used to build a simple wrapper that exposes the implemented functionality as a web service, without having to modify the core implementation, which stays independent of all web-service-related resources.


My WWW History

Like most of us who still maintain our own web presence instead of totally giving in to Facebook, or maybe posting exclusively on Google+, I’m never really satisfied with the layout and look and feel of my site.

I don’t remember the exact date, but I published my first HTML page on a server running at the University of Marburg, Germany, in fall 1995. I still remember the URL, which appears on the Wayback Machine as early as May 2nd, 1997.


Voice User Interfaces

Yesterday was a big, or maybe even a huge, day for voice user interfaces.
Microsoft introduced us to Cortana, and Amazon introduced the Fire TV box, which includes a remote control that supports voice input. Considering that neither is first to market (Siri, Comcast’s X1 Xfinity), their entry further validates VUIs.

Fire TV’s voice recognition seems so good that when Jon Fortt was demonstrating it today on live television (CNBC), he dared to ask it for “Pawn Stars”: http://video.cnbc.com/gallery/?video=3000264102
Wedbush analyst Shyam Patil wrote that Nuance Communications was likely powering Amazon.com’s Fire TV voice search, while reiterating a neutral rating and $15.00 price target on Nuance. (Nuance traded today for $17.59)

The consolidation in speech recognition and synthesis seems to continue. Apple acquired speech recognition pioneer Novauris last year, but this was not announced until today. One of Novauris’s biggest differentiators in the competitive landscape is that they operated in both the embedded space (i.e. on-device, like OpenEars, PocketSphinx) and the server space (like LumenVox, Nuance), and they also owned the core engine.
“NovaSearch doesn’t carry out recognition at the word or sequence-of-words level, but rather identifies complete phrases from start to finish by matching them against a potentially huge inventory of possible utterances. This enables it to assemble information about what has been spoken over utterances of virtually any length and take near-optimal decisions.”


Twitter

Follow me on Twitter


LinkedIn

View my profile on LinkedIn
