YOUR FEEDBACK
Werner Keil wrote: Java 6 update 10. If I'd be running Apple, I'd probably really drop dead...
AJAXWorld RIA Conference
$300 Savings Expire September 5th. Register Today and SAVE!


2008 East
DIAMOND SPONSOR:
Data Direct
Frontiers in Data Access: The Coming Wave in Data Services
PLATINUM SPONSORS:
Red Hat
The Opening of Virtualization
Intel
Virtualization – Path to Predictive Enterprise
Green Hills
IT Security in a Hostile World
JBoss / freedom oss
Practical SOA Approach
GOLD SPONSORS:
Software AG
The Art & Science of SOA: How Governance Enables Adoption
PlateSpin
Effective Planning for Virtual Infrastructure Growth
Fujitsu
Automated Business Process Discovery & Virtualization Service
Ceedo
Workspace Virtualization
Click For 2007 West
Event Webcasts

2008 East
PLATINUM SPONSORS:
Appcelerator
Think Fast: Accelerate AJAX Development with Appcelerator
GOLD SPONSORS:
DreamFace Interactive
The Ultimate Framework for Creating Personalized Web 2.0 Mashups
ICEsoft
AJAX and Social Computing for the Enterprise
Kaazing
Enterprise Comet: Real–Time, Real–Time, or Real–Time Web 2.0?
Nexaweb
Now Playing: Desktop Apps in the Browser!
Sun
jMaki as an AJAX Mashup Framework
POWER PANELS:
The Business Value
of RIAs
What Lies Beyond AJAX?
KEYNOTES:
Douglas Crockford
Can We Fix the Web?
Anthony Franco
2008: The Year of the RIA
Click For 2007 Event Webcasts
SYS-CON.TV
TODAY'S TOP SOA & WEBSERVICES LINKS


Selecting a VoiceXML Gateway
Selecting a VoiceXML Gateway

As an open standard, VoiceXML truly leverages the knowledge and products that the industry has developed around Advanced Speech Recognition (ASR), Text-to-Speech (TTS), telephony interfaces, voice printing, and Voice over IP (VoIP). As an open standard, what VoiceXML has made possible is flexible models for development and deployment.

Some of my previous articles reviewed deployment tools that are available for developers to rapidly build and test VoiceXML-based interactive speech applications. In this issue I'll preview some options that are available to enterprise speech recognition developers and IT managers for deploying a VoiceXML application.

In the earlier articles we discovered that a number of third-party vendors provide hosted VoiceXML development tools and another group provides desktop-based development tools. Depending on your scenario and your investment in speech recognition, you can pick and choose which development tool model suits your environment. And once you're ready to deploy VoiceXML applications, a number of similar models are available. Although it's definitely possible to buy the entire suite of speech recognition and telephony infrastructure in-house - if you have the investment capability - it's also possible to lease the whole infrastructure, including the telephony lines and integration, from an outsourced service provider.

What Is a VoiceXML Gateway?
As depicted in Figure 1, a VoiceXML gateway is the key link between the telephony infrastructure and your VoiceXML application, representing a suite of integrated technologies. Typically, a gateway includes technology components such as ASR, TTS, VoiceXML interpreter, and telephony integration. Optionally, it can also include voice authentication/voiceprint technology, a set of platform extensions to VoiceXML, and reusable components. Even though I've represented a VoiceXML gateway conceptually as a single piece of infrastructure, it's important to understand that it's typically an integration of multiple technologies. As we'll explore in the rest of the article, some vendors sell these technologies both individually and as an integrated suite of products.

Selecting the Gateway
Developing and deploying speech applications is a challenging task. Typically, developing and deploying an IVR application has meant investing in a set of proprietary technologies and systems hosted either within your own company or as an expensive outsourced offering. VoiceXML has opened up a landscape of speech application deployment to a whole suite of third-party technology providers. A number of vendors have brought in interesting tools and application models to support rapid development and deployment of VoiceXML applications. In a nutshell, three approaches to deploying your VoiceXML applications to the public telephony network have emerged, based on how you set up your key infrastructure link, the VoiceXML gateway:

  • Build: Build your own VoiceXML gateway by integrating a suite of best-of-breed speech recognition technologies. Connect this gateway with the PSTN (public switched telephony network) or an internal VoIP network (if using the application within corporate boundaries).

  • Buy: Buy an integrated VoiceXML gateway. You still need to connect it with the telephony network.

  • Rent: Outsource the functions of the VoiceXML gateway and integration with the telephony network to a service provider. You still have to develop and host your VoiceXML application, though. Of course, you can still utilize your regular Web hosting provider for hosting/colocating your VoiceXML application.

    Key Selection Criteria
    Following are the key selection criteria you should analyze while evaluating the various alternatives:

    • VoiceXML 1.0/2.0 compliance
    • Stability of the solution
      - Availability
      -Vendor presence in your industry
    • Number of concurrent users supported
      -Typical versus spike
    • Grammar formats supported
    • Support for integrated development tools
    • Integration with existing call center systems
    • ASR engines supported
    • TTS engines supported
    • Languages supported
    • Audio formats supported (for prerecorded audio)
    • Reusable components
      -Reusable dialogs
      -Prebuilt library of audio prompts
      -Prebuilt library of grammars
    • Extensive auditing/debugging capability
    • Density (number of ports supported per server)
    • Investment required (different pricing models per port, per minute usage, etc.)
      -Onetime investment required
      -Recurring expense
    The remainder of this article is a critical analysis of the three approaches for deploying VoiceXML applications. The objective is to provide you with the knowledge to make the right decision for your next VoiceXML-based application.

    Build your own VoiceXML gateway
    This approach requires you to integrate the best-of-breed speech recognition technologies, hardware, and telephony platforms and create your own representation of a VoiceXML gateway. Typically, you'd purchase a server (Unix or Windows based), buy telephony integration boards (such as those from Intel Dialogic), and install other components on the same server or, if supported by the vendors, in a distributed scenario. Table 1 lists some key characteristics of the "build" solution.

    Buy integrated VoiceXML gateway
    The major difference between this and the build approach is that you outsource the integration of the suite of speech recognition and text-to-speech technologies to a third-party vendor. Although you'd purchase an integrated VoiceXML gateway from a third-party vendor, you'd still have to integrate it with the telephony infrastructure (lease lines, etc.). Table 2 lists key characteristics of the "buy" solution.

    Rent outsourced VoiceXML gateway
    The "rent" model, also known as a Voice ASP (application service provider) or sometimes a VSP (voice service provider), provides the capabilities of a VoiceXML gateway, including speech recognition, TTS, integration with the telephony system, and so on, in a completely outsourced manner. Typically, the VoiceXML application would be developed on your favorite Web/application server platform and, through either a public Internet or a virtual private network, would link the HTTP/ HTTPS-based speech application to a telephone number(s). Table 3 lists some key characteristics of the "rent" solution.

    Conclusion
    If your company uses an IVR system/technology today, chances are that your IVR provider is already working on being VoiceXML compliant. It's also possible that some components of your existing IVR infrastructure (e.g., speech recognition/ASR, telephony boards, or TTS) can be upgraded/utilized as essential components of a VoiceXML gateway. In these scenarios a build approach may be more suitable.

    However, if your company doesn't have a lot of experience working with the speech recognition/telephony infrastructure, and lowering the cost of initial investment is critical, you'd probably like to test the waters with an outsourced voice service provider. Most such providers provide a free development capability that would allow you to build a prototype and show it to some key users to get feedback. Ultimately you have to consider the criteria that are most critical for your environment and make the build/buy/rent decision for deploying your VoiceXML application.

    About Hitesh Seth
    Hitesh Seth is chief technology officer of ikigo, Inc., a provider of XML-based web-services monitoring and management software. A freelance writer and well-known speaker, he regularly writes for technology publications on VoiceXML, Web Services, J2EE and Microsoft .NET, Wireless Computing & Enterprise/B2B Integration. He is the conference chair for VoiceXML Planet Conference & Expo.

  • YOUR FEEDBACK
    What? wrote: "The last time I was this excited about a new SDK was probably when .NET 2.0 came out" OK, that's the funniest thing I think I've ever read in one of these articles. I didn't realize how completely sarcastic it was, but then I imagined Lewis Black reading it out loud and it finally made sense. This whole article is supposed to be a joke. If that's the case, I guess I'll play along: "Needless to say, my hopes and dreams came crumbling down when I realized that this new fangled iPhone device contained an operating system, and if that isn't bad enough, one that was based on some kinda open source garbage - not even windows CE (the nerve!). So I threw the thing out, and promptly started working on SharePoint Unleashed 2nd ed. where at least nobody has the gaul to use an undocumented API. Why are these people doing that? Huh? Why did they have to go and do that? I mean, wait for the...
    spinron wrote: Having bought and read the pre-release version of the book discussed here ("Rough-Cuts" edition, available on O'Reilly's site for $20), I tend to disagree with Kevin's opinion and lean more towards the book's author's view that the "unofficial" SDK, or the at least the API represented in it, are likely to more-or-less remain equivalent to the ones that would be exposed by the official Apple iPhone SDK. The iPhone platform implements a subset of the Mac OS X API which the book describes quite nicely. Why on earth would Apple want to re-invent a new API just for the iPhone SDK, after it's worked so hard to perfect its API over a decade? For spite, just to break the existing applications and necessitate a rewrite? Not a strong argument here. Seriously, get the rough-cuts edition now and read it. Consider it a preview for the official SDK. Most of the material it discusses is likely to rema...
    Endre Stølsvik wrote: I think this blog entry is stupid. If you're correct, and the book is about jailbroken iPhones, I think it is really cool of O'Reilly to flip the finger at Apple's idiotic attitude. "Confusing the developers" - are you insane or something? Do you believe that you are the only "developer" with more than about 6 brain cells? A "developer" that starts coding on an iPhone without realizing what he's really up against must be fully brain damaged. No, no one will be confuzed. Seriously. ColdFusion Developer's Journal - wow..
    germ wrote: Hello? There are a million hacked iPhones out there. Hacking the iPhone is the only reason to buy it.
    Brett wrote: Surely they can cater for the reality of iPhone usage in the market ? Hacking the phone and breaking the software license agreement isn't necessarily bad or illegal.. depends who you talk to... There are laws that support the consumer's rights to reverse-engineer their device, or to make changes to allow moving to a different carrier (eg the Digital Millennium Copyright Act). Just because it conflicts with the user's agreement with Apple doesn't make it a 'bad thing', it just means they might have to deal with some contractual consequences, or not...
    Pedro wrote: "How many potential developers might stumble upon the information on O'Reilly's site, follow the instructions to start coding, only to eventually realize that customers with unhacked phones can't run their apps??" I think that a person that starts writing code without even notice that it will work only with jailbreaked phones don't have any idea about iphone development and doesn't even deserve the "developer" title.
    iPhone News Desk wrote: So is O'Reilly actually condoning the hacking of the phones? O'Reilly has had a long and prestigious history as being the ultimate source for *nix manuals, including many books that became so dogeared I actually bought multiple copies, including dozens of 'in a nutshell' books.
    XML JOURNAL LATEST STORIES . . .
    To be able to do anything useful, an ESB must be configured with all sorts of parameters, from endpoint connection URIs to message transformation scripts to content-based routing definitions. Moreover, ESBs like Mule can host custom components, which will process messages and perform u...
    Representatives of the state IT organizations of Brazil, South Africa and Venezuela, three of the four countries that protested ISO’s standardization of Microsoft’s Office Open XML (OOXML) file format, have apparently thrown in the towel on taking their appeal any further. India, t...
    Two of the biggest launches in Rich Internet Application history took place in 2007/2008 when Adobe launched AIR 1.0 in February '08 and Microsoft launched Silverlight (September '07). At the 6th International AJAXWorld RIA Conference & Expo in October SYS-CON Events is delighted to be...
    Red Hat CTO Brian Stevens, Citrix CTO Simon Crosby, Egenera CTO Pete Manca, Allen Stewart, Group Manager, Windows Virtualization at Microsoft, and Brian Duckering, Sr. Director of Products and Alliances at Symantec were the top industry executives who joined Jeremy Geelan in the 4th Fl...
    This article is aimed at beginner and intermediate Web developers looking to make the leap into database support of their Web site. The article suggests a new declarative language based on HTML-forms, which is used for development of the database interface. HTML forms can manage not on...
    SUBSCRIBE TO THE WORLD'S MOST POWERFUL NEWSLETTERS
    SUBSCRIBE TO OUR RSS FEEDS & GET YOUR SYS-CON NEWS LIVE!
    Click to Add our RSS Feeds to the Service of Your Choice:
    Google Reader or Homepage Add to My Yahoo! Subscribe with Bloglines Subscribe in NewsGator Online
    myFeedster Add to My AOL Subscribe in Rojo Add 'Hugg' to Newsburst from CNET News.com Kinja Digest View Additional SYS-CON Feeds
    Publish Your Article! Please send it to editorial(at)sys-con.com!

    Advertise on this site! Contact advertising(at)sys-con.com! 201 802-3021


    SYS-CON FEATURED WHITEPAPERS


    ADS BY GOOGLE
    BREAKING XML NEWS

    Security Challenges for the Information Society