YOUR FEEDBACK
John Portnov wrote: This code does not work for me. I created a new website and a C# console applic...
AJAXWorld RIA Conference
$300 Savings Expire August 22
Register Today and SAVE!


2008 East
DIAMOND SPONSOR:
Data Direct
Frontiers in Data Access: The Coming Wave in Data Services
PLATINUM SPONSORS:
Red Hat
The Opening of Virtualization
Intel
Virtualization – Path to Predictive Enterprise
Green Hills
IT Security in a Hostile World
JBoss / freedom oss
Practical SOA Approach
GOLD SPONSORS:
Software AG
The Art & Science of SOA: How Governance Enables Adoption
PlateSpin
Effective Planning for Virtual Infrastructure Growth
Fujitsu
Automated Business Process Discovery & Virtualization Service
Ceedo
Workspace Virtualization
Click For 2007 West
Event Webcasts

2008 East
PLATINUM SPONSORS:
Appcelerator
Think Fast: Accelerate AJAX Development with Appcelerator
GOLD SPONSORS:
DreamFace Interactive
The Ultimate Framework for Creating Personalized Web 2.0 Mashups
ICEsoft
AJAX and Social Computing for the Enterprise
Kaazing
Enterprise Comet: Real–Time, Real–Time, or Real–Time Web 2.0?
Nexaweb
Now Playing: Desktop Apps in the Browser!
Sun
jMaki as an AJAX Mashup Framework
POWER PANELS:
The Business Value
of RIAs
What Lies Beyond AJAX?
KEYNOTES:
Douglas Crockford
Can We Fix the Web?
Anthony Franco
2008: The Year of the RIA
Click For 2007 Event Webcasts
SYS-CON.TV
TODAY'S TOP SOA & WEBSERVICES LINKS


Tools for Developing VoiceXML Applications Part 2
Tools for Developing VoiceXML Applications Part 2

Last year in this column (XML-J, Vol. 2, issue 2) we reviewed an initial set of VoiceXML development tools and integrated development environments (IDEs). Now we'll review some of the new tool sets developed by independent software vendors for the development and testing of dynamic speech-based components and applications.

I'll focus on how to use these tools to test and debug VoiceXML applications from a desktop-based development environment and/or a normal touch-tone-based phone.

Note/Disclaimer: This article doesn't attempt to compare the capabilities and features of the various tools, but instead provides information on the various development tools in a concise fashion. To ensure neutrality, the vendors have also been listed in alphabetical order. Even though I've tried to provide the latest and most correct information and compliance, please validate it with respect to the VoiceXML specifications through the appropriate vendors.

GetVocal VoiceXML SDK
VoiceXML SDK from GetVocal (a division of Digital Voice Technologies) is a desktop-based Visual VoiceXML development and testing tool. It supports the latest (VoiceXML 2.0 draft) version of the specification including support for XML-based grammars. It's built on top of Microsoft Speech SDK 5.1 and uses Microsoft TTS and ASR engines for text-to-speech and speech recognition.

Graphical testing and development is the key highlight of the SDK (see Figure 1), which also includes a virtual phone pad for real-phone simulation, a graphical VoiceXML generator, scripting support through JavaScript VBScript (with extensions for invoking ActiveX components and Java classes), and a complete visual debugging environment with steps, breakpoints, and variable watches.

Apart from testing local VoiceXML content, the SDK also supports visual interpretation and testing of remote/dynamic VoiceXML applications. A distinctive highlight of the SDK is that apart from the simulation through a virtual phone, it can actually use a TAPI-based interface to interact with a live phone through a voice modem or an installed dialogic card. This feature converts the SDK from a desktop-based testing tool to a full-featured single-port VoiceXML gateway. The SDK also includes a complete VoiceXML tag reference and sample VoiceXML applications.

Included with the SDK is GetVocal VoiceXML Editor, a rich editor that features syntax highlighting and configurable tag shortcuts. A time-limited evaluation version of GetVocal SDK is available for Windows 98/ME/NT/2000/XP platforms from www.getvocal.com/vxml.asp.

HeyAnita FreeSpeech Developer Network
FSDN is a hosted VoiceXML platform. The tools include integrated development environments for developing VoiceXML applications and XML-based grammars. The Web-based integrated development environment approach provides flexibility by allowing VoiceXML documents and external grammars to be stored in multiple locations: a self-managed online scratchpad, uploaded files, and external Web servers.

Applications built using the FreeSpeech Developer Network use XML grammars that are based on the Speech Recognition Grammar Specification draft available from W3C (www.w3.org/TR/speech-grammar), now part of VoiceXML 2.0 (see sidebar). To jump-start grammar development there are a couple of tools such as a wizard-based grammar generator (see Figure 2) and an online grammar parser to test utterances.

SIDE BAR
Update: VoiceXML 2.0 Draft Specification Is Now Available

In October 2001, W3C released a draft of VoiceXML 2.0. VoiceXML 1.0 was released by the VoiceXML Forum in March 2000. Some highlights of the 2.0 draft:

  • The W3C Voice Browser Activity (www.w3.org/voice) has officially taken over the management and technical development of the VoiceXML specification. This is a good step for the development of VoiceXML as W3C is a great standards organization. The VoiceXML Forum (www.voicexml.org) continues to focus on conformance testing and a branding program for VoiceXML.
  • Mandatory support for XML-based Speech Synthesis Markup Language (www.w3.org/TR/speech-synthesis).
  • Mandatory support for XML-based Speech Recognition Grammar Format (www.w3.org/TR/speech-grammar). This is critical as it will result in better conformance between the different vendor implementations of VoiceXML 2.0. In addition to the XML-based format, VoiceXML 2.0 implementations can optionally support the augmented BNF form of the grammar format specification as well as other formats such as JSGF.
  • A number of other changes have been recommended, such as changes in both elements and attributes.
    END SIDE BAR

    To assist in development, FSDN provides technical resources such as a VoiceXML reference, a grammar reference, FAQ, tutorials, and a prebuilt VoiceXML and grammar library. For communication between developers, FSDN includes NNTP-based threaded forums. FreeSpeech Developer Network is available at http://freespeech.heyanita.com.

    IBM WebSphere Voice Toolkit
    WebSphere Voice Toolkit is a desktop-based IDE that's built on top of IBM's next-generation open-source development tool platform, Eclipse (www. eclipse.org). Voice Toolkit is really written as a plug-in to the underlying core Eclipse platform. This allows Voice Toolkit to inherit the advanced IDE features provided by the open-source Eclipse IDE and IBM's commercial tools built on top of Eclipse. For instance, Voice Toolkit can inherit other Eclipse modules (plug-ins) for server-based dynamic application development with IBM WebSphere Application Developer Studio to provide an integrated server-based J2EE/VoiceXML application development with full-fledged features for J2EE application development and common IDE features such as CVS version management and project management.

    For VoiceXML applications, Voice Toolkit leverages IBM Voice SDK (reviewed in this column in XML-J, Vol. 2, issue 2) to interface with desktop text-to-speech and recognition engines. This integration enables the toolkit to test the VoiceXML content in both audio and text simulation modes. In addition, Toolkit includes tools for editing and testing Java Speech Grammar Format (JSGF) and Speech Recognition Control Language (SRCL) based grammars (see Figure 3), a color-coded VoiceXML editor with a content assist feature to select elements and attributes, an audio recorder for creating and playing .au/.wav prompts, a pronunciation builder for the WebSphere VoiceXML Server/SDK, wizards for creating database-based dynamic Web applications using JSP and JavaBeans, and a voice application debugger.

    WebSphere Voice Toolkit is available for Windows 2000-based operating systems from www-3.ibm.com/software/speech/enterprise/vtoolkit.html.

    Telera DeVXchange AppBuilder
    DeVXchange AppBuilder from Telera Inc. is a desktop-based visual VoiceXML application generation IDE. At first it gives the impression that it's a traditional IVR toolset, and even its drag-and-drop palette-building blocks don't directly map to VoiceXML elements. However, with a little bit of exposure you can understand that the tool focuses on generating the underlying dynamic scripting code in VoiceXML 1.0.

    Figure 4 shows a simple menu-based dialog built using the visual tool. The bottom window shows the VoiceXML dynamically generated. In addition, the tool features integration with Visual SourceSafe for configuration management, FTP-based remote publishing, rich building blocks for a call-flow scheduler, a task balancer, logging, speech recording, variable-based call routing, database query, call transfer, call bridging, and the ability to create custom reusable blocks based on XML. The tool differs from a number of other VoiceXML IDEs as it focuses on the server-based ASP/JSP-based dynamic application generation rather than the VoiceXML template itself.

    Telera AppBuilder runs on top of Windows NT/2000 platforms and requires either Microsoft Internet Information Server 4.0/5.0 and/or a JSP/servlet engine such as Apache Tomcat (included with the installation). More information about AppBuilder is available at www.telera.com/appbuilder.html.

    VoiceGenie Genie IDE
    Genie IDE from VoiceGenie Technologies is a desktop-based toolkit for developing and testing local and remote VoiceXML-based applications. Genie 1.0 (see Figure 5) is a full-featured environment for the interactive development of VoiceXML applications, including project management with FTP-based publishing support, syntax highlighting, indentation, a palette-like tag manipulation tool for easy tag reuse, configurable DTD support, and coding assistance through tag and attribute choices. The IDE also includes online help, sample VoiceXML files, and a step-by-step tutorial.

    A distinct highlight and highly productive feature of the IDE is the wizard-based support for VoiceXML generation. Using intuitive and simple dialogs, wizards can help create both VoiceXML dialogs and forms and XML- and ABNF-based grammars. Genie 1.0 leverages technologies such as the latest VoiceGenie VoiceXML interpreter and the Microsoft SAPI interface for Text to Speech (TTS).

    Apart from the development and testing of locally created .vxml files, Genie 1.0 IDE also supports the testing of remote (URL-based) VoiceXML applications. This feature can be used to test application server-based dynamic VoiceXML applications. In essence, the local IDE can be used to create the VoiceXML content template and additional dynamic scripts and tags (such as Perl/JSP/ASP/ASP.NET), then be served through a Web or application server and tested remotely. To aid the dialog flow, Genie 1.0 also provides a visual tool called "Show Dialog Flow," which constructs a simple flow diagram.

    Genie IDE 1.0 is available officially for Windows 2000 (Service Pack 1). However, I tested my version on Windows XP and it was pretty stable. A 90-day evaluation copy of the Genie 1.0 IDE is available for download from http://developer.voicegenie.com/IDE.php.

    Voxeo Designer
    Voxeo Designer from Voxeo Corporation is a Java-based visual VoiceXML editor. It provides an integrated round-trip source and visual dialog design editor with a focus on visual representation of the dialog flow (see Figure 6). It uses a context-sensitive, palette-based toolbar for visual drag-and-drop-based VoiceXML generation. It includes online help and allows remote editing and publishing of VoiceXML content.

    Designer is available for Windows platforms from http://community.voxeo.com/vd2.jsp.

    Conclusion
    The depth and breadth of these additional tool sets for interactive design, development, and testing of VoiceXML applications illustrates that the whole VoiceXML-based speech application development and deployment landscape is maturing, leveraging the advances and features that we've seen in the traditional Web application development space. This year we should see a boost in the world of VoiceXML, accelerated by the richness of some of these development tools and technologies. Table 1 provides a summary of the tools reviewed in this article.

    Next I'll focus on how we can leverage Voice over IP (VoIP) to develop and deploy VoiceXML-based speech applications on IP-based telephony networks.

    References

  • W3C Voice Browser Activity: www.w3.org/Voice/
  • VoiceXML 2.0 Draft Specification: www.w3.org/TR/voicexml20/
    About Hitesh Seth
    Hitesh Seth is chief technology officer of ikigo, Inc., a provider of XML-based web-services monitoring and management software. A freelance writer and well-known speaker, he regularly writes for technology publications on VoiceXML, Web Services, J2EE and Microsoft .NET, Wireless Computing & Enterprise/B2B Integration. He is the conference chair for VoiceXML Planet Conference & Expo.

  • YOUR FEEDBACK
    XML Developer's Journal News Desk wrote: DocSoft says its new enterprise search technology, called 'Element' unleashes the power of structured data within a company's repository. With Element as a plug-n-play addition to a company's network, users will be able to search for stored data across the network 'smarter,' thus more efficiently, the company says. Element indexes XML-originated documents according to each tag or 'element,' which provides what the company calls 'context searching.' Element can even search metadata embedded into virtually any file format using Adobe's eXtensible Metadata Platform (XMP).
    XML JOURNAL LATEST STORIES . . .
    ISO said Friday that the appeals made by Brazil, India, South Africa and Venezuela protesting the standardization of Microsoft’s Office Open XML (OOXML) file format hadn’t gone anywhere – it was unclear whether any of them had any standing anyway – but since they “failed to g...
    Red Hat CTO Brian Stevens, Citrix CTO Simon Crosby, Egenera CTO Pete Manca, Allen Stewart, Group Manager, Windows Virtualization at Microsoft, and Brian Duckering, Sr. Director of Products and Alliances at Symantec were the top industry executives who joined Jeremy Geelan in the 4th Fl...
    Two of the biggest launches in Rich Internet Application history took place in 2007/2008 when Adobe launched AIR 1.0 in February '08 and Microsoft launched Silverlight (September '07). At the 6th International AJAXWorld RIA Conference & Expo in October SYS-CON Events is delighted to be...
    Since its inception, XML has been criticized for the overhead it introduces into the enterprise infrastructure. Business data encoded in XML takes five to 10 times more bandwidth to transmit in the network and proportionally more disk space to store.
    Vordel unveiled version 5.1 of its XML network infrastructure products, to accelerate, manage and protect XML applications. Vordel 5.1 addresses the need for lifecycle management of policy across the SOA. By combining the central management of SOA policies with distributed enforcement ...
    SUBSCRIBE TO THE WORLD'S MOST POWERFUL NEWSLETTERS
    SUBSCRIBE TO OUR RSS FEEDS & GET YOUR SYS-CON NEWS LIVE!
    Click to Add our RSS Feeds to the Service of Your Choice:
    Google Reader or Homepage Add to My Yahoo! Subscribe with Bloglines Subscribe in NewsGator Online
    myFeedster Add to My AOL Subscribe in Rojo Add 'Hugg' to Newsburst from CNET News.com Kinja Digest View Additional SYS-CON Feeds
    Publish Your Article! Please send it to editorial(at)sys-con.com!

    Advertise on this site! Contact advertising(at)sys-con.com! 201 802-3021


    SYS-CON FEATURED WHITEPAPERS


    ADS BY GOOGLE
    BREAKING XML NEWS
    Avineon, Inc. (http://www.avineon.com), a successful provider of IT, geospatial, engineering and pro...