WO2001073750A2 - Voice-interactive marketplace providing time and money saving benefits and real-time promotion publishing and feedback - Google Patents

Voice-interactive marketplace providing time and money saving benefits and real-time promotion publishing and feedback Download PDF

Info

Publication number
WO2001073750A2
WO2001073750A2 PCT/US2001/009705 US0109705W WO0173750A2 WO 2001073750 A2 WO2001073750 A2 WO 2001073750A2 US 0109705 W US0109705 W US 0109705W WO 0173750 A2 WO0173750 A2 WO 0173750A2
Authority
WO
WIPO (PCT)
Prior art keywords
business
voice
information
speech
caller
Prior art date
Application number
PCT/US2001/009705
Other languages
French (fr)
Other versions
WO2001073750A3 (en
Inventor
Ahmet Alpdemir
Original Assignee
Dialsurf, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/534,700 external-priority patent/US6658389B1/en
Application filed by Dialsurf, Inc. filed Critical Dialsurf, Inc.
Priority to AU2001249478A priority Critical patent/AU2001249478A1/en
Priority to CA002404343A priority patent/CA2404343A1/en
Publication of WO2001073750A2 publication Critical patent/WO2001073750A2/en
Publication of WO2001073750A3 publication Critical patent/WO2001073750A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/487Arrangements for providing information services, e.g. recorded voice services or time announcements
    • H04M3/493Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/487Arrangements for providing information services, e.g. recorded voice services or time announcements
    • H04M3/493Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
    • H04M3/4938Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals comprising a voice browser which renders and interprets, e.g. VoiceXML
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition

Definitions

  • This invention pertains generally to voice-recognition based or speech- recognition-based interactive electronic commerce, and more particularly to systems, methods, and methods of doing business for providing automated interactive directory assistance information from a business or organization to a consumer in need of goods and/or services.
  • the invention pertains even more particularly to systems, methods, and methods of doing business for providing automated speech-recognition driven query and response with business or event self-promotion features relative to businesses and events over ordinary wired or wireless telephone systems, PC systems, Personal Data Assistants (PDAs), and other communication and information appliances and devices.
  • PDAs Personal Data Assistants
  • Locating business establishments such as for example a restaurant satisfying the particular need of a customer, has hereto for generally required access to printed directory listings, or more recently access to the World Wide Web using a personal computer.
  • the availability of such references is frequently quite limited at the time the consumer desires to avail themselves of particular goods or services. For example, an out-of-town visitor driving in their automobile and approaching San Francisco might decide to stop and have dinner in a fine Italian restaurant and more particularly might like to have dinner in a fine Italian restaurant located in the particular area of the city., or to partake of a particular gastronomic delicacy. That visitor would likely not have a printed directory in their automobile or mobile access to the Internet to search for a restaurant satisfying their current need.
  • the visitor would likely either have to stop and asked for recommendations or drive around until a restaurant satisfying their needs might be located. Perhaps tens of minutes or hours later than desired. This approach is clearly inefficient, and the visitor may not have the dining experience expected if the restaurant they happen to see while driving turns out to have poor quality food, poor service, or both.
  • a cellular telephone will typically have only a few display lines presenting text or symbolic data to a user, but has substantial audio input and audio output capability that can be used by the consumer.
  • speech-to-text conversion has generally been limited to word processing and or computer or control applications as the has required fairly substantial processing power and memory within a computer device.
  • speech to text conversion products made by Dragon Systems generally require an Intel Pentium II or Pentium III microprocessor, AMD K-6 or Athlon processor, or the like running in excess of 450 MHz and 128 MB of memory.
  • Intel Pentium II or Pentium III microprocessor AMD K-6 or Athlon processor, or the like running in excess of 450 MHz and 128 MB of memory.
  • Text- to-speech conversion has been known but has not been utilized to provide an interactive interface between consumers and consumer information from telephone systems. Such continuous speech recognition systems also usually require voice training with the ultimate user to provide satisfactory results.
  • Some conventional systems and methods have been limited to playback of recorded audio or audio playback corresponding to the content of web pages; but such systems have not integrated Internet or web-based interactions with voice or telephone based information provision. They have also frequently provided inferior voice interfaces that have annoyed callers rather than having provided a useful information experience.
  • Businesses particularly local businesses make sacrifices when relying on printed Yellow Pages because they are expensive, difficult to sort prices and options, cannot be changed once in print, and their effectiveness cannot readily be determined without additional time consuming and expensive surveys. From customers, or potential customers, perspective sacrifices are made because they are too heavy to carry, present too many choices so that it is difficult for a customer to figure out which one to call as they usually lack the information that may effectively provide the decision criteria, the information is usually stale (12-16 months old is typical), and many customers may forget to take advantage of coupon offers even when available.
  • Local directory assistance and more recently national directory assistance through either a 411 type information service is also limited. From a local merchant's perspective, even when a potential customer receives a correct number the connection may not be made because either the customer must redial to get the number or pay an additional fee to be connected. From a customer's perspective, there is relatively easy access but at a cost of between about $0.50 and $1.00 per call. The directory, particularly when using the wireless directory assistance, is notorious for providing wrong numbers and there is no ability to get the correct number without re-dialing and paying an additional fee. Additional charges are also typically billed for requesting address information if available.
  • On-line coupons are somewhat of an improvement however they still present issues. They are expensive to place at high traffic portals for local merchants, and may still be lost in the noise of other promotions. There is low traffic at coupon only sites and their reach is limited due to the need for an Internet connection. The use of coupons may also be favored by groups that may not have ready access to the Internet.
  • the invention provides a system, method, and business model for an information system and service having business self-promotion features, including voice coupons, discounts, sales promotions or other special deals, ratings, and different categories of sponsorship and visibility to the calling public.
  • the inventive business model is directed to a business in which consumers call into a service using an ordinary telephone, PC, PDA, or other information appliance, and make requests in plain speech for information and positive referrals on goods and/or services, and the service provides responses to the request in plain speech in real-time over the same telephone, PC, PDA, or other information appliance.
  • the business model may further include providing a facility for a business to communicate a self-promotion of the business to the requestor, as well as providing an audio promotional coupon (or other promotional item) to a requestor when the requestor completes a call to a business using the service.
  • inventive system extends these features to the Internet and inter-operates with the Internet in a web page based embodiment to provide an additional portal and greater interactive capability.
  • the invention provides an operating model for a telephone- based audio or speech recognition and text to voice interfaced goods and services information, enhanced 41 1 type directory assistance and referral service having merchant self-promotion features, comprising: an information database provider storing merchant information; a merchant interface for inputting merchant information into the database and for retrieving and editing the information; and a consumer interface for inputting voice commands and data and for receiving merchant information and processed information from the database in response to the input voice commands and data.
  • the operating method may provide that the consumer interface comprises a telephone handset, and/or that the consumer also inputs non-voice commands and data from a keypad on the telephone handset.
  • the operating model may also provide that the telephone handset comprises a mobile telephone.
  • the invention provides a system comprising: a speech-to-text conversion engine converting speech-based input commands and data received from an external device over a communication link into text-based commands and data; a data base storing a plurality of data items; a database search engine searching the database for a particular data item in response to the text-based command and data; a text-to- speech conversion engine generating a speech-based representation of the particular data item identified in the database search; and a speech server for communicating the speech-based representation of the particular data item to the external device.
  • the invention provides audio coupons that operate as incentives for consumers to use the inventive system.
  • the invention provides system and methods for submitting and retrieving ratings for goods and or services.
  • the inventive system and method assists in providing directory driven wireless commerce.
  • the inventive system and method provide a promotion and advertising channel that has geographical and sociological reach and the speed needed in today's dynamic financial and commercial markets.
  • the inventive system and method provide a voice- interactive dynamic market place where individuals (particularly locals) call to save and businesses (particularly local merchants) call to publish sales promotions in real-time or near real-time.
  • the inventive system and method communicate information on an as requested basis that goes beyond the telephone number.
  • inventive system and method provide instant savings with voice coupons published by local merchants.
  • inventive system and method provide an advertising free initial experience where voice or audio coupons are only heard attached to businesses that the caller has requested or searched for.
  • inventive system and method provide for hands-free navigation with voice commands on any telephone or device supporting telephony.
  • the invention provides a low cost 411 directory assistance with added informational features that is easy to access.
  • the invention provides benefits to merchants including but not limited to targeted reach, instant promotion, instant or near-term feedback, and an optional free Internet web presence.
  • the invention provides benefits to common carriers and telephone companies who save conventional 411 costs, process higher call volumes, and attract new customers.
  • the invention provides a business model in which voice coupons are sold for distribution, a monthly fee is charged for subscription to the basic service, and additional charges are levied and collected for business category sponsorship.
  • the invention provides a business model in which the providing organization partners with a print yellow page or other business directory publisher and/or with direct marketing organizations to subscribe merchants, businesses, individual professionals, or other organizations.
  • the invention provides a business model in which the providing organization partners with a yellow page, wireless providers, telephone companies, and conventional 411 call centers to generate call traffic and thereby increase revenue.
  • the invention provides a business model in which conventional 411 directory assistance providers are replaced by the inventive system to save carriers 411 costs and to offer a new shared revenue channel and business model.
  • the invention provides a business model in which new customers are solicited and provided with value added services.
  • the invention provides a business model in which direct - S i - marketers are provided with a new coupon or promotion distribution channel.
  • the invention provides a business model in which new business is brought to direct marketing organizations through the voice channel of the inventive system. In still another aspect, the invention provides a business model in which direct marketing organizations are provided with rapid marketing feedback for their clients and customers.
  • the invention provides a business model in which print yellow page and print business directory publishers increase their existing revenues by increasing the yellow page or directory ad size sold by virtue of ad space required by a trademark and/or "voice coupon" icon or logo.
  • the invention provides a business model in which print yellow page or other print directory publishers are provided with increased usage of the yellow pages or directory to show which vendors provide immediate savings.
  • the invention provides a business model in which yellow page or other print directory publishers are provided with an enhancement in their market positioning by virtue of their providing an more complete and compelling offering.
  • the invention provides the closest locations for a particular requested category where the location of the caller is known from a caller location input, cellular signal triangulation, GPS position determination, or other position or proximity location means.
  • the invention provides means for a merchant to interact with the system using either voice or web interface and select templates for the type of social, economic, political, age, gender, profession, or other image the merchant wants to portray and the type of promotion message the merchant wants to publish with a small number of mouse clicks, key strokes, or voice commands and prompts.
  • the invention provides system and method for establishing user groups ("My" ... group) and communities based on lifestyles, usage patterns, interests and interest levels so that a registered user can subscribe to a group of multiple groups where merchant listings and other relevant information is given a priority.
  • group or community may include a bicycling enthusiast who can subscribe to the bicycling group and when he calls, he says "My bike” and will then be offered services that has something to do with biking.
  • the invention provides means for obtaining ratings, in which once a caller gets connected to the business through the talk411 service system, after a period of time measured in hours the service calls back the caller to ask for ratings or to collect feedback to improve service, where the caller has either registered to permit this inquiry or does not have caller ID blocked.
  • the invention provides system and method for merchants to post customer testimonials so that future callers can hear these messages as a reference that may help make a choice of which merchant they want to be connected with, and optionally, as the service gets used callers can leave testimonial messages which the business can choose to post for other users access.
  • the invention provides system and method for merchants to post key words on the voice system or internet site which can be used as a search term by the caller and that will be used as a navigation pointer to the posting merchant.
  • this key word or phrase based search may be used in connection with a new product or service (new movie, new CD, new restaurant, or any other product or service) so that the caller may speak (or otherwise input) this key word and be matched with one or more merchants offering it.
  • priority use of such key word may be auctioned to merchants for priority playback to callers.
  • the invention provides system and method allowing merchants to post their promotional message or company information in multiple ways including to record their own, select from a voice talent who would record the text the merchant put in the system using voice, a personal computer, or in any other way; or just type in an let a text-to- speech processor convert it.
  • the invention provides a coupon aggregation and translation engine and service allowing aggregation of different formatted coupons from online sites and reformat such coupons to a standard Dialsurf or other format that allows them to be played over the telephone phone.
  • the invention provides user call back to remind the user to rate a recently used local merchant service.
  • the invention provides means for an over the telephone offer for a user to become an instant member to a community or coupon distribution list using email or other communication means once the user asks for a specific category.
  • the user if the user is registered and set up "my411" features, the user gets coupons for this category delivered to his "my411" voice box.
  • the invention provides merchants the ability to identify specials in their "additional information" or "coupon message" that is used to find that merchant when those words are used over the phone in the key word search mode
  • the invention provides system and method for publishing secret words in local newspapers, Internet chat rooms and other community oriented online and offline boards, where they can be used on the phone or on our web site as a password to enter a sweepstake or win a prize, and to thereby provide a beneficial marketing tactic to increase sales of print papers and increase traffic to online portals
  • the invention provides voice coupon targeting based on area code and prefix, city, geocoded location, GPS location, zip code, cross streets, vicinity of a milestone, major tourist areas, major landmarks, airports, night clubs, entertainment centers, shopping malls, restaurants, and the like.
  • the invention provides means and business model for facilitating spread of secret words through the word of mouth initiating from someone in the company to provide access to privileged information, prizes, etc. to enhance the repeat user experience
  • the invention provides the use of live agent interaction on a random call basis to provide a surprise element to enhance the user experience, where live agents can be celebrities.
  • the invention provides user choice or automatic choice of voice of the Talk411 attendant based on gender, age, interest, and other selection criteria.
  • the invention provides user choice to select synthesized voice of a celebrity as the automated attendant.
  • the invention provides system and method for insertion of trivia questions where correct answer wins a prize from a local merchant.
  • the invention provides a business model in which 41 1 directory assistance call centers are provided with an added revenue stream to improve slim profit margins, to increase call volume by providing a desirable information service, to keep their current carrier customers with value added services, to decrease their operating costs and overhead by reducing the number of human employees, and by overcoming severe local competition on wireless carriers.
  • the invention also provides further apparatus, system, method, operating model and business method, computer program and software, and computer software program products that interoperate with the inventive systems and methods.
  • FIG. 1 is a diagrammatic illustration showing an exemplary embodiment of the inventive system.
  • FIG. 2 is a diagrammatic illustration showing an exemplary embodiment of speech server functionality.
  • FIG. 3 is a diagrammatic illustration showing an exemplary embodiment of a new business user (merchant) interaction with the inventive system.
  • FIG. 4 is a diagrammatic illustration showing an exemplary embodiment of an existing registered business user interaction with the inventive system.
  • FIG. 5 is a diagrammatic illustration showing an embodiment of a general consumer user interaction with the inventive system.
  • FIG. 6 is a diagrammatic illustration showing an exemplary implementation of the inventive directory service on the Web.
  • FIG. 7 is a diagrammatic illustration of a flow-chart for a method of interaction between the inventive system and a caller, including call flow procedures and procedures for articulating messages to the caller and for receiving inputs from the caller.
  • FIG. 8 is a diagrammatic illustration showing an embodiment of an alternative call handing procedure by the inventive system and service call center.
  • FIG. 9 is a diagrammatic illustration showing an exemplary coupons web page.
  • FIG. 10 is a diagrammatic illustration showing an exemplary get business web page.
  • FIG. 11 is a diagrammatic illustration showing an exemplary member menu web page.
  • FIG. 12 is a diagrammatic illustration showing an exemplary member registration web page.
  • FIG. 13 is a diagrammatic illustration showing the relationships of several web pages to each other.
  • These goals do not rise to the level of requirements, as a system that fails to meet all of the goals may nevertheless provide advantageous utility, but arguably the more goals that are satisfied, the more satisfying the implementation may be.
  • Many of the features described in the following paragraphs are also clearly optional and although they provide great utility and many benefits, would not be required in a basic system or method.
  • Talk411 is a one-stop fresh information source that offers instant promotion capabilities to local businesses using the telephone or the web.
  • the information service is free for consumers. It is a promotion service for local businesses similar to the Yellow Pages, except it can be accessed with voice interaction on the phone or with display devices equipped with an Internet browser.
  • the information database (content) may be acquired in any number of ways, including from one or more of the following ways: (i) Purchased from local telephone operators, such as Pacific Bell or other operator, (ii) Input from local businesses who subscribe to the service, or (iii) Content partners who provide local information on the web.
  • the local businesses can subscribe, access and maintain the information they want to communicate to the consumers in at least the following two ways: (i) Call the Talk411 service and voice navigate/input, and (ii) Browse the Talk411.com and input by typing the information, uploading business information such as logo, etc.
  • the consumers can access the information in several ways as well, including: (i) Call the Talk411 service and voice navigate and speak naturally, and (ii) Browse the Talk41 l.com web site. Selected features that a business may user are now described. These are examples to illustrate some interesting aspects of the invention and are not limitations or requirements of any particular embodiment.
  • Businesses can, for example: (i) Subscribe, register, and input their business information and promotional messages using the telephone or the web site, (ii) Become a member to the standard service, where they pay a low monthly fee in exchange for extended business information prepared specifically for the category they belong. This information is provided upon information request about that specific business, (iii) Become a Category Sponsor, where a message is played/displayed when a caller request businesses in a category, prior to playing the choices, (iv) Purchase Category positions for an additional monthly fee, where they purchase the right to be announced/displayed at a priority level when the callers/browsers request businesses in a specific category. Category Level 2 position for the month will ensure that the promotional message is announced displayed in second place when the caller requests businesses in that category.
  • Audio Coupons can be purchased with a credit card on the Talk41 1.com web site, or on the Talk411 telephone service, (vi) Get reports on access/call patterns in their category, (vii) Get consumer ratings of their businesses upon participation in the "BayHits" discount program. This program allows registered users to get discounts from participating businesses in return for feedback on products/services delivered.
  • Consumers can, for example: (i) Obtain free of service charges, unlike the 411 directory services which cost about $0.46 to about $0.95 (local phone charges still apply) get local business information, (ii) Have hassle free navigation with voice on the telephone, focused web site with local business information on the Talk41 l.com web site, (iii) Obtain more information than just the number and address for the local businesses.
  • a caller or user (whether the business or other organization providing talk411 information of a customer making an inquiry) needs to be able to interact with the system with the utmost ease and simplicity. Speed and simplicity should desirably only be compromised for accuracy.
  • the effectiveness of the system and service may be assessed and measured with the ability of the caller/user getting information with no more than one phone call in most cases.
  • a key benefit for the callers are convenience and savings from local merchants/businesses, hence improving the quality of their lives.
  • Key benefits for the merchant/business/organization are the speed of deploying promotions (or other information) at a low cost, ability to get quick feedback from the marketplace and change their promotion or message dynamically, hence generating quality self- targeted leads.
  • the first impression of the users should desirably be to fun, fast, accurate, and helpful information retrieval.
  • limit the registration to phone number and password only and direct potential member to web site to complete the registration, perhaps offering some incentives for such web based registration.
  • Simple navigation using a limited number of intuitive key words to access information with a special lead phrase are desirably used.
  • a greeting at entry and hang-up may be provided.
  • response time it is desirable that the wait not exceed the typical time for operator assisted directory assistance, and a wait time of no longer than about 10 seconds (and preferably less than about 5 seconds) for information.
  • the location can be any designation, for example any or all of an address, a city, a zip code, or major cross streets or other recognizable location. If a location is not specified, then various rules or policies may be used to select, for example, in one embodiment, three possibilities are deliver by city or street names. In the event that there is more than one of the business in the area searched, for example there may be three Radio Shack locations in the search area, then the caller may be presented with three alternatives and locations selected according to any scheme, and asked for a decision with which to connect. In either case, the selection may be made either by the number or by the location.
  • Embodiments of the inventive system and method provide optional features including "more” information, and a "bayhit” feature.
  • the "more” feature is provided for business or organizations that prefer to provide additional information than standard phone number, address, type of business information.
  • N "bayhit” feature relates to a rating popularity or recommendation feature as described elsewhere herein. Businesses or organization, or specific locations of a business or organization, having more information may be identified using a distinctive sound, tone, word, or other audible indicator to identify them as having "more” information that may be accessed. Use of a distinctive sound reduces the time required to communicate the information and also reduces the computational burden of delivering additional spoken words.
  • the bayhit feature (or equivalent by another name) may be indicated by a different sound, tone, or word than the "more” feature, and businesses or organization that are both more and bayhit may be identified with some third set of audio indicators or hybrid or composite sounds.
  • the audio coupon may advantageously be played or enunciated at the time the phone number is delivered.
  • the caller may be given an option as to delivery of the audio coupon.
  • the system advantageously retrieves a predetermined number of the closest matches to the requested category and location.
  • providing five or more or the closest matches might be appropriate with due consideration given to the brevity of delivering so many alternatives.
  • the names of the matches are delivered along with the availability of any coupons (when the coupon feature is implemented) and optionally the content of the coupon.
  • Relevancy may be based on a user location input, this location input may be expected to optionally be provided automatically by caller identification based location determining mechanisms, cell phone base station triangulation mechanisms, or phone based Global Positioning System (GPS) location determining mechanisms.
  • caller identification based location determining mechanisms may be expected to optionally be provided automatically by caller identification based location determining mechanisms, cell phone base station triangulation mechanisms, or phone based Global Positioning System (GPS) location determining mechanisms.
  • GPS Global Positioning System
  • the caller Upon delivering the predetermined number of matching (or most nearly matching businesses or organizations), the caller is requested to chose from these or to request delivery of additional matches. Either a predetermined number of additional matches may be delivered or the system may request the caller specify how many additional matches, with an optional limit imposed by the system to reduce the burden on system resources and the total connect time. In one embodiment, the number of matches delivered at each stage is three matches, and the total number of additional requests that can be made is four. More generally, the number of matches delivered for each request and the number of requests for additional matches for the same call may be limited to suit local markets and customs.
  • the system desirably recognize the caller's choice by business name or by either a vocalized queue number or phone keypad press corresponding to the number. IF one of the matches has more than one location in the relevant search area the caller may be presented with the alternative locations or address information when that business is selected. In this regard, the system may filter a business that has multiple locations so that the predetermined number of matches are to different businesses rather than to multiple locations of the same business entity. If the caller is interested in that business entity, then the caller is provided with information as to the multiple locations and asked to select or choose the desired location.
  • the phone number for the selected business or organization (and location) is then delivered and the caller is optionally but advantageously asked whether the caller wishes to be connected.
  • the ability to be connected is particularly advantageous for a caller using a mobile phone where materials to write the phone number may not be readily available.
  • Automatic connection is advantageously free to the caller and may be free to the business or may incur some additional fees.
  • Various alternative credit, bonus, fee and non-fee based mechanisms may be implemented.
  • the caller may be desirable for the caller to provide a phone number to a particular business name and location, and to have the system search and verify a business name by saying the phone number, and getting the business name back with the coupon if available.
  • This feature may be referred to as a reverse search by phone number with optional coupon retrieval. This feature permits the caller to effectively make an inquiry as to the availability of a coupon offer at a particular business establishment, to make comparisons between business establishments, and to make a decision accordingly.
  • a coupon request may be made by the caller asking for a category alone, a category and location, or any other item or combination of items on which the database may be queried. Based on the results of such a category or other criteria search, the coupon may be provided. Note that although this and the above discussion focus on the voice based system, these coupon mechanisms (particularly the ability to perform directed searches on the basis of business identity and category) may also be implemented for the Internet based service.
  • audio coupons delivered in this manner should have a maximum permitted length.
  • coupons are limited to 10 seconds and in another embodiment to 5 seconds or less.
  • Caller consumers are not generally interested in receiving additional advertising, rather they want to know the essence of the offer, what discount, free item, cost saving, or other enticement is being offered, and no more. They are then free to make an inquiry once connected to the business, merchant, or other organization as to further information. Again, there is desirability of satisfying the needs of the caller consumer, the desires of the business, and the practicalities of system implementation where the number of lines, processors, servers, and other software and hardware infrastructure and impacted by call volume and duration.
  • the system dials and connects the call, checks for an answering machine tone, delivers a message that informs the business that this is a Talk41 1 call and identifies the coupon promised to the caller, and transfers connection with the business to the caller.
  • the caller may be connected to the business during this hand over or transfer process, but normally this is disadvantageous as it appears to add undue complexity to a process that the caller should view as transparent and effortless.
  • the caller can take action to terminate the call with the business and reconnect with the talk411 system.
  • This predetermined time period is anticipated to be in the 15 second to 2 minute time frame, and nominally about 30 seconds so that if the caller determines the selected business does not satisfy the need of the caller, he or she can try again without undue difficulty or expense.
  • the caller action involves pressing a key or combination of keys so that commands to the system may be clearly distinguished from a verbal discussion with the proprietor or employee of the business.
  • a tone signaling scheme will likely be more readily inte ⁇ reted than a voice command, particularly if automated switching and routing is involved.
  • Embodiments of the system provide the ability for a caller to rate or otherwise provide positive or negative feedback regarding a talk411 member business or organization.
  • the rating of any particular member should be limited in some way to avoid the possibility of a single caller, unduly influencing a member rating in either an absolute or statistical sense.
  • the limit may be rules that provide for a maximum total number of rating inputs for a business, a limited number per month or per year, or some other rating restriction rules. The goal is to provide other callers with a rating that fairly represents the opinion of all callers that have had an experience with the business, and not to allow either an individual that has had a bad experience to rate the business negatively multiple times.
  • Control mechanisms would also be desirable to keep the proprietor of the member business from self-rating in an unwarranted manner. These controls may involve technological mechanisms and/or legal contractual agreements with the member businesses.
  • a caller desiring to rate a member business has to sign on or register in some manner so as to identify themselves. For example, the caller may sign on to the system by saying a phone number and a password.
  • Individuals desiring to participate in rating member businesses may be provided with certain benefits, points in a points or award program, discounts, or other promotional and/or marketing enticements for their participation.
  • the system implements a Rate2save points program wherein a registered caller earns points that may be redeemed for goods or services in exchange for rating businesses of organizations.
  • registration is not required, however, the rating made is associated with the caller's telephone number if the system can determine the callers caller ID so that multiple ratings of the same business from a single telephone number cannot be made, or cannot be made within predetermined time intervals.
  • a message may also be played indicating that the caller's telephone number will be examined and compared for the purpose of ratings control and requesting permission to continue.
  • a caller will say the name of the business, location or other information such as phone number to quickly identify the business and rate on some predetermined scale, such as for example, a scale of 1 - 10, or any other numerical or non-numerical scale that permits the caller to use other adjectives such as for example poor, good, great, exceptional, too expensive, no parking, or any rating information that would be of use to a later customer. Ratings in multiple areas may also be permitted. For example, a restaurant may be rated according to food quality, food portion, service, price, wine selection, parking availability, or other factors of interest to potential customers. As system evolves and direct or indirect feedback is obtained from businesses and consumers, particular rating categories and scales found to be useful will be identified for the various categories of business and the ratings categories may be updated and modified accordingly.
  • hit sights For embodiments of the invention that provide a hit or popular site (restaurants, night clubs, coffee shops, bookstores, etc), best sites, highly recommended sites, or the like listing or directory such hit sights are referred to as Bayhits after the San Francisco Bay / Silicon Valley region where the invention was developed. Other regional areas may adopt other names for such attractions, such as Twin-City Hits (Minneapolis -St. Paul Area), LAHits (Los Angeles), or any other title whether containing "hits" or not.
  • Twin-City Hits Minneapolis -St. Paul Area
  • LAHits Lis Angeles
  • While many of the advantageous features of the inventive system and method are provided using voice recognition and text to speech processing over existing telephony infrastructure, other administrative facets may advantageously be taken care of using an Internet Web site or live agent. Live agents may also be occasionally used to set up and register. These administrative features may only be implemented where the clarity and preciseness of textual input and/or the volume of input increase the efficiency and accuracy of the process. These administrative procedures may also involve hybrid voice and Internet components. For example, an initial stage of registration or sign-up may be made over the telephone with minimal information provided over the telephone. The callers are then directed to a web site where registration may be completed. This form of registration may apply to either or both of businesses and consumers, for example consumers enrolled in the Rate2save points program.
  • Merchants, businesses, and other organizations may register by providing their phone number and a password for a trial period. This may include a simple registration with minimal information obtained over the telephone and direct callers to the web site to complete the registration in order to take full advantage of all merchant and business services and options. In instances where further registration beyond the trial period are required, especially if the additional registration is required for the merchant or business to avail themselves of additional capabilities, such registration may be completed over the telephone, over the Internet, or through the use of paper document of electronic contracts or agreements.
  • the basic information for the business such as formal and common name of the business, business address, business phone number, are verified so that the business may subscribe to the inventive Talk411 service. If the business or organization is not in the database contact is made though a customer service representative or other means. Credit card or other bank or payment information is provided to initiate of continue subscription to the service. The business or organization is then provided with an opportunity to record a message to be heard by a caller when callers ask for more information about the business. In one embodiment, the duration of this business introductory message is limited to a predetermined period of time, usually between 10 seconds and 1 minute, and more typically not to exceed
  • a business or organization once subscribed (or registered in a trial period), may purchase sponsorship for the category they belong to by specifying the category, number of months category sponsorship is desired, and level of sponsorship.
  • the sponsor business is put in an ordered list or queue (or random pool) along with others who have paid for the same level of sponsorship. When its turn comes (whether through a sequential ordering or via a random or statistically determined selection), the name and phone number
  • Level 1 consists of those sponsors who paid the highest to have a greater frequency of presentation to callers (a higher hit rate), level 2 with lower frequency or lower hit rate and cost, and level 3 with the lowest frequency and hit rate and cost.
  • a category sponsor may pay for a fixed number of placements per week or per month.
  • Other placement parameters may pertain to geography, time of day, length of message, or other parameters that may tend to drive more business volume to the category sponsor. While category sponsorship is one type of sponsorship that may be implemented, sponsorship opportunities may exist for other aspects of the Talk411 service.
  • Voice or audio coupons may be created and purchased once a business or organization have subscribed or registered.
  • a merchant will telephone and specify the number of coupons he or she wants to purchase and provide credit card information or other billing information and confirm the purchase.
  • coupons will be bought in quantities in excess of 1000, though there are no limits on the quantity that may or must be purchased as any single transaction.
  • the message that will be played in the form of a coupon is recorded (by the merchant or an agent of the merchant (such as a hired talent), along with the name and phone number of the business. This coupon message will not normally exceed a predefined duration, though different durations may be imposed for different types of business, membership entries, geographic regions, or other criteria.
  • the voice or audio coupon will not exceed 30 seconds in duration, more usually not exceed 15 seconds, and frequently will not exceed about five seconds. Some flexibility may be provided according to a minimum and/or maximum information that the merchant is entitled to record.
  • the merchant is notified by the system that this is a Talk411 call and play the same coupon message to remind the merchant (or an employee of the merchant) about their promotion so as to avoid any ambiguity or problem associated with an uniformed employee contesting the caller's (now customer's) entitlement to the coupon offering.
  • a simple cumulative rating that is compiled from the raw ratings of the users is delivered when the merchant calls to check the rating.
  • Security protections such as password or numerical code may be required to access such ratings, particularly when greater detail is provided to the merchant than to the general public.
  • Other embodiments provide for callers to rate the business on multiple criteria as may be relevant to the category.
  • inventive voice or speech based system provides numerous features, capabilities, and advantages over existing systems.
  • inventive system and method also provide for both separate Internet or world wide web based interface as well as to an integrated system providing both voice/telephone accessed and character/Internet based access. All, or nearly all, of the information described for the voice/telephone accessed features and capabilities are also available for access through a computer graphical user interface
  • GUI over the web. It is not required to be able to interact with the web-based user interface with voice, or to connect the user to the merchant with the phone in this phase, however it is desirable to be able to play back the messages during an Internet web-based session that were recorded on the phone. It may even be desirable to allow a user to record a message using local available computer resources and upload the message to the system, however, this is not required.
  • the user is provided with the capability to select a business and connect to the business using the Internet with Voice Over Internet Protocol based technology.
  • the graphical user interface provides a differentiation between a recorded message versus a text message prior to someone clicking the prompt, so the user expectation is set properly.
  • the web pages are desirably sized and formatted to display on a single viewing screen without the need to scroll.
  • buttons and menus are advantageously provided where access and interaction is as simple as clicking on the available selections, and a single mouse click is used where possible.
  • Quick page access is a high priority to encourage use without undue effort, therefore any graphics are desirably used judiciously only where necessary.
  • Simple and clear icons, tabs, drop downs, and the like features are used.
  • Some space may be reserved for ad banners and other content that may provide useful information for the user or a source of revenue for the site, site operator, or talk411 provider. Coupons will desirably be highlighted so the user can easily understand that they are coupons, and provide facility for printing either as part of the web page or via a separate but readily available free program.
  • Information storage, search, and retrieval are desirably fast and efficient to minimize any delays.
  • Search and retrieval of business/merchant information such as name, phone number, location, address (any elements of the address) from any of the known data above.
  • Search by category is supported, desirably with result priority given to category sponsors, and to those with coupons.
  • Reserve a small section where BayHit businesses can be featured with a distinct graphic, linking to their business information.
  • the result of a search will yield a short list of found or matched (or closest match) businesses with the business information for a predetermined number of (typically 2-5 and usually 3), category sponsors visible in most of the page. The user may then click and select others in the list if not satisfied with the sponsors found.
  • Each result page served with the next button should desirably serve the same format as above, except changing the sponsors and the short list.
  • subscribed business users may be provided with one page where they can provide essential business information that will show as a result to a user search. They can select from an available selection of templates, upload their logo or graphics, and input their information. There will be a dedicated part for displaying their coupons if purchased. Custom pages not conforming to the template may be available as part of the standard subscription, available only to higher membership levels, or available at a higher fee. In one embodiment, customization is permitted but only within a certain range of options that preserve the talk411 look and feel. A dedicated management page may optionally be made available for the subscribers. Subscribed business will be able to track their coupon promotions, category sponsorships, how they are rated in the category, and their account history. Other information may also be provided.
  • a business, merchant, or organization are able to type in or otherwise enter or present the Coupon message or the Business information into the provided field which will then be translated into speech when a caller asks for the information over the telephone.
  • Business/Merchant will be able to type in the information, and request the information to be recorded by a professional voice (or audio) talent from the list provided on the web site.
  • This alternative feature is provided via an optional voice talent portal (VTP).
  • VTP voice talent portal
  • a user is able to select, such as by clicking on an icon or using a pull down menu, and hear a recorded clip of a message delivered by a registered voice talent as a preview.
  • Each registered voice talent will have a private page where pending jobs will be posted and messages to be recorded displayed. Once the voice talent completes the recording of the job, he/she will be able to upload the wave file (or other file format) for the recording to their private page, and verify and publish it for review by the business. A phone alert with the recorded message will be made to the merchant or organization to confirm and publish the message.
  • the design of the system is guided in part based on empirical results. These results suggest that the voice-recognition should desirably be highly accurate, preferably accurate better than 95% of the time. Automated announcements should desirably be a compelling and use consumer tested voice. Response time to voice inputs should ideally be no more than about 1 second in most cases, and typically no more than 3 seconds when searching.
  • Callers should desirably be able to interrupt any time and the system and method should desirably be able to deal with this. Transactions and identities should desirably be kept secure by the high standards.
  • Transactions and identities should desirably be kept secure by the high standards.
  • For consumer calls when the system does not understand after two tries, it should desirably bounce to a live operator, or take other action in an attempt to understand the consumer. For example, if a live operator is not available, the system may use a more sophisticated (but possibly slower) processing algorithm to understand the speech.
  • business subscriber calls when they say,
  • help there should desirably be some automated help, and when they say “save me” they should desirably be connected to a customer service representative.
  • FIG. 1 shows the top or high level system architecture 102 and consumer user (or caller) 101 access points for a preferred embodiment of the invention.
  • a cellular phone 106 (or other wireless device), standard telephone 118, either an analog (POTS) or digital, can be connected to the architecture 102 using a standard telecommunication link 120, such as a standard telephone line 122, ISDN line 124, cable 126 or DSL line 128.
  • POTS analog
  • a standard telecommunication link 120 such as a standard telephone line 122, ISDN line 124, cable 126 or DSL line 128.
  • POTS analog
  • a cellular phone 106 it is understood that there is a cellular base station 108 and a cellular switch 110 interposed between the cellular telephone 106 and the PSTN 112.
  • the incoming call 130 from the cell phone 106 or the telephone 118 goes through a PSTN 112 and telephone switch 114 and gets picked up by the Speech server 116, which is connected on one side 117 to the telephone switch 115, and on the other side 119 to a computer network 130 such as for example the Internet.
  • WAP Wireless Access Protocol
  • Data may therefore be communicated in a WAP compatible format.
  • WAP is a standard set of protocols for wireless Internet access which can deliver special Web pages to portable devices such as smartphones and palm-portables. In current versions it relies on the use of WML instead of HTML and so on. These systems generally support Java and Windows CE, among others. WAP Version 1.0 has been released and is hereby inco ⁇ orated by reference along with variants and extensions thereof. Other versions are under development.
  • the user (or caller) 101 can access the information or data 132 that resides in a database 134 within the Information Center 136 and the Web Site 138 through interactive voice commands 140 and/or through keypad presses 142 on the caller's 101 device, such as on the cellular telephone 106 or standard wired telephone 118.
  • the caller's voice commands 140 are recognized and translated into one of the variations of Voice Extensible Markup Language (VXML, VoiceXML, or VOXML) commands 144 by the Speech Server 116 using a speech-to-text conversion engine 146 and once translated into VXML are used to retrieve the information 132 from the Information
  • VXML Voice Extensible Markup Language
  • VoiceXML VoiceXML
  • VOXML Voice Extensible Markup Language
  • VXML is an extension or elaboration on the XML (Extensible Markup Language) standard known to workers in the art and not described in greater detail here.
  • Information concerning the VXML Forum is available on the world- wide- web at http://www.vxmlforum.org/ and Version 1.0 of the VoiceXML specification dated 07 March 2000 which is hereby inco ⁇ orated by reference is available in Adobe Acrobat format at http://www.vxmlforum.org/specs/VoiceXML- 1 OO.pdf .
  • Speech server 116 also generates and plays back (presents) pre-recorded or synthesized menu commands 150 to the caller.
  • the system architecture connects 102 the information database 134 to the Internet 130 (or other local or global network of computers and/or information appliances) which can also be accessed with a display device 152 such as a personal computer (PC) equipped with a modem 154 (wired or wireless), a smart phone 156, other wireless phones or devices, a PDA or palmtop device 158 or any computer or other information appliance or device that can be connected to the Internet (or other local or global network) with the ability to display standard Hypertext Markup Language (HTML) pages or other formats inte ⁇ retable by the computer 152. Access to such wireless devices may use the Wireless Access Protocol (WAP) and/or other protocols or standards.
  • WAP Wireless Access Protocol
  • inventive structure and method are not limited to these particular formats and/or protocols or to the versions of these protocols in existence at the time the invention was made as those workers having ordinary skill in the art will appreciate the capabilities and features provided by these formats and protocols may be provided in other ways and that future versions of these formats and protocols will also support the inventive structure and method.
  • Embodiments of the inventive system may desirably inco ⁇ orate and utilize natural language speech recognition.
  • the user can naturally speak and the system inte ⁇ rets the user' s speech to extract the request or inquiry.
  • The provides additional flexibility for a user as that user does not need to know any particular commands or request rules or syntax.
  • Natural speech processing and artificial intelligence are known in the art and not described in greater detail here.
  • ASPs application service providers
  • FIG. 2 shows an embodiment of Speech Server 116, some of its functions and functional connectivity to receive a switched telephone call and to interact with the Internet 130.
  • Speech Server 116 performs several tasks such as the task of providing a Network Interface 160 to the analog or digital phone network that provides the switched phone call 162, Automatic Speech Recognition (ASR) 146 or speech-to-text conversion (STT), Text-to- Speech conversion (TTS) 148, runs the application or application program 164 that control and manages the phone calls 162 and the Interactive Voice Response (IVR) 166.
  • IVR refers to the interactive voice response which is conventionally a menu driven response provided in response to an input.
  • the Speech Server 116 is a personal computer equipped with Dialogic Antares automatic speech recognition boards and other products.
  • Dialogic Antares boards are available from Dialogic Co ⁇ oration, 1515 Route Ten, Parsippany, NJ 07054-4596 USA and on their web site at http://www.dialogic.com products/indx_abp.htm.
  • the incoming call 162 is answered by a network interface card 160, such as for example a
  • Dialogic network interface card (analog or digital).
  • a prompt is played to the caller 101 over the caller's device 106, 118 asking the caller to say the selected item 170 from the available selections on a voice or audio menu.
  • the application 164 passes the voice data to the auto speech recognition block 146, such as may be provided by a Dialogic AntaresTM board loaded with an Automatic Speech
  • ASR Automatic Speech Recognition
  • L&H Lernout & Hauspie
  • Fax 1-781-238-0986, http://www.lhs.com
  • Speech Works Speech Works International, Inc., 695 Atlantic Avenue, Boston, MA 02111, Tel: 617.428.4444, Fax: 617.428.1122, http://www.speechworks.com.
  • a Dialogic Antares board-based automatic speech recognizer (speech-to-text)
  • the 146 translates the voice data into ASCII text (or another code or symbols) that identifies the spoken words and returns a text or other symbolic representation of the results to the application 164.
  • the application 164 accesses, via for example a T-l line or faster Internet connection, the database 134 ofthe Information Center 136.
  • Real-time (or near- real-time), active vocabularies are generated at run-time using the database's 134 ASCII text or symbols.
  • the application uses the ASCII text from the database 134, passes it to a second Antares board 148 running a text-to-speech (TTS) algorithm.
  • TTS text-to-speech
  • Dialogic Antares D41E printed circuit card configured for a Microsoft Windows based computer that supports up to four analog telephone lines and provides a telephony interface and voice processing functions such as playing and recording speech messages and collecting or dialing DTMF digits.
  • the embodiment also utilizes a Dialogic Antares 2000/50 platform is used in one embodiment, and allows the Nuance Voice recognition system (or other voice recognition system with appropriate modification) to operate properly on the Dialogic platform by providing echo cancellation and end pointing services for the system).
  • the installed card will support up to 16 channels of voice recognition.
  • the Antares 2000 family platform provides a standalone open digital signal processor platform for telecommunication applications.
  • the Antares hardware platform provides four independent TI TMS320C31 32-bit floating-point DSPs running at 50 MHz, each with high-speed SRAM, enabling algorithms to be easily ported to the AntaresTM board.
  • Multiple memory options provide flexible solutions for different applications. For example, 512 KB (128 K words) or 2 MB (512 K words) local SRAM are available per DSP and 4 MB (1 M word) or 8 MB (2 M word) global DRAM memory are available per board.
  • SCbusTM (or PEBTM) connectivity allows standard access to off-the-shelf call processing products and provides the capability to build higher density systems. Up to 32-channel capability is available.
  • the Antares platform open development environment uses the Antares board and an ANSI C Compiler and Assembler/Linker for the DSPs.
  • SPOX support includes the run-time SPOX DSP operating system and SPOX-KNL kernel development kit. Additional software includes the Antares kernel , which integrates the DSPs into the SCbus system and a down loader to download software to the board.
  • a single Antares board can process up to 32 channels of digital voice and digital telephony information received from a network interface board via the SCbus.
  • the SCbus bit stream from the network interface board can contain both audio (digitized voice) and telephony signaling information.
  • the bit stream is applied to a SC2000 ASIC chip which has an internal switching matrix. Under firmware control, the SC2000 chip can connect any external bus time slot (1024 for the SCbus or 32 for the PEB) to any external bus time slot (1024 for the SCbus or 32 for the PEB) to any external bus time slot (1024 for the SCbus or 32 for the PEB) to any external bus time slot (1024 for the SCbus or 32 for the PEB) to any external bus time slot (1024 for the SCbus or 32 for the PEB) to any external bus time slot (1024 for the SCbus or 32 for the PEB) to any external bus time slot (1024 for the SCbus or 32 for the PEB) to any external bus time slot (1024 for the SCbus or 32 for the PEB) to any external bus time slot (1024 for the SCbus or 32 for the PEB) to any external bus time slot (1024 for the SCbus or 32 for the PEB) to any external bus time
  • TDM time division multiplexed bus time slot on the Antares board.
  • the Antares board uses four C31 DSPs, each with its own Static RAM (SRAM) to separate and process data for any assigned time slot.
  • SRAM Static RAM
  • Each DSP processes data based on downloaded firmware stored in its SRAM.
  • DRAM Dynamic RAM
  • Kernel services are available on every DSP.
  • the Antares local kernel provides all the necessary tools to read and write data to and from the TDM bus, which operates as a PCM (pulse code modulated) highway.
  • PCM pulse code modulated
  • the kernel can bring multiple frames of PCM data (up to 32 time slots each) into DSP SRAM from the TDM bus.
  • Each DSP connects to the local TDM bus via a serial interface.
  • the TDM bus is implemented with 32 time slots that are mapped into 1024 external SCbus time slots by the SC2000 chip.
  • Each SRAM contains the downloaded firmware that controls DSP processing and also provides the DSP with dedicated RAM for its data processing operations.
  • a control port on the Antares platform accepts operational commands from the host PC and monitors DSP-to-host PC interrupt status.
  • This control port controls Antares platform operations such as: DSP reset; communications with and firmware downloading from the host PC via shared global DRAM read write operations; the transmission of interrupts, control, and status information to and from the Antares platform via the global resource circuits.
  • DSP reset communications with and firmware downloading from the host PC via shared global DRAM read write operations
  • firmware to control each DSP's processing is downloaded from the host PC to the DSP's SRAM.
  • This downloadable firmware enables feature enhancement, application changes, and upgrades.
  • Dialogic Co ⁇ oration is an Intel company, with world headquarters at 1515 Route Ten,
  • the above described Dialogic hardware and the computer system in which it is installed inter-operates with an operating system (such as for example, a Microsoft Windows operating system, a Linux operating system, a Unix operating system) and application program software including device drivers.
  • an operating system such as for example, a Microsoft Windows operating system, a Linux operating system, a Unix operating system
  • application program software including device drivers.
  • the Talk411 system is an integration of commercially available software applications and custom applications. These commercial software and custom software components include operation under a Windows NT Workstation (v4.0 Service Patch 4) operating system environment which is available from Microsoft Co ⁇ oration of Redmond, Washington, USA.
  • the software may alternatively be adopted to interoperate with any ofthe other commercially available operating systems such as earlier and later versions of Microsoft Windows, such as for example Windows 2000 Professional; or non-Windows operating systems such as Linux, Unix, and others as are known in the art.
  • the core voice processing application is Parity software's VOS Version 6W
  • Parity VOS (V6W dated 07-1999) is available from Parity Software, Three Harbor Drive, Suite 110, Sausalito, CA 94965.
  • Parity Software is a Dialogic company which is owned by Intel Co ⁇ oration of Santa Clara, California.
  • Parity Active Data Objects RLL (ADO RLL) is also used and is available from Parity Software, Three Harbor Drive, Suite 110, Sausalito, CA 94965.
  • the Nuance voice recognition support the voice recognition requirements.
  • Nuance Speech Recognition Software (v6.2.1) is available from Nuance, 1005 Hamilton Court, Menlo Park, CA 94025.
  • a VOS-Nuance integration Run-Time Link Library (RLL) provides an interface between the Parity VOS software and the Nuance speech recognition system.
  • An embodiment of a VOS-Nuance Integration Run-Time Link Library (RLL) is available from TeleVoice, Inc., 1 1767 Katy Freeway - Suite 425, Houston, Texas 77079.
  • the Parity VOS system includes a VOS Script file or files which have been compiled to binary code which can be executed by the VOS runtime-engine.
  • VOS licenses are required both to run the system (a run time license) and to compile the VOS script (a developer license), both of which are available from Parity
  • Licenses are controlled by a hardware-key ("software sentinel” or “dongle”) that plugs into the parallel port ofthe system.
  • Parity VOS script language may be achieved by external modules known as Run-Time Link Libraries or RLLs. Parity
  • RLLs with Parity VOS and other RLLs which may be desired to implement additional features are available from third-party vendors.
  • RLLs are used for example, to enhance the VOS capabilities to include such features as voice recognition, text-to-speech, access to ODBC-compliant databases and convert numeric, date and currency values to speech.
  • the application specific program modules used for the inventive Talk411 system and method include a Nuance voice recognition interface module and a SpeechPro module which converts numeric, date and currency values to speech.
  • Other providers ofthe broad class of text-to-speech converters (of which numerals-to-speech conversion is a simple subset) are known in the art.
  • Parity VOS provides a similar RLL called SmoothTalker with the VOS application.
  • RLLs take the form of DLLs or Dynamic Link Libraries. RLLs are normally specified in a file (such as for example in a RLLS. TXT file) which must be referenced both when compiling and when executing. These programming and compilation conventions and procedures are known in the art and not described in greater detail herein.
  • script files used by the application are designated by a “ . vs” (yos script) or “.vh” extension
  • compiled VOS scripts are designated by a “.vx” extension.
  • Exemplary embodiments of selected ones ofthe files are summarized below. Additional detail is available in programming manuals and user documentation from the vendor, either by mail at the address provided elsewhere herein or via an Internet web site provided by each ofthe vendors.
  • PR.PAR Compiled pr.def file tone. def Stores PBX specific Tone configurations (generated by Dialogic
  • Parity VOS is used in the particular exemplary embodiment, the inventive system and method are not so limited and that those workers having ordinary skill in the art will appreciate that other programming language and methodological constructs may be used to achieve the invention, such as for example implementations using VoiceXML, C++, or other scripting or programming language or languages.
  • Rlls.txt File used to specify VOS RLLs for Run-Time
  • the master.vs file is the file specified when launching the Parity VOS run-time engine. It contains all of the code necessary to open up the database, initialize and configure the voice lines and the voice recognition channels and spawn or launch the Talk41 1 application for each voice line.
  • the presence and general structure of master.vs (and it's compiled partner master. vx) are known to workers using Parity VOS and other such commercial programs.
  • Master.vs for the talk41 1 system is a file that does some set up, such as, determining how many phone lines are installed, and launches one talk41 l.vs application for each ofthe installed lines.
  • vx first determines and defines the number of voice lines installed. This represents the number of simultaneous callers (usually use function that detects number of hardware channels of lines installed, but where the number is stable, this may be hard coded). (Restrictions may also be applied where the number of lines is greater than the number of software licenses.)
  • Master. vx then opens a connection to the database in master that is shared by the applications.
  • This database can be a local database either within the same physical box, co-located in a separate box or device, or located remotely.
  • the database is a Microsoft Access (Access 97) compatible database, however other database types and formats may be used.
  • Parity VOS includes database functionality, so that a database created in Access may be used by VOS without requiring Microsoft Access or separate database application program.
  • the same data base file (dictionary32.mdb) is used to store and retrieve all ofthe business, merchant, individual, or other organizational material for both the voice recognition based talk411 and the Internet based talk41 1.
  • Master.vx code instructions then initialize the Nuance voice recognition that sets up the mapping between the inbound call line and the Nuance resources. It also maps (or sets up an association or routing of) the Nuance processing channels to the voice caller channels. Master. vx then spawns or launches one copy of talk411.vx application for every line installed (subject to the number of licenses). There is an iterative loop that increments for the number of lines available. Operation then continues under the control of each talk41 l .vx compiled application.
  • the Talk41 1.vs script defines how the actual call handling and processing happens.
  • the general flow ofthe call from the talk41 1 code perspective is as follows.
  • a code procedure module "Initialize()" sets up the telephone line or communication channel specific parameters and prepares the line or channel to accept a call.
  • This line or channel may be wired or wireless and so the term line or channel is used in its broadest sense.
  • a "WaitForCall()" procedure waits for a ring signal on the line and when it arrives, takes the line off-hook and prepares it for processing.
  • An "Initialize Voice()" procedure module sets up the voice recognition parameters for this line and prepares for a recognition.
  • a "MainMenu()" procedure is where the call originates after the above initialization. At this point we set the Nuance Engine to the ".MainMenu” grammar and wait for a result. It returns a series of "slots" each with a valid keyword. The call is then processed based on the result of this recognition. If the caller said the name of a business, we give them the business information. For example, if the caller spoke a "Type" of business, the caller is provided with information on business of that type. Alternatively, the caller may either have spoken a command (such as help, quit, member information, or the like), in which case the command is handled, or the spoken utterance may not have matched any options in the grammar.
  • the talk41 l.vx file is the compiled version of the talk411.vs file.
  • a exemplary embodiment of the talk41 l .vs is now described.
  • the talk41 l .vs script file and its compiled version talk41 1.vx is application program that controls how a telephone call to the talk411 service is processed.
  • An interface exists which permits signals sent from any telephone handset or other telephony capable device to be received and recognized by a computer or more precisely by a circuit or PC board designed to handle and process such signals.
  • Telephony-to-computer and computer-to- telephony products made by Dialogic that provide this signaling and call processing capability are described elsewhere herein.
  • interaction between telephone systems and computers primarily relied on touch tone frequency signaling, such as may be accomplished by pressing a series telephone keypad buttons to generate readily recognizable signals, or equivalently to have a computer program generate such tones.
  • new technology supports spoken voice command and data in a system that provides voice recognition to inte ⁇ ret the electronic signal generated from the spoken voice, inte ⁇ ret that signal, and convert that inte ⁇ reted signal into one or more commands or data items that can be processed by the computer. Effectively, voice is used to give or respond to an inquiry.
  • voice recognition components are known in the art and therefore are not described here in greater detail. References are also provided to companies and organizations providing this type of technology as standard off-the-shelf commercial products.
  • the talk411.vs, talk41 1.vx, and other associated or related files provide an additional layer or layers on top ofthe conventional telephony hardware and software, the voice recognition software and possible optional hardware or DSPs (voice-to- computer), and computer-to-voice (also referred to as text- to- voice), that direct the system in handling a call in a particular way and to provide particular features.
  • the inventive software and system configuration used in conjunction with existing or customized voice recognition features constitute a novel computing and information processing machine and method.
  • Microsoft Windows is used as a foundation platform. In its native uncustomized form, it supports only touch tone signaling, and does not support voice processing or recognition in its native form. Nuance has provided a voice recognition software
  • a third major software component integrates the Nuance voice recognition software with the Parity VOS software so that the three components accept voice or speech input, inte ⁇ ret it, create a character or a string of characters, and provide that character of character string in the manner of one or more computer keystrokes or other digital data stream that is readily inte ⁇ retable by the computer.
  • the Parity VOS software handles the interface to the phone system (including how to answer a call when a phone ring indicator signal appears, how to play messages to the caller after it answers, and how to accept touch tone signals form the caller. It also handles branching to further alternative processing based on what action is selected by user, and handles hang-up or call termination by hanging up the line, and resetting the line and the system and software to receive the next call. In essence is it responsible for call control. In many applications it is the only application software that is needed, particularly if the system is a touch tone based systems.
  • the Parity VOS software version 6W does not have native voice recognition built in. Parity
  • VOS version 7 is available, and it is anticipated that Version 8 will be released shortly. Version 8 may provide some additional voice processing or recognition capability, or provide an interface or other integration with Nuance Voice Recognition (and possibly including others, such as Speech Works software).
  • the interface software is presently a piece of software that permits use of Nuance Voice Recognition software and returns a result from that software to VOS for processing.
  • Parity Software also provides a developers toolkit in the form of a programing language and compiler for that language, so that software code, including data structures and executable instructions, can be developed to tell the VOS software and other components of the system what to do.
  • These developer tool kits and documentation for their use are available from Parity Software and are not described in greater detail here.
  • the talk411.vs script is developed using this VOS toolkit and compiled into talk411.vx. This script and its compiled version executes within the hardware to provide control the particular manner in which call flow is handled, what inquiries and responses are permitted (in conjunction with various dictionary and grammar filed described elsewhere herein), how to inte ⁇ ret specif commands and situations. In essence, the talk411.vs and talk411.vx files control the life of a call.
  • the exemplary talk411.vs file includes the following components or functions: main menu ⁇ MainMenu() ⁇ , business information ⁇ BusinessInfo() ⁇ , connect to business ⁇ ConnectToBusiness() ⁇ , more business information ⁇ MoreBusinessInfo() ⁇ , recommend others ⁇ RecommendOthers() ⁇ , type menu ⁇ TypeMenu() ⁇ , modifier menu
  • the MainMenu() function lets the caller select a business, a type of business or enter the member area.
  • the BusinessInfo() function reads the business information from the database and speaks the business Name, Phone Number, and any optional features such as the BayHit Message, Promotional Message, and Coupon Message.
  • ModifierMenu() menu function is used if the caller only said a modifier such as Coupons or BayHits.
  • the Playl(Prompt) function will play a speech message that can be interrupted by the caller speaking, where the result of their speech is thrown away, and where the function returns true if the message was not interrupted and false if the message was interrupted.
  • the MainMenu() function portion ofthe talk41 1.vs file is duplicated in Table IV.
  • Exemplary portions of talk41 1. grammar code are also providd in Tables V and VI.
  • this function lets the caller select a business, a type of business or, entry to the member area. Variables used by the function are reset, a greeting is optionally played, and a voice response is received from the caller. A determination is then made as to whether they spook the name of a recognized business, or of a business type, and whether any business type modifiers were spoken. The function also monitors to determine if any commands were spoken, such as for example "Help", "Quit", or "Member".
  • the GetVoiceResponse(.MainMenu", Prompt, "MainMenu”) syntax results in playing a prompt entitled "MainMenu” that speaks: "Thank you for calling talk 411, please speak the name of a business or a type of business and we will get the information for you.”
  • the words or phrases are played back from earlier recordings, and that in a different embodiment they are synthesized based on the words or text. Either approach may be utilized and some implementations may utilize a combination of the two. In general, the speech quality is better using pre-recorded speech and is preferred. Table IV. Exemplary embodiment of Main Menu from talk41 l.vs script file. func MainMenu ( )
  • the ".mainmenu" parameter is a reference to a Nuance voice recognition grammar file that will be used in processing the voice response.
  • Nuance requires the identification or definition of each symbol, word, number, or phrase that a caller may say in response to the prompt.
  • Many or even most common words and numbers, as well as common phrases are defined in files provided by Nuance; however, current versions of the software do not allow unlimited words such as might be supported by desktop voice recognition applications. That additional level of flexibility and sophistication invariably require prior training by the speaker so that the speaker's voice characteristics are understood by the speech or voice processing software.
  • grammar file are six top-level grammars: Business, Type, Modifier Type, Type Modifier, Modifier, and Commands. Anything with a capital letter as the first letter, such as capitalized "Type” means that the item or type is defined later on in the file or in another file. Further down in the same talk411. grammar file appears a block of code pertaining to further definition of the main menu "Type" top-level grammar. In what follows are a set of definitions, including definitions for such groups as attorneys, auto parts, and the like as indicated in Table V.
  • talk41 1 businesses. grammer but it is defined here (under the .MainMenu).
  • the talk411 business. grammer is a file the contains all of the actual business names with variations on the way a caller may refer to a business. For example, a caller may speak either "Cisco” or “Cisco Systems” and mean the same business.
  • Modes are identified with or mapped in similar manner using GSL.
  • the parenthetic lowercase “coupon” is the actual word coupon that is spoken.
  • These are the things that can be used to modify business or type.
  • Japanese ?food Restaurants Japanese ?food Restaurants
  • the grammar files are ultimately compiled to create a grammar library that identifies all ofthe various permitted combinations and permutations that are matched to.
  • a library of spoken words and phrases are used that are used to match and recognize the permitted words and expressions.
  • the libraries are built using the spoken expressions of human speakers.
  • the talk41 1. dictionary file is a file that contains an explanation of words or phrases that are not in the Nuance engine. This file is needed when one compiles, however the file may be and frequently will have no entries. It contains the phonetic
  • the Nuance Voice Recognition System was developed using source files called "grammars" which define the range of speech a caller can say in response to a given prompt. Like VOS scripts, once a grammar has been defined, it too must be compiled using the "compile-grammar” command. A Nuance license has heretofore been required on a per-port basis and the Nuance license has been controlled by a software key.
  • Top-Level grammars are initiated from the VOS script.
  • Sub grammars are essentially the building blocks used to creating a Top-Level grammar.
  • Each Top-Level grammar defines all of the allowable responses a caller can say in response to a voice prompt.
  • a simple example of a top level grammar would be “.Confirm”.
  • the ".Confirm” grammar may be used in response to a voice prompt such as "Is this correct?"
  • the ".Confirm” would be constructed of two sub-grammars, "yes” and “no”. Each sub-grammar would include all ofthe various ways a caller could say
  • Grammar files are designated by a .grammar extension. At least two additional files are required before a grammar can be compiled: a .slot-definitions file and at least one .dictionary file.
  • the .slot_definitions file defines slots for natural language recognition.
  • the .dictionary file (or files), for example the talk411. dictionary file, defines words that are not in the Nuance grammar and thereby allows the total vocabulary which may be spoken and recognized to be extended.
  • Embodiments of the inventive system and method provide the following five vocabulary related files.
  • the "Talk411. dictionary” file defines words that are specific to the talk411 system and therefore not in the Nuance grammar. (It is noted that future versions of Nuance may be modified to include talk411 words.
  • the "Talk411.grammar” defines the Talk41 1 application grammar.
  • a "talk411.missing” file is a file that stores the result ofthe compile and indicates words not in the Nuance grammar.
  • the "Talk41 1.slot definitions” file defines "slots" for the natural language recognitions.
  • the talk411. grammar file is the main file that links any and all other grammar files provided by Nuance. Any file that is a ".grammar" file is defined by an include statement in the main talk411.grammar file. Most of these .grammar files are provided by Nuance, such as grammars for obtaining and processing commonly used credit card expiration dates, through their commercially available tool kits. Note that in addition to the talk41 1 specific grammars, the main talk41 1.grammar file includes an initial set of include statements that link the other grammars. A portion ofthe code is provided in Table VII so that this grammar structure and relationship is clearly set forth.
  • Some "included” files handle, for example, common expressions, yes/no grammar, number grammar, thousands grammar, credit card number grammar, credit card expiration date grammar, and a larger talk41 1 business grammar that is a grammar containing the talk411 business, merchant, and organization names.
  • all ofthe grammars except for the talk411 business grammar are provided by Nuance, and are common grammars that are used for many different voice recognition applications.
  • the "Talk41 IBusiness. grammar” lists all ofthe business in the system.
  • this Talk41 IBusiness. grammer is a dynamic grammar that can be added to or modified on the fly, that is as the system is up and running and being accessed by callers.
  • this grammar is a static grammar meaning each business is added manually. In this manual version, the grammar is recompiled and the system restarted after making any changes to this file.
  • An exemplary sample entry in the grammar is listed below:
  • the entry on the left is the actual phrase of set of words the caller must say or utter to select this business and the entry on the right is slot (Business) and the keyword (Straw Hat Pizza) that would be returned to the application for processing.
  • the entry for the keyword must match the corresponding entry in the database exactly for a match to occur.
  • entries in the database and in the grammar may be established for variations ofthe phrase. For example, there may be an additional entry for "Straw Hat” that does not include the word "Pizza”. Therefore the requirement for exact matching is not unreasonable given that all of the possible variations that may match are made available.
  • the talk41 1 business are just a list of business, such as straw hat pizza, Michael Italian foods, or other business names. These are just examples, and the names are put in a separate file for convenient administration.
  • a business will be able to sign up without any human intervention using technology currently being developed. Under these systems, it will merely be a matter of the user typing in the name, and allowing the system to process it. If it can be recognized without help that will suffice, however in some instances it is anticipated that a user may have to say the new name one or more times. It is also possible to specify the phonetic characteristics using GSL or other phonetic descriptive language, however this skill would not generally be available to lay people using the system.
  • Nuance also provides for "dynamic grammars" that permits adding entries to a running application on the fly. In this way, business subscription or membership may be implemented automatically without requiring system restart or the like.
  • Nuance and Talk41 1 grammar files are listed with a description as to their function in Table VIII. Table VIII.
  • Nuance and Talk 411 Grammar Files common.grammar Nuance provided grammar of common expressions
  • the inventive system and method also utilize a database file that stores Customer or Business information.
  • the database file (directory2.mdb) is created as a Microsoft Access 97 database file. Microsoft access was selected for this embodiment, as the other program modules making use of the database are able to store and retrieve information from the file without having Access program running. Other database structures or programs may alternatively be utilized.
  • the database named "directory2.mdb” includes a table named "Customers” and Field Names: MemberlD, CompanyName, PhoneNumber, Address, City, Region, Category, Specialty, BayHitScore, BayHitRates, Coupons, and CouponMessage.
  • These fields include entries which are clearly optional in character and pertain to certain specific features provided by the inventive system and method.
  • the BayHitScore, BayHitRates, Coupons, and CouponMessage fields provide novel informational features not present in existing systems or methods. In fact, nearly any database fields may be provided, it is merely a matter of identifying the type of information or content that is to be provided in response to a caller.
  • these informational or content items will include some identification for the business or entity (company name), telephone contact information (phone number), and address information (city and street address). It may also be desirable to provide some regional information separate from the city and street address (regions), and such region information may define a local neighborhood, or a multi-city or metropolitan area. Driving instructions from key recognized points of interest or locations may also be provided. Specialty may permit a member business to identify a particular product for which they are noted. For example, an Italian Restaurant may wish to identify themselves as specializing in Italian Vegetarian Cuisine, a night club as specializing in swing dance music, or a book store as specializing in first editions. There are a myriad of other fields that could be provided in the database and for response.
  • the same database is used for the voice accessed and for the Internet accessed system.
  • the information may necessarily be limited to a set of words, short phrases, or expressions to simplify the voice recognition and text-to speech processes.
  • a single database may include entries that are common to voice and Internet accessed systems and those that are not common. For example, an additional set of fields may be provided that are served only when accessing over the Internet.
  • two different databases may be provided which are optionally synchronized.
  • One embodiment maintains separate databases that are regional in character with the concept that a caller in New York City would not need to access the information for businesses or organizations in San Francisco.
  • Such a database is defined in a memory storage device (either in solid state memory or bulk storage device) on a computer system.
  • the computer may for example be a server computer, many types of which are known in the art.
  • the computer and any attached storage may be on the same machine or on a different machine. Structures and methods for building, maintaining, and accessing databases are known in the art and not described in greater detail here.
  • Table IX Several fields for an exemplary customer data are listed in Table IX. These fields include the Company name and phone number, as well as optional but advantageously provided data fields for address, city, region, category, specialty, bay hit score, bay hits rating, coupons, coupon message.
  • a member ID is also provided for accounting and identification pu ⁇ oses.
  • each software module In order for the system to be started, each software module must be launched in the proper order. In one embodiment, the order for launching and a description of each module is listed below. It is noted that this ordering applies the particular scripts, compiled scripts, programs, or procedures as they presently exist and that variations in this order may be anticipated for a different set of scripts, compiled scripts, programs, or procedures.
  • the "license.exe” is launched.
  • This is the Nuance license engine that must be launched with a valid "license.txt” file which controls the available recognition sessions that are allowed on a given system. (This program file is executed as a condition of using the Nuance software and is not otherwise required for system operation.)
  • VOS-Nuance interface "VRServer.exe” program is then launched to provide an interface between VOS and Nuance.
  • VOS6W.exe program file is launched.
  • This VOS6W.exe file provides the Parity VOS application engine, and must have an identified valid .vx (compiled VOS script) file specified in order to start.
  • a business user 201 is a user that is providing goods or services to consumers 101 where consumers also refer to the previous caller 101.
  • the business user is desirous of having their goods and services made available to consumers over the inventive system and in promoting their goods and services to consumers.
  • the business user 201 calls, and gets identified as a new business user utilizing a business user registration procedure the business user is asked to say certain business registration information, including for example their name, name of the business, phone number, credit card number, and/or other pertinent business information.
  • the system 102 compares the information provided by the new business user 201 with the information that resides in the database 134.
  • the database includes information regarding business so that the authenticity of the attempted registration can be verified with reasonable assurances. In the even that the information does not match, the system 102 may connect to other databases in an attempt to verify the authenticity or otherwise complete the registration. If the information matches, the user registration is completed.
  • the user is notified with a message providing the new business user with additional options or information, or to recommend trying to say the information again.
  • the new business user is prompted with the audio message "Sorry, but the information you have provided does not seem to be correct, say 'again' to start over. You can also hang up and call again, say 'help', or register at our web site www.Talk41 l.com".
  • Voice recognition can be used (in conjunction with an optionally previously stored authentic voice print of the business user) to authenticate the business user 201 in addition to or instead ofthe password 212 depending on the quality ofthe speech recognition technologies used and the quality ofthe line or other communication ling connecting the business user to the system at the time. So called "caller identification" available in some areas may also assist in verifying the identity ofthe business user where the business user would then be required to call from a registered telephone number.
  • the recorded short message is published then he or she is requested to provide an additional longer message that may be or include a special promotion, directions to the business location, or any other information that will provide additional information to the callers.
  • Both of these short message and long message are available for playback to callers and can also be viewed in text form by those who visit the web site and look up that particular business.
  • the new business user can change either message completely over the phone, or edit it word by word on the PC connected to the web site.
  • Opportunities to subscribe as a category sponsor, and to record and publish a category sponsor message, provide a service description, and to review or change messages are provided.
  • the messages provided by the business may either be a representation ofthe business representatives own speech which is preferred so that the quality and character of the voice is maintained, or the message may be computer synthesized speech. The later being necessary if the business chooses to provide or later modify the message using text input on a computer. As maintaining original speech may be somewhat cumbersome, additional fees may be levied on the business for providing actual speech as compared to synthetic speech.
  • the business user may be able to select from a set of available synthesized voice types so that the voice, even though not provided by the business directly, provides the intended feeling or emotion associated with the business. For example, a restaurant may wish to convey the feeling of romance.
  • the business user may choose a voice talent to record the message, or choose to have synthesized speech simulate a celebrity voice either living or deceased.
  • the business user calls the decision is made as to whether the calling business user is a new user or in existing user. Once it is determined that a new account needs to be established, registration of the new business account proceeds as described above wherein the calling business user provides certain business registration information register the new account (Step 302). The registration information provided by the registering new business user must be verified (Step 304) before the new business user interaction can continue. If verification cannot be made, then the interaction is terminated (Step 306), otherwise the business user is prompted to record a short message (Step 308).
  • the business user can then approve the recorded short message 214 or change the recorded short message until the business user is satisfied with the recorded short message an approves it for publication (Step 310) at which time the short message is published in a voice form and in text form on web site 134 (Step 312).
  • Business user 201 is then prompted to optional record either no message, and long message 218, or sponsor message 219 (Step 314). In the event that the business user to record no additional messages, the business user is thanked for providing the information (Step 334) and the interaction terminates (Step 336). If the business user chooses to record a long message than the business user records the long message (Step 314)
  • Step 318 a given on opportunity to approve the recorded message or change that message until the business user is satisfied with the recorded long message.
  • the long message is published (Step 320) and the business user is again the given opportunity to learn about category sponsorship (Step 322). If the business user declines the opportunity to learn about category sponsorship, the business user is thanked for providing the information (Step 334) and the interaction terminates (Step
  • a category sponsorship message is a message will come up when the caller requests businesses in a category without a specific business name in mind.
  • the system will play back the message of the sponsors in that category in a pre-determined order, random order or a dynamically defined order based on some policy or membership level or category, or other rules. If the business user declines the opportunity to subscribe, the business user is thanked for providing the information (Step 334) and the interaction terminates (Step 336). If the business user indicates a desire to subscribe, he or she is given opportunity to record a category sponsorship message (Step 328) and further opportunities to either approve or change the message until he or she is satisfied with the recorded category sponsor message (Step 330). The category sponsor message is then published (Step 332) and the business user is thanked for providing the information (Step 334) and the interaction terminates (Step 336).
  • the procedures associated with the repeat or registered business user interaction are substantially the same as, though not identical to, those just described for a new business user interaction.
  • the differences primarily concerned how the initial phase of the business user call to the system is handled.
  • the system receives the business user call and determines if it is a new user or an existing registered user (Step 352). If the system determines that it is a new business user, then the procedure already described relative to FIG. 3 is executed. However, if the system determines that an existing registered business user is calling into the system, it presumed step the existing business user wishes to make changes to one or more ofthe items of registration information or to one or more ofthe recorded messages (Step 354).
  • Step 356 If the business user decides after placing the call that he or she does not wish to make changes than the interaction terminates (Step 356), otherwise the business user is asked whether he or she wishes to change the short message, the long message, or the sponsor message (Step 358) and given opportunity to change one or more of these messages.
  • These messages are the Long, Short and Sponsor messages that he may have already input into the system via a phone or a personal computer of other information appliance.
  • additional steps can be introduced or some of the shown steps can be deleted from the interaction flow.
  • the Business User makes changes to the short message, he can be prompted to see whether he wants to make any changes to the long or the sponsor messages.
  • An example of the deletion of a step can be where the user is initially prompted to find out whether he wants to make changes and gets told that he can say anytime "make changes" and trigger the menu options.
  • the amount of consolidation largely depends on the speech recognition technologies employed and the key words chosen for the speech recognition vocabulary.
  • embodiments of the invention may largely or entirely eliminate the particular command and data extraction procedure set forth in the above described procedures and replace them in all or in part by a natural language recognition and extraction procedure that either listens to the user's request in free form speech and extracts commands and/or data from the user's speech, or extracts the commands and/or data in part and intelligently asks additional questions of the user for any added information.
  • the inventive system and method provide logic for conducting a dialog or conversation with the caller. Essentially the same or substantially the same information is exchanged between the user and the system but with a more flexible interface that is more familiar and enjoyable to the user.
  • FIG. 5 shows an exemplary embodiment of the General User interaction 402 such as might occur when a consumer calls for information.
  • a greeting message 382 is played back (Step 404) such as "This is TALK411 , your best source for local information” followed by a sponsor message 384, such as for example "brought to you by Dialsurf, bringing the web to your phone” (Step 406).
  • This sponsor message is typically a paid message by a sponsor.
  • the voice menu 386 is played back to the caller (Step 408), such as "Please say your selection: Restaurants, Lawyers, Auto dealers, etc.”.
  • Step 412 a request message
  • This criteria 388 is pre-determined and varies according to the type of business. In the case of restaurants for example, it may be "type of cuisine, city and zip code”. In case of lawyers, it may be for example "type of practice, city and zip code”.
  • the system will play back a number of business names pre-determined by the system (Step 418). These names can be picked from the database 134, in the requested category, in a pre-determined fashion, randomly or based on a dynamically changing criteria or some fixed set of rules.
  • the inventive system, method, and business model or operating method is applicable to a broad variety of business and merchant types including but not limited in any way to: restaurants, physicians and surgeons, auto parts, auto repair and service, pizza, auto dealers, department stores, attorneys/lawyers, dentists, hospitals, insurance, beauty salons, banks, plumbing contractors, florists, as well as many other types of businesses and services.
  • the system may also be adopted to professionals, or other individuals or organizations other than those specifically qualifying as businesses or merchants.
  • One example of pre-determined way is for the subscribed businesses to pay the corresponding fees to be included in the top category (Category #1), second category (Category #2), and the like for a specific time period.
  • An example of random procedure can be, as the name implies, based on a random number generator that picks a database record in the category requested.
  • An example of dynamically changing criteria is when users rate the businesses on a real time or periodic basis and which ever business is rated highest gets to be heard as the #1 (first named), #2 (next named), #3 (third named), and the like down a hierarchical or other list.
  • Step 454 the caller is invited to say the 'number of the menu selection' or to say 'more" (Step 454), if the caller responds with the number ofthe selection, the number and a short message is played back (Step 450) and he or she gets prompted with a questions such as "Say connect or 'more' for additional information" (Step 452). If the caller says "connect”, the caller is connected to the phone number that was found (Step 444). If the caller says "more” (Step 436) then the pre-recorded Long Message is played back (Step 438) with a question such as "Say connect or just hang up your phone” (Step 440).
  • Step 444 the caller gets connected to the phone number
  • Step 446 the service
  • different rules may be applied to permit the user to input different choices, however, in some situations it is desirable to have a user call in again when they have rethought their need rather than to tie up the connection for an extended period of time.
  • Step 456 If the caller responds by saying 'more', then additional pre-selections are played back to give the caller more and different choices (Step 456).
  • the caller may then either say the number of one ofthe new selections (Step 458) or terminate (Step 446).
  • the caller may be permitted to keep repeating the request for more choices until all choices available in the data base (or a predetermined number of such choices) have been presented to the caller. In either event, if the caller does not like or select one of the available choices, the call terminates (Step 446).
  • Step 420 If there is no match (Step 420), an answer such as "sorry, but we cannot find this category in our list, try again” is played back (Step 422). After a predetermined number of tries (for example, after two tries) if there is no match, the system will say something like "sorry, we could not find a business that matches your request, please call us again” (Step 424) and terminate the call (Step 426). If there is a close match, the system will play back the match to verify the request for further action (Step 428).
  • Step 450 the number and a short message is played back (Step 450) and he or she gets prompted with a questions such as "Say connect or 'more' for additional information" (Step 452). If the caller says “connect”, the caller is connected to the phone number that was found (Step 444). If the caller says “more” (Step 436) then the pre-recorded Long Message is played back (Step 438) with a question such as "Say connect or just hang up your phone” (Step 440). Based on the caller's selection, either the caller gets connected to the phone number (Step 444), or gets disconnected to the service (Step 446).
  • Step 448 If the caller says the name of a particular business instead (Step 448), then the phone number and the Short Message (refer to FIG. 3) will be played back (Step 450) with an additional prompt (Step 452), such as "Say 'connect' or 'more' for additional business information (the Long Message per FIG. 3).
  • Step 438 the user will be prompted once more whether the connect or terminate the call (Step 440).
  • FIG. 6 shows an exemplary implementation of the Directory Service of the information center 136, where the Web Server 471 serves VXML or HTML/XML pages 470, the LDAP Server (Lightweight Directory Access Protocol Server) 472 runs over TCP/IP and provides quick response to high volume lookup to the Database 473.
  • the Middleware 474 is the layer of software that integrates operations of the Web Server, LDAP Server, and the Database (or any additional software such as Transaction Server).
  • LDAP Servers and operation are known in the art, and is described, for example as of 23 March 2000, at: http://www.umich.edU/ ⁇ dirsvcv/ldap/doc/guides/slapd/l.html.
  • An example of a Web Server for high volume application such as TALK411 is the Microsoft Internet Information Server (IIS).
  • IIS Microsoft IIS runs on Windows NT Server.
  • LDAP also runs on Windows NT® 4.0 using Service Pack 4 or later, Windows® 2000, or Windows 95/98. All systems desirably have TCP/IP (or an equivalent capability) installed.
  • one desirable promotional feature involves issuing an audio coupon (also referred to as voice coupon) to a consumer user of the inventive system.
  • a consumer user is issued an audio coupon entitling the user to a promotion.
  • a promotion would entitle the user to a discount to be applied to the item or service purchased when the consumer user connects using the inventive system and method.
  • This discount for example 10 percent off, would only be available to the consumer user when using the inventive TALK411 system and is therefore an enticement for a consumer user to use the inventive system rather than dealing with the business through conventional means.
  • Other promotions might involve a buy one get one free offer, of free drink with order of food type offer, or any other the other variety of promotional offers typically made in the retail trade between merchants and consumers.
  • the audio coupon may be provided in a variety of ways.
  • the business would become aware that the consumer user contacted the business using the inventive TALK411 system and automatically give the consumer user a discount (or other promotional item) when the your was placed.
  • the consumer user might be given a coupon code which could only be available to a consumer user who ate utilize the inventive system, and a consumer user would provide this code to the merchant upon connection.
  • This code might be generic to the business or particularized to that specific transaction. Therefore, in addition to be self promotion aspects ofthe business model, the optional use of audio coupons also provides considerable business advantages.
  • the system inserts a message to the merchant after the call has been connected to identify the caller as a valid service user and to validate the audio coupon.
  • the use of audio coupons is integrated with the world wide web or Internet in that the audio coupons may be identified, stored to, retrieved from, or otherwise processed using the businesses or the inventive services web site.
  • the consumer user is not limited to using the coupon at the time it was earned, but may instead be collected for later redemption. This also affords an opportunity to obtain a printed copy of the coupon for use at any later time.
  • the audio coupon or a coupon derived from that coupon may be delivered to a personal data assistant or PDA (such as for example, an email enabled PALM VII) so that the PDA stores the coupon and serves as a medium for displaying the coupon the business, merchant, or organization.
  • PDA personal data assistant
  • one aspect ofthe inventive system, method, and business model is to collect money or other revenue in what ever form for each coupon delivered. It is also advantageous to collect money or other revenue for each coupon redeemed either as a fixed amount per coupon or as a portion (such as a percentage) of the sale, or both. Collection of revenue for each coupon delivered is separate from collection of revenue for each redemption or sale.
  • a Professional Message Conversion Service and Method Embodiment is optionally provided and provides a capability for converting recorded or typed (character based) promotional messages to professionally recorded message.
  • the user Once the user records the promotional message and saves it, the user is prompted to keep the message the way it is recorded, or to get the message re-recorded by a professional audio talent. Then the recorded message is sent to the mailbox ofthe audio talent. He or she records the message and saves it. An email is automatically sent to the user notifying that the message is recorded and will be activated once he/she reviews it.
  • the system has geographical context provided by a known location of the caller.
  • a known location of the caller For example, it is expected that mobile or cellular telephones will have capability to self locate, either using internal satellite-based Geographical Positioning System (GPS) means or by using various schemes known in the art for determining (or estimating) the location of a cellular telephone based on proximity to cellular base stations, hand-off s to base stations, and similar techniques.
  • GPS Geographical Positioning System
  • the inventive system provides for geographically-based recommendations, geographically-based promotions, as well as for geographically based audio coupon delivery.
  • the geographic proximity may be established according to some set of rules which may for example depend upon the density of business establishments in the local area.
  • the geographically directed audio coupons pertain to business within one to a few blocks of the callers location, in other embodiments to a mile or two, and in still other embodiments to the region ofthe city or town.
  • Caller ID may alternatively be used when an address associated with the telephone number is available.
  • consumer user's who call into the inventive service will be able to rate the particular business after they have utilized the businesses goods or services.
  • a consumer user having been referred to a restaurant using inventive system can later call in using a toll-free or free local phone number and provide feedback, such as in the form of a rating, relative to their experience. These ratings would then be compiled and made available to the local businesses. Ultimately such feedback would encourage the businesses to either maintain their high quality of service or to improve the quality of their service and/or goods in response to the consumer user's rating.
  • these ratings were also serve as an additional information source for consumer user's and would be available either or telephone or on Internet based website.
  • the business establishment having demonstrated a particularly high-level of goods or service based on these ratings would be placed into a category of highly rated businesses, such as "BayHits", would be available to the consumer user during his or her call into the system. So for example, when the user calls in to request "Italian restaurant in Palo Alto", if in one ofthe candidate restaurant played back to the caller happens to be a "BayHit" then that restaurant would be indemnified as such. For example, the caller might receive a message "II Fornaio - a
  • the rating or BayHit feature may be provided free to the businesses while in an alternative preferred embodiment businesses falling within the highly rated or "BayHit” category would be charged the nominal fee.
  • This rating and promotion scheme may be implement in a variety of ways and that the particular descriptions provided here are merely exemplary ofthe more general method.
  • the ratings may alternatively or additionally be provided on an Internet website (such as http://www.bavhits.com so that information obtained from caller's using inventive system method would be available to other individuals and businesses as well.
  • BayHits is the name for a promotional vehicle for local businesses to promote their products and services through the telephone and the web to the San Francisco Bay Area (or any other) population. Analogous names may be applied to other regional markets. Businesses offer a promotional discount of some sort in order to participate and consumers have to register and participate in the "ratings" in order to earn points redeemable towards subscribed discounts. For example, John Doe registers either at the web to be a BayHits Rater. He is immediately given some points (for example 500 points). He can spend these points in exchange for "deals" from participating businesses. Once he calls in and rates one of the participating vendors, he will earn points for each rating. Ratings will be made available to the participating vendors on the web site.
  • BayHit Furthermore, when callers call Talk411 , they will be able to ask for "BayHits" in a specific Category and get connected. In one business model participation in the BayHits program will be free until brand is recognized and momentum is built with participating vendors. Subscription to the Talk411 service is a prerequisite to become a participating business in the BayHit program.
  • An optional Audio Talent brokerage portal provides a list of audio (voice) talents. In a reserved area of the web site a list of audio talents are provided to the business users. Once the business user clicks on the audio talent, a pre-recorded message is played back. The user selects to purchase the services ofthe audio talent of choice, and provides payment means, such as for example credit card or other account information necessary for the transaction.
  • Audio Coupon redemption tracking mechanism by auto decrementing a total count of purchased coupons. Audio coupons are purchased through the phone service or through the web site. Once the coupon message is saved it is ready for delivery to users. Once the user gets connected to the business who has purchased the coupons, a coupon counter gets decremented by one. The balance of available coupons is updated in the business account information provided to the business on the phone or on the web site.
  • Promotional messages may be displayed in an Internet web-based search box. These messages may be rotated based on predetermined criteria, such as x many messages in y minutes. When the user hovers or clicks the mouse or other pointing device over the message box the message will get fixed. When the user hits return, a page from the Talk411 web site will be served with information about the business with the displayed message. If the user starts typing, the box will be cleared and accept the keystrokes of the user. Facsimile Business Information Input Service Embodiment provides for fax input of business information that gets scanned and converted to text and get played by TTS over the phone.
  • a fax back service will be provided, where when the user calls in to the Talk411 service and wants to be a member, he/she will be faxed a form page with fields of information that need to be filled out and faxed back to Talk411. This information will be scanned, edited and accepted to the system. Then a fax will be sent back to the new Business User notifying that a membership is formed and that the user should call Talk411 to check and accept the information.
  • a new business manager/owner calls in and registers to be a member. She/he is asked to provide a credit card number for business information verification and services. Then, she/he gets prompted to verify the existing information and subscribe 3 months free of charge to the basic services where she is told she can record her promotional or informational message at the prompt. Then she/he is given a chance to review the existing business information in the Talk411 database, edit information if not correct, review and publish. Then she is given a chance to review and publish her promotional or informational message and asked if she wants to learn about enhanced membership services. If she/he says yes, then she is briefly explained what those are and given a chance to subscribe. She/he is told she can always change her message, subscribe for additional services either on the telephone or on the Talk41 l.com web site.
  • FIG. 7 is a flow-chart diagram setting out the caller flow and interaction with the system for a method of interaction between the inventive system and a caller.
  • FIG. 7 The flow-chart of FIG. 7 is broken into FIG. 7 A - 7E for convenience in presentation.
  • the flow-chart includes call flow procedures as well as procedures for articulating messages to the caller and for receiving inputs from the caller and may serve as a basis for developing the call handling procedure and the talk41 l .vs script file.
  • FIG. 8 FIG. 8A-8B
  • an exemplary embodiment of a second different call handling procedure 700 is provided.
  • This is another example of a call handling procedure which, like the others provided here, may form the basis for a VOS, voiceXML, or C++ based script implementation. It starts 702 with receipt of a telephone call 703.
  • the call may be directed in one of two ways, to member services 704 or to information services 750.
  • coupon options and/or coupon pricing 725 are provided with information concerning coupon options and/or coupon pricing 725 and then given an opportunity to buy coupons or obtain additional information 727. If they instead select sponsor, they are provided with information concerning sponsorship category options 726 and then permitted to buy a sponsorship or obtain additional information 727. Should the business member then chose to receive neither coupon nor sponsor information, the call terminates or ends 749.
  • the caller chooses to receive additional information, they receive a benefits pitch 728 and are then given an opportunity to subscribe or verify 729, and if the caller business member chooses subscribe, a request is made for credit card information (or other account, billing, or payment information) 743. Credit card (or other account, billing, or payment information) is also collected if the earlier caller decision was to buy coupons or a category sponsorship 727. If the caller chooses to verify 729 and the information provided cannot be verified 730, then the caller is connected to customer service 731. Customer service may be an automated system, a live operator, or a combination of the two.
  • the caller is asked to select 744 if they wish to subscribe in which case the credit card information is obtained 743, end the call 745, or obtain more information (including at this point information that may only be available from a customer service representative 731 , but also from a benefits presentation or pitch 728.
  • the caller is asked whether they wish to record the promotional message or coupon over the phone at that time or using one of the available internet web based recording interfaces 747. Recall that the web based interfaces permit voice recording, typing the promotional item for automated text to speech conversion, utilizing a voice talent, using a synthesized celebrity or historical characters voice, or other option.
  • the caller If the caller chooses to record now, the caller is given an opportunity to record and verify the recording including opportunity to re-record until satisfied 748. After selecting the recording option, the caller is again asked to choose other options relating to sponsorship and coupons 724 as already described. It is noted that other options and call flow may be added or modifications made and that the particular choices available at any point may readily be altered. It is further noted with reference to the flow chart in FIG. 8, that after the caller selected member services 704 and identified themself as a new member 705, somewhat different information is required before being asked if they wish to subscribe or verify 729 or be connected to customer service 731 , where the afore described procedures are repeated for the new business.
  • the system asks if they are a business 737, and if they identify themself as a business provide their business name and telephone number 740. It the matching business is found, they are asked to subscribe or verify 729 and which point they make a selection and proceed as already described for existing members. In on the other hand they are not a business 737, they are requested to provide their telephone number and password 738, and to chose between setup over the telephone and web based set up 739. If they choose web-based set up, set up will occur over the web at a later time, and the call ends. If they do not select web set up, the caller is given an opportunity to learn about specials and avail themself of specials as desired 713.
  • the caller may then choose 714 to connect to a business or modify from a list 715. This opportunity is repeated so that the caller may select different available options. Once the caller chooses to connect to a business, the caller is connected 716, and the call to the service ends 717.
  • the members identity is obtained and verified 707. That caller member is then asked to choose between ratings, points check, or My 708. If the caller chooses My, the caller presented with specials 713 as previously described. If the caller selects ratings check the caller is advised ofthe business and rating 710, and if the caller selects check points, is advised ofthe current points earned 709. The rate/check/my loop iterates so that the caller can cycle through and obtain more than one type of information while connected.
  • the incoming call 703 may be a request to obtain information services from a non-member (or even from a member) caller 750.
  • the request will be for one of four possible information types, though other types may clearly be added, and some existing ones deleted.
  • a request may be for a yellow pages (YP) type request where the caller desires to find matches in a category 752.
  • YP yellow pages
  • a predetermined number of matches are provided, in this case three are provided at a time, with a total of up to nine if additional matches are requested 756. These numbers may readily be modified according to response and need.
  • a different request may be to find coupons 753. Agin three are initially provided in the requested category, with up to fifteen being provided as requested 757.
  • the request may also be a request to find BayHits 754, and in similar manner three matches are initially provided with up to nine total matches as requested 758. After any of these three options are exercised 752, 753, 754, the caller is given the opportunity to select one ofthe matches or request more 759.
  • the caller may alternatively make more specific 411-type (but with added features) directory assistance type request for the name of a business 751 , optionally specifying the name and coupon number 755. In this situation as well as when one of the other options were selected 759, the caller is given an opportunity 760 to connect to the business 763 or to obtain more information about the business 761 before connecting.
  • the caller chooses to connect 763, the call is announced along with any promotion or coupon promise 764, at which time the call to the service ends 765. If the caller requested more information 761, such information is provided, and the caller is asked to choose between connecting to the business 763, ending the call 765, or returning to information services 750.
  • the inventive system and method are also implemented from an Internet or other networked computing environment using conventional keyboard, mouse, and display interfaces or with the addition of voice input and speech-recognition capability. Therefore it should be understood that the system and method described relative to the voice recognition embodiment may be implemented from a personal computer or other information appliance that supports voice input, such as using a microphone and modem connection to connect over an analog or digital telephone or other radio- frequency, optical, cable, or wireless communication link. Alternatively, though with some sacrifice in ease of use, the voice recognition component may be substituted by a typed or character string based interface.
  • this section shows various web screen shots and the manner in which web pages are organized and navigated to provide an easy efficient and enjoyable experience. It is further noted that certain options are provided in the voice-recognition implementation to conduct some ofthe more detailed and time consuming steps at a later time using the talk411 inventive system.
  • the Talk411 Internet application is developed using Microsoft's IIS (Web Server) and Active Server Page (ASP) technology.
  • the Active Server Pages access the same database as the Voice system and deliver similar information.
  • the Internet web-based Talk411 application includes both HTML and ASP files, both of which file types are well known in the art.
  • ASP files are HTMP files that include some active content.
  • Exemplary html and asp code is provided for the getbusiness.asp file, a portion of which pertaining to "Dim Business" is duplicated in Table X.
  • Angle brackets identify key words, and anything that starts with a bracket and a % sign is active server page code.
  • the indicator "html” is a label for the html page.
  • Coupons. asp Gives user a list of businesses with coupons to select from (605)
  • GetTypes.asp Gives user a list of valid business for a specified type (608)
  • UpdateBusiness . asp Validates business information from MemberMenu.asp for update (610)
  • RateBusiness.asp Validates business rating and updates database (611)
  • Rate.asp Allows user to select a rating for a business (612)
  • Table XI Exemplary portion of active server page and html page code for talk411 Internet application.
  • FIG. 9 is an example ofthe Coupons.asp and gives user a list of businesses with coupons to select from (605).
  • This particular embodiment of the coupons web page interface provides Find A Business, Search By Type, BayHits! , Coupons, and Members button or menu bar.
  • the coupons web page also provides a scrollable list for
  • FIG. 10 is an example of a GetBusiness.asp page (607) and provides information for a particular business identified by the user.
  • the web page presents similar buttons to those on the Coupon web page so that the user can readily navigate between the pages using a common interface.
  • the get a business page also provides a menu button for selecting another business in the same category. In this case, other auto repair and service. BayHits and a rate me menu button are also provided.
  • FIG. 11 provides an example of a member menu web page generated by a MemberMenu.asp file that validates member ID and password, displays menu (609). It provides a box for entry of a member ID number and another box for entry of the member Password. The user enters this information and clicks a submit button to submit the information and begin processing. Security measures as are known in the art may optionally be implemented.
  • FIG. 12 provides an example of a member menu for purchasing and editing a coupon message.
  • Other embodiments ofthe web page for coupon generation provide templates for entering and arranging the information so that the member may try different alternatives for web based coupons that may be printed. It may also provide an interface for picking voices, sampling voice talent, selecting celebrity voices, and selecting simulated voices for text-to-speech conversion. These may be tested at the web site and made available for the telephone handset based implementation.
  • FIG. 13 is a diagrammatic illustration showing the relationships between the various web pages and web page asp files described in Table XL Various other paths between web pages are possible and may be used. Alternative web pages may be designed, and features from multiple web pages may be combined into a single or into fewer web pages.

Abstract

The invention provides a system (102), method, and business model for an information system and server having business self-promotion, audio coupon, ratings, and other features. A business or organization in which consumers call into a service using an ordinary telephone, PC, PDA, or other information appliance, and make requests in plain speech for information for goods and/or services, and the service provides responses for the request in plain speech in real-time. Provides an operating model for a telephone-based audio-interfaced goods and services information and referral service having merchant self promotion features, including database provider storing merchant information; an interface for inputting merchant information in the database and for retrieving and editing the information; and an interface for inputting voice commands and data and for receiving merchant information and processed information from the database in response to the input. The invention provides: a speech-to-text conversion engine (146), a text-to-speech conversion engine (148) and a speech server (117).

Description

VOICE-INTERACTIVE MARKETPLACE PROVIDING TIME AND MONEY SAVING BENEFITS AND REAL-TIME PROMOTION PUBLISHING AND FEEDBACK
Related Applications
This application claims the benefit of priority to U.S. Utility Application Serial
No. 09/534,700 filed 24 March 2000; and to U. S. Provisional Application Serial No. 60/219,079 filed 18 July 2000; each of which applications are herewith incorporated by reference.
Field of Invention
This invention pertains generally to voice-recognition based or speech- recognition-based interactive electronic commerce, and more particularly to systems, methods, and methods of doing business for providing automated interactive directory assistance information from a business or organization to a consumer in need of goods and/or services. The invention pertains even more particularly to systems, methods, and methods of doing business for providing automated speech-recognition driven query and response with business or event self-promotion features relative to businesses and events over ordinary wired or wireless telephone systems, PC systems, Personal Data Assistants (PDAs), and other communication and information appliances and devices.
BACKGROUND
Locating business establishments, such as for example a restaurant satisfying the particular need of a customer, has hereto for generally required access to printed directory listings, or more recently access to the World Wide Web using a personal computer. The availability of such references is frequently quite limited at the time the consumer desires to avail themselves of particular goods or services. For example, an out-of-town visitor driving in their automobile and approaching San Francisco might decide to stop and have dinner in a fine Italian restaurant and more particularly might like to have dinner in a fine Italian restaurant located in the particular area of the city., or to partake of a particular gastronomic delicacy. That visitor would likely not have a printed directory in their automobile or mobile access to the Internet to search for a restaurant satisfying their current need. Therefore, the visitor would likely either have to stop and asked for recommendations or drive around until a restaurant satisfying their needs might be located. Perhaps tens of minutes or hours later than desired. This approach is clearly inefficient, and the visitor may not have the dining experience expected if the restaurant they happen to see while driving turns out to have poor quality food, poor service, or both.
An analogous dilemma arises for other goods and services, whether provided to the local residents or to a visitor from out of the area. Frequently information is not available to a consumer (perspective purchaser) when he or she needs such information, and with the proliferation of a fast mobile lifestyle, there exists and need to provide such consumer information with readily available information appliances, such as conventional telephones, cellular phones, or other pocket or mobile devices that can provide connectivity to a service at minimum cost.
Frequently such device will have only sparse input/output capabilities. For example, a cellular telephone will typically have only a few display lines presenting text or symbolic data to a user, but has substantial audio input and audio output capability that can be used by the consumer. Heretofore, speech-to-text conversion has generally been limited to word processing and or computer or control applications as the has required fairly substantial processing power and memory within a computer device. For example, speech to text conversion products made by Dragon Systems (Lernout & Hauspie, Speech Products U.S.A., Inc., 52 Third Avenue, Burlington, MA 01803) generally require an Intel Pentium II or Pentium III microprocessor, AMD K-6 or Athlon processor, or the like running in excess of 450 MHz and 128 MB of memory. This technology is not available in conventional or mobile telephones at this time. Text- to-speech conversion has been known but has not been utilized to provide an interactive interface between consumers and consumer information from telephone systems. Such continuous speech recognition systems also usually require voice training with the ultimate user to provide satisfactory results. Furthermore, even for systems which provided some degree of consumer information over the telephone, such systems have either not attempted to generate business revenues through their operation, or have been unsuccessful in generating significant revenue in this manner. In part the lack of revenue success has been due to a low level of business participation in such systems, the inability of a business to control or modify their message in response to short-term business needs or to sell promote their businesses, as well as the lack of a particular incentive for a consumer to par take all of the information offered by the service. In fact, there may frequently have been a cost associated access to conventional information and referral services by consumers, even if only by virtue of the directory assistance by local telephone service providers.
Some conventional systems and methods have been limited to playback of recorded audio or audio playback corresponding to the content of web pages; but such systems have not integrated Internet or web-based interactions with voice or telephone based information provision. They have also frequently provided inferior voice interfaces that have annoyed callers rather than having provided a useful information experience.
Local as well as national businesses (really any business, merchant, marketing, or other organization) and their customers have made significant sacrifices in order to find each other and conduct a transaction for goods or services. These sacrifices result at least in part from the nature of the information resources available to match businesses and their customers, primary among these information sources are the printed Yellow Pages, on-line Yellow Pages, conventional 411 directory assistance, print media coupons, and on-line coupons.
Businesses (particularly local businesses) make sacrifices when relying on printed Yellow Pages because they are expensive, difficult to sort prices and options, cannot be changed once in print, and their effectiveness cannot readily be determined without additional time consuming and expensive surveys. From customers, or potential customers, perspective sacrifices are made because they are too heavy to carry, present too many choices so that it is difficult for a customer to figure out which one to call as they usually lack the information that may effectively provide the decision criteria, the information is usually stale (12-16 months old is typical), and many customers may forget to take advantage of coupon offers even when available. Printed
Yellow Pages and related types of large bulky paper directories clearly have other problems and limitations.
Even on-line or Internet Yellow Pages or directories have limitations that present sacrifices for both merchants and their potential customers. From the merchant's perspective, for example, there are many different on-line directory providers and a business must make a decision as to which one or ones to associate with, a personal computer is required to access and update such directories and may not be available when and where needed, updating may require payment to an ISP or programmer, the directory reaches only those with an Internet connection, and additional fees must normally be paid to place ad banners. From a customer's perspective, the information provided is frequently inaccurate and limited, an interne connection is required to access the information, and either a personal computer or WAP enabled device is required to access.
Local directory assistance and more recently national directory assistance through either a 411 type information service is also limited. From a local merchant's perspective, even when a potential customer receives a correct number the connection may not be made because either the customer must redial to get the number or pay an additional fee to be connected. From a customer's perspective, there is relatively easy access but at a cost of between about $0.50 and $1.00 per call. The directory, particularly when using the wireless directory assistance, is notorious for providing wrong numbers and there is no ability to get the correct number without re-dialing and paying an additional fee. Additional charges are also typically billed for requesting address information if available.
Reliance on print media coupons also entail sacrifices. For example, they are static and cannot be changed once in print. It takes a relatively long period of time between developing the coupon promotion and getting feedback as to its success or failure. Redemption dates cannot be changed, and one merchants coupons frequently get lost in the noise of other unrelated or competing coupons. Customers find it difficult to identify relevant coupons, find it a hassle and hardly worth the time or effort to cut them out or save them, and have a difficult time keeping tract of expiration dates. In some social settings, they present a questionable social image, and cannot always be carried with the person so they are not available when an occasion to use them arises.
On-line coupons are somewhat of an improvement however they still present issues. They are expensive to place at high traffic portals for local merchants, and may still be lost in the noise of other promotions. There is low traffic at coupon only sites and their reach is limited due to the need for an Internet connection. The use of coupons may also be favored by groups that may not have ready access to the Internet.
Customers also find them hard to locate and a hassle to pint, copy, or cut out. Access by many groups of persons, or by persons at the time they consider making a purchase may be limited.
In addition to the limitations and sacrifices made by merchants and customers relative to establishing a contact for the provision of goods or services, there currently exist additional problems and limitations relative to wireless data communication of advertising information. For example, infrastructure and methods have not been established or are in their infancy and must be developed before true wireless advertising can become widely available and accepted. Data ready telephone handsets exist but are in the minority and typically represent higher-end and more expensive models. Mobile phones in the hands of the majority of consumers do not provide for wireless interaction with merchants or for the receipt of audio advertising, marketing, or promotional information. Nor are they typically used for voice recognition applications. Merchants have been hesitant to participate in wireless data transactions and the lack of consumer interest in marketing messages generally have contributed to lack of development and progress in this area.
Techniques for building speech recognition applications are some what in their early development stage, however, some information is provided in the reference How to Build a Speech Recognition Application - A Style Guide for Telephony Dialogues, by Bruce Balentine et al., ISBN 0-9671278-1-5, published in 1999 by Enterprise
Integration Group, Inc., 2410 San Ramon Valley Blvd., Suite 225, San Ramone, California 94583; which reference text is hereby incorporated by reference. Therefore there remains a need for a system, method, and business operating model and method that overcome these and other limitations. More particularly, there remains a need for a method of doing business, an information, directory assistance service and referral service providing easy access by businesses and consumers, as well as providing business self promotion and consumer feedback features that encourage use, generate revenues, and provide incentives for use by both businesses and consumers. Such services should advantageously provide more information than traditional 41 1 type directory assistance in terms of greater amounts of information, greater accuracy and currency of information, award programs, and other features that encourage businesses to participate and consumers to call and utilize such systems.
SUMMARY
The invention provides a system, method, and business model for an information system and service having business self-promotion features, including voice coupons, discounts, sales promotions or other special deals, ratings, and different categories of sponsorship and visibility to the calling public. In one aspect, the inventive business model is directed to a business in which consumers call into a service using an ordinary telephone, PC, PDA, or other information appliance, and make requests in plain speech for information and positive referrals on goods and/or services, and the service provides responses to the request in plain speech in real-time over the same telephone, PC, PDA, or other information appliance. The business model may further include providing a facility for a business to communicate a self-promotion of the business to the requestor, as well as providing an audio promotional coupon (or other promotional item) to a requestor when the requestor completes a call to a business using the service.
In another aspect the inventive system extends these features to the Internet and inter-operates with the Internet in a web page based embodiment to provide an additional portal and greater interactive capability.
In another aspect, the invention provides an operating model for a telephone- based audio or speech recognition and text to voice interfaced goods and services information, enhanced 41 1 type directory assistance and referral service having merchant self-promotion features, comprising: an information database provider storing merchant information; a merchant interface for inputting merchant information into the database and for retrieving and editing the information; and a consumer interface for inputting voice commands and data and for receiving merchant information and processed information from the database in response to the input voice commands and data. The operating method may provide that the consumer interface comprises a telephone handset, and/or that the consumer also inputs non-voice commands and data from a keypad on the telephone handset. The operating model may also provide that the telephone handset comprises a mobile telephone.
In another aspect, the invention provides a system comprising: a speech-to-text conversion engine converting speech-based input commands and data received from an external device over a communication link into text-based commands and data; a data base storing a plurality of data items; a database search engine searching the database for a particular data item in response to the text-based command and data; a text-to- speech conversion engine generating a speech-based representation of the particular data item identified in the database search; and a speech server for communicating the speech-based representation of the particular data item to the external device.
In another aspect the invention provides audio coupons that operate as incentives for consumers to use the inventive system.
In still another aspect, the invention provides system and methods for submitting and retrieving ratings for goods and or services.
In yet another aspect, the inventive system and method assists in providing directory driven wireless commerce.
In yet another aspect, the inventive system and method provide a promotion and advertising channel that has geographical and sociological reach and the speed needed in today's dynamic financial and commercial markets.
In still another aspect, the inventive system and method provide a voice- interactive dynamic market place where individuals (particularly locals) call to save and businesses (particularly local merchants) call to publish sales promotions in real-time or near real-time. In still another aspect, the inventive system and method communicate information on an as requested basis that goes beyond the telephone number.
In yet another aspect, the inventive system and method provide instant savings with voice coupons published by local merchants.
In still another aspect, the inventive system and method provide an advertising free initial experience where voice or audio coupons are only heard attached to businesses that the caller has requested or searched for. In even another aspect, the inventive system and method provide for hands-free navigation with voice commands on any telephone or device supporting telephony.
In yet another aspect, the invention provides a low cost 411 directory assistance with added informational features that is easy to access.
In yet another aspect, the invention provides benefits to merchants including but not limited to targeted reach, instant promotion, instant or near- term feedback, and an optional free Internet web presence.
In still another aspect, the invention provides benefits to common carriers and telephone companies who save conventional 411 costs, process higher call volumes, and attract new customers. In yet another aspect, the invention provides a business model in which voice coupons are sold for distribution, a monthly fee is charged for subscription to the basic service, and additional charges are levied and collected for business category sponsorship.
In still another aspect, the invention provides a business model in which the providing organization partners with a print yellow page or other business directory publisher and/or with direct marketing organizations to subscribe merchants, businesses, individual professionals, or other organizations.
In a further aspect, the invention provides a business model in which the providing organization partners with a yellow page, wireless providers, telephone companies, and conventional 411 call centers to generate call traffic and thereby increase revenue.
In yet another aspect, the invention provides a business model in which conventional 411 directory assistance providers are replaced by the inventive system to save carriers 411 costs and to offer a new shared revenue channel and business model. In another aspect, the invention provides a business model in which new customers are solicited and provided with value added services.
In still another aspect, the invention provides a business model in which direct - Si - marketers are provided with a new coupon or promotion distribution channel.
In still another aspect, the invention provides a business model in which new business is brought to direct marketing organizations through the voice channel of the inventive system. In still another aspect, the invention provides a business model in which direct marketing organizations are provided with rapid marketing feedback for their clients and customers.
In still another aspect, the invention provides a business model in which print yellow page and print business directory publishers increase their existing revenues by increasing the yellow page or directory ad size sold by virtue of ad space required by a trademark and/or "voice coupon" icon or logo.
In still another aspect, the invention provides a business model in which print yellow page or other print directory publishers are provided with increased usage of the yellow pages or directory to show which vendors provide immediate savings. In still another aspect, the invention provides a business model in which yellow page or other print directory publishers are provided with an enhancement in their market positioning by virtue of their providing an more complete and compelling offering.
In still another aspect, the invention provides the closest locations for a particular requested category where the location of the caller is known from a caller location input, cellular signal triangulation, GPS position determination, or other position or proximity location means.
In still another aspect, the invention provides means for a merchant to interact with the system using either voice or web interface and select templates for the type of social, economic, political, age, gender, profession, or other image the merchant wants to portray and the type of promotion message the merchant wants to publish with a small number of mouse clicks, key strokes, or voice commands and prompts.
In still another aspect, the invention provides system and method for establishing user groups ("My" ... group) and communities based on lifestyles, usage patterns, interests and interest levels so that a registered user can subscribe to a group of multiple groups where merchant listings and other relevant information is given a priority. Such group or community may include a bicycling enthusiast who can subscribe to the bicycling group and when he calls, he says "My bike" and will then be offered services that has something to do with biking.
In still another aspect, the invention provides means for obtaining ratings, in which once a caller gets connected to the business through the talk411 service system, after a period of time measured in hours the service calls back the caller to ask for ratings or to collect feedback to improve service, where the caller has either registered to permit this inquiry or does not have caller ID blocked.
In still another aspect, the invention provides system and method for merchants to post customer testimonials so that future callers can hear these messages as a reference that may help make a choice of which merchant they want to be connected with, and optionally, as the service gets used callers can leave testimonial messages which the business can choose to post for other users access.
In still another aspect, the invention provides system and method for merchants to post key words on the voice system or internet site which can be used as a search term by the caller and that will be used as a navigation pointer to the posting merchant.
In a further aspect, this key word or phrase based search may be used in connection with a new product or service (new movie, new CD, new restaurant, or any other product or service) so that the caller may speak (or otherwise input) this key word and be matched with one or more merchants offering it. In an even further aspect, priority use of such key word may be auctioned to merchants for priority playback to callers.
In still another aspect, the invention provides system and method allowing merchants to post their promotional message or company information in multiple ways including to record their own, select from a voice talent who would record the text the merchant put in the system using voice, a personal computer, or in any other way; or just type in an let a text-to- speech processor convert it.
In still another aspect, the invention provides a coupon aggregation and translation engine and service allowing aggregation of different formatted coupons from online sites and reformat such coupons to a standard Dialsurf or other format that allows them to be played over the telephone phone. In still another aspect, the invention provides user call back to remind the user to rate a recently used local merchant service.
In still another aspect, the invention provides means for an over the telephone offer for a user to become an instant member to a community or coupon distribution list using email or other communication means once the user asks for a specific category. In a further aspect, if the user is registered and set up "my411" features, the user gets coupons for this category delivered to his "my411" voice box. In still another aspect, the invention provides merchants the ability to identify specials in their "additional information" or "coupon message" that is used to find that merchant when those words are used over the phone in the key word search mode
In still another aspect, the invention provides system and method for publishing secret words in local newspapers, Internet chat rooms and other community oriented online and offline boards, where they can be used on the phone or on our web site as a password to enter a sweepstake or win a prize, and to thereby provide a beneficial marketing tactic to increase sales of print papers and increase traffic to online portals
In still another aspect, the invention provides voice coupon targeting based on area code and prefix, city, geocoded location, GPS location, zip code, cross streets, vicinity of a milestone, major tourist areas, major landmarks, airports, night clubs, entertainment centers, shopping malls, restaurants, and the like.
In still another aspect, the invention provides means and business model for facilitating spread of secret words through the word of mouth initiating from someone in the company to provide access to privileged information, prizes, etc. to enhance the repeat user experience
In still another aspect, the invention provides the use of live agent interaction on a random call basis to provide a surprise element to enhance the user experience, where live agents can be celebrities.
In still another aspect, the invention provides user choice or automatic choice of voice of the Talk411 attendant based on gender, age, interest, and other selection criteria.
In still another aspect, the invention provides user choice to select synthesized voice of a celebrity as the automated attendant.
In still another aspect, the invention provides system and method for insertion of trivia questions where correct answer wins a prize from a local merchant.
In still another aspect, the invention provides a business model in which 41 1 directory assistance call centers are provided with an added revenue stream to improve slim profit margins, to increase call volume by providing a desirable information service, to keep their current carrier customers with value added services, to decrease their operating costs and overhead by reducing the number of human employees, and by overcoming severe local competition on wireless carriers. The invention also provides further apparatus, system, method, operating model and business method, computer program and software, and computer software program products that interoperate with the inventive systems and methods.
BRIEF DESCRIPTION OF THE DRAWINGS Additional advantages and features of the invention will become readily apparent upon reading the following detailed description and appended claims when taken in conjunction with reference to the following drawings, in which:
FIG. 1 is a diagrammatic illustration showing an exemplary embodiment of the inventive system. FIG. 2 is a diagrammatic illustration showing an exemplary embodiment of speech server functionality.
FIG. 3 is a diagrammatic illustration showing an exemplary embodiment of a new business user (merchant) interaction with the inventive system.
FIG. 4 is a diagrammatic illustration showing an exemplary embodiment of an existing registered business user interaction with the inventive system.
FIG. 5 is a diagrammatic illustration showing an embodiment of a general consumer user interaction with the inventive system.
FIG. 6 is a diagrammatic illustration showing an exemplary implementation of the inventive directory service on the Web. FIG. 7 is a diagrammatic illustration of a flow-chart for a method of interaction between the inventive system and a caller, including call flow procedures and procedures for articulating messages to the caller and for receiving inputs from the caller.
FIG. 8 is a diagrammatic illustration showing an embodiment of an alternative call handing procedure by the inventive system and service call center.
FIG. 9 is a diagrammatic illustration showing an exemplary coupons web page. FIG. 10 is a diagrammatic illustration showing an exemplary get business web page.
FIG. 11 is a diagrammatic illustration showing an exemplary member menu web page.
FIG. 12 is a diagrammatic illustration showing an exemplary member registration web page.
FIG. 13 is a diagrammatic illustration showing the relationships of several web pages to each other.
DETAILED DESCRIPTION OF EMBODIMENTS OF THE INVENTION It will be appreciated that conventional methods and approaches to matching a consumer with a merchant, business, or other individual or organization are unsatisfactory at best, and severely lacking, in many situations. The limitations and sacrifices are described in the Background section of this specification. Even with approaches that are more modern than conventional printed yellow pages or telephone number providing only 411 -type directory assistance, the actual improvement measured in terms of merchant, consumer, or industry adoption has been minimal. A combination of technological, philosophical, and psychological needs and forces has conspired against the success of such attempted improvements.
Attention is therefore first directed to an overview of the inventive system, method, computer programs and procedures, and Talk411 service and then to a discussion of certain needs and goals that are desirably satisfied for such an information system to meet the needs of merchants, consumers, and service providers as well. These goals do not rise to the level of requirements, as a system that fails to meet all of the goals may nevertheless provide advantageous utility, but arguably the more goals that are satisfied, the more satisfying the implementation may be. Many of the features described in the following paragraphs are also clearly optional and although they provide great utility and many benefits, would not be required in a basic system or method.
Talk411 is a one-stop fresh information source that offers instant promotion capabilities to local businesses using the telephone or the web. The information service is free for consumers. It is a promotion service for local businesses similar to the Yellow Pages, except it can be accessed with voice interaction on the phone or with display devices equipped with an Internet browser.
The information database (content) may be acquired in any number of ways, including from one or more of the following ways: (i) Purchased from local telephone operators, such as Pacific Bell or other operator, (ii) Input from local businesses who subscribe to the service, or (iii) Content partners who provide local information on the web.
The local businesses can subscribe, access and maintain the information they want to communicate to the consumers in at least the following two ways: (i) Call the Talk411 service and voice navigate/input, and (ii) Browse the Talk411.com and input by typing the information, uploading business information such as logo, etc. The consumers can access the information in several ways as well, including: (i) Call the Talk411 service and voice navigate and speak naturally, and (ii) Browse the Talk41 l.com web site. Selected features that a business may user are now described. These are examples to illustrate some interesting aspects of the invention and are not limitations or requirements of any particular embodiment.
Businesses can, for example: (i) Subscribe, register, and input their business information and promotional messages using the telephone or the web site, (ii) Become a member to the standard service, where they pay a low monthly fee in exchange for extended business information prepared specifically for the category they belong. This information is provided upon information request about that specific business, (iii) Become a Category Sponsor, where a message is played/displayed when a caller request businesses in a category, prior to playing the choices, (iv) Purchase Category positions for an additional monthly fee, where they purchase the right to be announced/displayed at a priority level when the callers/browsers request businesses in a specific category. Category Level 2 position for the month will ensure that the promotional message is announced displayed in second place when the caller requests businesses in that category. If no Category position is purchased, information will be pulled from the database on a random basis, (v) Become a General Sponsor for a larger fee, where their message gets heard as the first message as the call gets answered, (vi) Purchase Audio Coupons, where the callers get a discount on products/services if they call Talk411 service and request information about the specific businesses, upon hearing a prompt from Talk411 or in any of the promotional messages mentioned above. Audio Coupons can be purchased with a credit card on the Talk41 1.com web site, or on the Talk411 telephone service, (vi) Get reports on access/call patterns in their category, (vii) Get consumer ratings of their businesses upon participation in the "BayHits" discount program. This program allows registered users to get discounts from participating businesses in return for feedback on products/services delivered.
Consumers can, for example: (i) Obtain free of service charges, unlike the 411 directory services which cost about $0.46 to about $0.95 (local phone charges still apply) get local business information, (ii) Have hassle free navigation with voice on the telephone, focused web site with local business information on the Talk41 l.com web site, (iii) Obtain more information than just the number and address for the local businesses. More relevant information, since the business chooses the type of info they want to communicate, (iv) Locate businesses of interest by speaking naturally and specifying a category, (v) Locate a business by specifying zip code, name, street, phone number or other relevant information, (vi) Get discounts with Audio Coupons from promoting businesses, (vii) Become a Reviewer by registering to the "BayHits" service and get discounts in exchange for rating businesses on the Talk41 1 service.
From a certain perspective, for a system to be adopted and used by businesses and consumers, the system must not only be technically viable but must also satisfy the needs of the users. In this vain, the inventive system and method are designed to satisfy certain overall philosophical and subjective requirements as well as other more specific telephone service, Internet response, merchandising and marketing needs. In the paragraphs immediately below, some of the concepts and mechanisms that will facilitate providing a high quality of service and acceptance and adoption by the business, organization, and consumer communities are described. These should be interpreted as goals and guidelines rather than requirements of the inventive system or method, as they need not all be adopted or implemented to sustain a successful implementation. Rather they contribute to implementing a commercial system and method, and as such provide options for a commercially successful system, method, and business model.
A caller or user (whether the business or other organization providing talk411 information of a customer making an inquiry) needs to be able to interact with the system with the utmost ease and simplicity. Speed and simplicity should desirably only be compromised for accuracy.
The effectiveness of the system and service may be assessed and measured with the ability of the caller/user getting information with no more than one phone call in most cases. A key benefit for the callers are convenience and savings from local merchants/businesses, hence improving the quality of their lives. Key benefits for the merchant/business/organization are the speed of deploying promotions (or other information) at a low cost, ability to get quick feedback from the marketplace and change their promotion or message dynamically, hence generating quality self- targeted leads. The first impression of the users should desirably be to fun, fast, accurate, and helpful information retrieval.
With these top-level needs in mind, attention is now directed to telephone service preferences. Prompts and menu designs, especially for audio prompts, menus, or other indicators. Sound may desirably be used as an indicator where possible to shorten the amount of words, and a minimum (or reduced) number of questions should be asked to get the answer. Where a response may not be clear, it may typically be beneficial to ask the user to confirm the input, in order to avoid providing incorrect or inappropriate information, and if still not clear, connect to live operator. As consumers are already familiar with U.S. nationwide basic 411 directory assistance, there may be desirability in emulating the basic 41 1 experience with additional courtesy and information. In some embodiments, limit the registration to phone number and password only, and direct potential member to web site to complete the registration, perhaps offering some incentives for such web based registration. Simple navigation using a limited number of intuitive key words to access information with a special lead phrase are desirably used. As a positive courtesy, a greeting at entry and hang-up may be provided. With respect to response time, it is desirable that the wait not exceed the typical time for operator assisted directory assistance, and a wait time of no longer than about 10 seconds (and preferably less than about 5 seconds) for information. When a caller makes an inquiry, the phone number is retrieved based on the business (or organization) name and location, and the caller is then asked whether to connect to that phone number. The location can be any designation, for example any or all of an address, a city, a zip code, or major cross streets or other recognizable location. If a location is not specified, then various rules or policies may be used to select, for example, in one embodiment, three possibilities are deliver by city or street names. In the event that there is more than one of the business in the area searched, for example there may be three Radio Shack locations in the search area, then the caller may be presented with three alternatives and locations selected according to any scheme, and asked for a decision with which to connect. In either case, the selection may be made either by the number or by the location.
Embodiments of the inventive system and method provide optional features including "more" information, and a "bayhit" feature. The "more" feature is provided for business or organizations that prefer to provide additional information than standard phone number, address, type of business information. N "bayhit" feature relates to a rating popularity or recommendation feature as described elsewhere herein. Businesses or organization, or specific locations of a business or organization, having more information may be identified using a distinctive sound, tone, word, or other audible indicator to identify them as having "more" information that may be accessed. Use of a distinctive sound reduces the time required to communicate the information and also reduces the computational burden of delivering additional spoken words. The bayhit feature (or equivalent by another name) may be indicated by a different sound, tone, or word than the "more" feature, and businesses or organization that are both more and bayhit may be identified with some third set of audio indicators or hybrid or composite sounds.
Where an audio or voice coupon is involved, the audio coupon may advantageously be played or enunciated at the time the phone number is delivered. Alternatively, the caller may be given an option as to delivery of the audio coupon.
When a caller request or inquiry is processed by the system, the system advantageously retrieves a predetermined number of the closest matches to the requested category and location. In North American culture, three seems to be the appropriate number, however, providing one to 4 of the closest matches might represent the normal range. In some contexts, providing five or more or the closest matches might be appropriate with due consideration given to the brevity of delivering so many alternatives. Ndvantageously, the names of the matches are delivered along with the availability of any coupons (when the coupon feature is implemented) and optionally the content of the coupon. Relevancy may be based on a user location input, this location input may be expected to optionally be provided automatically by caller identification based location determining mechanisms, cell phone base station triangulation mechanisms, or phone based Global Positioning System (GPS) location determining mechanisms. When such caller location information is available by whatever method, the search and relevancy may be a location sensitive search.
Upon delivering the predetermined number of matching (or most nearly matching businesses or organizations), the caller is requested to chose from these or to request delivery of additional matches. Either a predetermined number of additional matches may be delivered or the system may request the caller specify how many additional matches, with an optional limit imposed by the system to reduce the burden on system resources and the total connect time. In one embodiment, the number of matches delivered at each stage is three matches, and the total number of additional requests that can be made is four. More generally, the number of matches delivered for each request and the number of requests for additional matches for the same call may be limited to suit local markets and customs.
The system desirably recognize the caller's choice by business name or by either a vocalized queue number or phone keypad press corresponding to the number. IF one of the matches has more than one location in the relevant search area the caller may be presented with the alternative locations or address information when that business is selected. In this regard, the system may filter a business that has multiple locations so that the predetermined number of matches are to different businesses rather than to multiple locations of the same business entity. If the caller is interested in that business entity, then the caller is provided with information as to the multiple locations and asked to select or choose the desired location.
The phone number for the selected business or organization (and location) is then delivered and the caller is optionally but advantageously asked whether the caller wishes to be connected. The ability to be connected is particularly advantageous for a caller using a mobile phone where materials to write the phone number may not be readily available. Automatic connection is advantageously free to the caller and may be free to the business or may incur some additional fees. Various alternative credit, bonus, fee and non-fee based mechanisms may be implemented.
In some embodiments, it may be desirable for the caller to provide a phone number to a particular business name and location, and to have the system search and verify a business name by saying the phone number, and getting the business name back with the coupon if available. This feature may be referred to as a reverse search by phone number with optional coupon retrieval. This feature permits the caller to effectively make an inquiry as to the availability of a coupon offer at a particular business establishment, to make comparisons between business establishments, and to make a decision accordingly.
In a further embodiment, a coupon request may be made by the caller asking for a category alone, a category and location, or any other item or combination of items on which the database may be queried. Based on the results of such a category or other criteria search, the coupon may be provided. Note that although this and the above discussion focus on the voice based system, these coupon mechanisms (particularly the ability to perform directed searches on the basis of business identity and category) may also be implemented for the Internet based service.
Desirably, audio coupons delivered in this manner should have a maximum permitted length. For example, in one embodiment coupons are limited to 10 seconds and in another embodiment to 5 seconds or less. Caller consumers are not generally interested in receiving additional advertising, rather they want to know the essence of the offer, what discount, free item, cost saving, or other enticement is being offered, and no more. They are then free to make an inquiry once connected to the business, merchant, or other organization as to further information. Again, there is desirability of satisfying the needs of the caller consumer, the desires of the business, and the practicalities of system implementation where the number of lines, processors, servers, and other software and hardware infrastructure and impacted by call volume and duration.
Once the caller selects the matching business or organization with which to connect, the system dials and connects the call, checks for an answering machine tone, delivers a message that informs the business that this is a Talk41 1 call and identifies the coupon promised to the caller, and transfers connection with the business to the caller. Optionally, the caller may be connected to the business during this hand over or transfer process, but normally this is disadvantageous as it appears to add undue complexity to a process that the caller should view as transparent and effortless. Advantageously, in one embodiment of the inventive system and method, during some predetermined time interval following the connection to the selected business, the caller can take action to terminate the call with the business and reconnect with the talk411 system. This presupposes that a connection be retained with the talk411 system, or that a re-connection mechanism be provided. These mechanisms are readily available in the telephony art. This predetermined time period is anticipated to be in the 15 second to 2 minute time frame, and nominally about 30 seconds so that if the caller determines the selected business does not satisfy the need of the caller, he or she can try again without undue difficulty or expense. In one embodiment, the caller action involves pressing a key or combination of keys so that commands to the system may be clearly distinguished from a verbal discussion with the proprietor or employee of the business. A tone signaling scheme will likely be more readily inteφreted than a voice command, particularly if automated switching and routing is involved.
Embodiments of the system provide the ability for a caller to rate or otherwise provide positive or negative feedback regarding a talk411 member business or organization. Desirably, the rating of any particular member should be limited in some way to avoid the possibility of a single caller, unduly influencing a member rating in either an absolute or statistical sense. The limit may be rules that provide for a maximum total number of rating inputs for a business, a limited number per month or per year, or some other rating restriction rules. The goal is to provide other callers with a rating that fairly represents the opinion of all callers that have had an experience with the business, and not to allow either an individual that has had a bad experience to rate the business negatively multiple times. These limitations would also prevent an individual from inflating the rating of a member business by calling multiple times with a positive rating. Control mechanisms would also be desirable to keep the proprietor of the member business from self-rating in an unwarranted manner. These controls may involve technological mechanisms and/or legal contractual agreements with the member businesses. In one embodiment, a caller desiring to rate a member business has to sign on or register in some manner so as to identify themselves. For example, the caller may sign on to the system by saying a phone number and a password. Individuals desiring to participate in rating member businesses may be provided with certain benefits, points in a points or award program, discounts, or other promotional and/or marketing enticements for their participation. In one embodiment, the system implements a Rate2save points program wherein a registered caller earns points that may be redeemed for goods or services in exchange for rating businesses of organizations. In another embodiment, registration is not required, however, the rating made is associated with the caller's telephone number if the system can determine the callers caller ID so that multiple ratings of the same business from a single telephone number cannot be made, or cannot be made within predetermined time intervals. In the desire to maintain privacy, a message may also be played indicating that the caller's telephone number will be examined and compared for the purpose of ratings control and requesting permission to continue.
A caller will say the name of the business, location or other information such as phone number to quickly identify the business and rate on some predetermined scale, such as for example, a scale of 1 - 10, or any other numerical or non-numerical scale that permits the caller to use other adjectives such as for example poor, good, great, exceptional, too expensive, no parking, or any rating information that would be of use to a later customer. Ratings in multiple areas may also be permitted. For example, a restaurant may be rated according to food quality, food portion, service, price, wine selection, parking availability, or other factors of interest to potential customers. As system evolves and direct or indirect feedback is obtained from businesses and consumers, particular rating categories and scales found to be useful will be identified for the various categories of business and the ratings categories may be updated and modified accordingly.
Once the caller rates the business, points or other benefits that the caller rater may be entitled to will be reported and the system will hang up. These ratings will then be compiled and made available to subscribed member merchants and businesses on the web as well as the telephone.
For embodiments of the invention that provide a hit or popular site (restaurants, night clubs, coffee shops, bookstores, etc), best sites, highly recommended sites, or the like listing or directory such hit sights are referred to as Bayhits after the San Francisco Bay / Silicon Valley region where the invention was developed. Other regional areas may adopt other names for such attractions, such as Twin-City Hits (Minneapolis -St. Paul Area), LAHits (Los Angeles), or any other title whether containing "hits" or not.
Callers may make requests by category such that such BayHits (or equivalent) may also be found by category and delivered in the manner already described.
While many of the advantageous features of the inventive system and method are provided using voice recognition and text to speech processing over existing telephony infrastructure, other administrative facets may advantageously be taken care of using an Internet Web site or live agent. Live agents may also be occasionally used to set up and register. These administrative features may only be implemented where the clarity and preciseness of textual input and/or the volume of input increase the efficiency and accuracy of the process. These administrative procedures may also involve hybrid voice and Internet components. For example, an initial stage of registration or sign-up may be made over the telephone with minimal information provided over the telephone. The callers are then directed to a web site where registration may be completed. This form of registration may apply to either or both of businesses and consumers, for example consumers enrolled in the Rate2save points program. The advantage of such on-line or web based registration being that such registration does not tie up a phone line that might better be made available for responding to caller requests, and the accuracy associated with typed or menu driven registration may be somewhat greater given the current state of voice recognition technology. It is important that data entered into the database be accurate, and such accuracy is promoted by test input. This does not preclude the use of voiced or spoken input and it is likely that as voice recognition technology improves that the differences between text or character driven digital input and recognized and converted speech to digital inputs will lessen.
Merchants, businesses, and other organizations may register by providing their phone number and a password for a trial period. This may include a simple registration with minimal information obtained over the telephone and direct callers to the web site to complete the registration in order to take full advantage of all merchant and business services and options. In instances where further registration beyond the trial period are required, especially if the additional registration is required for the merchant or business to avail themselves of additional capabilities, such registration may be completed over the telephone, over the Internet, or through the use of paper document of electronic contracts or agreements.
After or as a component of registration, the basic information for the business, such as formal and common name of the business, business address, business phone number, are verified so that the business may subscribe to the inventive Talk411 service. If the business or organization is not in the database contact is made though a customer service representative or other means. Credit card or other bank or payment information is provided to initiate of continue subscription to the service. The business or organization is then provided with an opportunity to record a message to be heard by a caller when callers ask for more information about the business. In one embodiment, the duration of this business introductory message is limited to a predetermined period of time, usually between 10 seconds and 1 minute, and more typically not to exceed
20-30 seconds. An opportunity is given to review and re-record the message, to confirm the message, and ultimately to publish it. In one embodiment of the invention, provision is made to allow a celebrity or other talent to record the message. This optional service and feature is described in greater detail elsewhere in this specification. Various levels of membership are supported and different optional features may be made available at the various membership levels. Membership and subscription fees may in some instances be tied to membership level or to options selected.
Whether provided as a membership level or an option within one or more membership levels, a business or organization once subscribed (or registered in a trial period), may purchase sponsorship for the category they belong to by specifying the category, number of months category sponsorship is desired, and level of sponsorship. In one embodiment of this category sponsorship feature, the sponsor business is put in an ordered list or queue (or random pool) along with others who have paid for the same level of sponsorship. When its turn comes (whether through a sequential ordering or via a random or statistically determined selection), the name and phone number
(along with a attached coupon, if any) for that category sponsor business or organization will be heard if a caller asks for businesses by category. In one embodiment, there are three levels of sponsorship. Level 1 consists of those sponsors who paid the highest to have a greater frequency of presentation to callers (a higher hit rate), level 2 with lower frequency or lower hit rate and cost, and level 3 with the lowest frequency and hit rate and cost. In another embodiment, a category sponsor may pay for a fixed number of placements per week or per month.
Other placement parameters may pertain to geography, time of day, length of message, or other parameters that may tend to drive more business volume to the category sponsor. While category sponsorship is one type of sponsorship that may be implemented, sponsorship opportunities may exist for other aspects of the Talk411 service.
Voice or audio coupons may be created and purchased once a business or organization have subscribed or registered. Typically, a merchant will telephone and specify the number of coupons he or she wants to purchase and provide credit card information or other billing information and confirm the purchase. Usually, coupons will be bought in quantities in excess of 1000, though there are no limits on the quantity that may or must be purchased as any single transaction. The message that will be played in the form of a coupon is recorded (by the merchant or an agent of the merchant (such as a hired talent), along with the name and phone number of the business. This coupon message will not normally exceed a predefined duration, though different durations may be imposed for different types of business, membership entries, geographic regions, or other criteria. Typically the voice or audio coupon will not exceed 30 seconds in duration, more usually not exceed 15 seconds, and frequently will not exceed about five seconds. Some flexibility may be provided according to a minimum and/or maximum information that the merchant is entitled to record. In one embodiment, once a caller asks for the business, hears the audio coupon and gets connected to the merchant, the merchant is notified by the system that this is a Talk411 call and play the same coupon message to remind the merchant (or an employee of the merchant) about their promotion so as to avoid any ambiguity or problem associated with an uniformed employee contesting the caller's (now customer's) entitlement to the coupon offering.
In one of the embodiments, providing for talk41 1 ratings, a simple cumulative rating that is compiled from the raw ratings of the users is delivered when the merchant calls to check the rating. Security protections such as password or numerical code may be required to access such ratings, particularly when greater detail is provided to the merchant than to the general public. There may also desirably be a ranking within the category to which the business belongs. These ratings will likely be adjusted from time to time and evolve to suit the needs for each category. In one embodiment, there is a simple 1-10 rating, where 1 is the lowest rating and 10 is the highest rating. Other embodiments provide for callers to rate the business on multiple criteria as may be relevant to the category.
It will be apparent to those workers having ordinary skill in the art that the inventive voice or speech based system provides numerous features, capabilities, and advantages over existing systems. The inventive system and method also provide for both separate Internet or world wide web based interface as well as to an integrated system providing both voice/telephone accessed and character/Internet based access. All, or nearly all, of the information described for the voice/telephone accessed features and capabilities are also available for access through a computer graphical user interface
(GUI) over the web. It is not required to be able to interact with the web-based user interface with voice, or to connect the user to the merchant with the phone in this phase, however it is desirable to be able to play back the messages during an Internet web-based session that were recorded on the phone. It may even be desirable to allow a user to record a message using local available computer resources and upload the message to the system, however, this is not required.
In other embodiments of the invention, the user is provided with the capability to select a business and connect to the business using the Internet with Voice Over Internet Protocol based technology. Desirably, the graphical user interface, provides a differentiation between a recorded message versus a text message prior to someone clicking the prompt, so the user expectation is set properly.
An embodiment of the Internet web-based graphical user interface is now described. For reasons of simple access, the web pages are desirably sized and formatted to display on a single viewing screen without the need to scroll.
Advantageously, this is accomplished independently of the user's currently set screen resolution. Intuitive buttons and menus are advantageously provided where access and interaction is as simple as clicking on the available selections, and a single mouse click is used where possible. Quick page access is a high priority to encourage use without undue effort, therefore any graphics are desirably used judiciously only where necessary. Simple and clear icons, tabs, drop downs, and the like features are used. Some space may be reserved for ad banners and other content that may provide useful information for the user or a source of revenue for the site, site operator, or talk411 provider. Coupons will desirably be highlighted so the user can easily understand that they are coupons, and provide facility for printing either as part of the web page or via a separate but readily available free program.
Information storage, search, and retrieval are desirably fast and efficient to minimize any delays. Search and retrieval of business/merchant information such as name, phone number, location, address (any elements of the address) from any of the known data above. Search by category is supported, desirably with result priority given to category sponsors, and to those with coupons. Reserve a small section where BayHit businesses can be featured with a distinct graphic, linking to their business information.
The result of a search will yield a short list of found or matched (or closest match) businesses with the business information for a predetermined number of (typically 2-5 and usually 3), category sponsors visible in most of the page. The user may then click and select others in the list if not satisfied with the sponsors found. Each result page served with the next button should desirably serve the same format as above, except changing the sponsors and the short list.
For the business user, subscribed business users may be provided with one page where they can provide essential business information that will show as a result to a user search. They can select from an available selection of templates, upload their logo or graphics, and input their information. There will be a dedicated part for displaying their coupons if purchased. Custom pages not conforming to the template may be available as part of the standard subscription, available only to higher membership levels, or available at a higher fee. In one embodiment, customization is permitted but only within a certain range of options that preserve the talk411 look and feel. A dedicated management page may optionally be made available for the subscribers. Subscribed business will be able to track their coupon promotions, category sponsorships, how they are rated in the category, and their account history. Other information may also be provided. There will desirably be a provision to provide call volume, page hit rate, and comparative business results for businesses in the same category. This feature may be enabled when the volumes reach predetermined levels. Based on account activity and ratings, the talk41 1 service provider or an agent thereof will publish recommended action promoting our category sponsorship and coupon opportunities. Therefore it is desirable though not required that some space dedicated to Talk411 be reserved on every merchant page.
In one embodiment, a business, merchant, or organization are able to type in or otherwise enter or present the Coupon message or the Business information into the provided field which will then be translated into speech when a caller asks for the information over the telephone. Alternatively, Business/Merchant will be able to type in the information, and request the information to be recorded by a professional voice (or audio) talent from the list provided on the web site. This alternative feature is provided via an optional voice talent portal (VTP). Through this portal or other means, a user is able to select, such as by clicking on an icon or using a pull down menu, and hear a recorded clip of a message delivered by a registered voice talent as a preview.
Each registered voice talent will have a private page where pending jobs will be posted and messages to be recorded displayed. Once the voice talent completes the recording of the job, he/she will be able to upload the wave file (or other file format) for the recording to their private page, and verify and publish it for review by the business. A phone alert with the recorded message will be made to the merchant or organization to confirm and publish the message.
Upon completion with business approval, their credit card or other account will be charged or invoiced with a portion of it paid to the voice talent. Using the facilities available over the Internet, including the ability to use email and record, store and communicated high quality digital recordings, it is anticipated that such process should not take longer than 24-72 hours, and that for highly sought after talent, not more than one week. Additional fees and time may be required for premium talent. The ability to provide a quantity or volume of messages to a voice talent is a key value for merchants and businesses who want professional recording of their messages and don't want to pay the full hourly fee for such talent, or invent the overhead associated with contacting and negotiating a contract with the talent or their agent. Using the inventive system and method, talk411 aggregates the messages to be recorded and lets the merchants share the cost of voice talent recordings.
In one particular embodiment of the inventive Talk411 service, the design of the system is guided in part based on empirical results. These results suggest that the voice-recognition should desirably be highly accurate, preferably accurate better than 95% of the time. Automated announcements should desirably be a compelling and use consumer tested voice. Response time to voice inputs should ideally be no more than about 1 second in most cases, and typically no more than 3 seconds when searching.
Some fine tuning of the input and search timing will occur when the system is fielded.
Callers should desirably be able to interrupt any time and the system and method should desirably be able to deal with this. Transactions and identities should desirably be kept secure by the high standards. For consumer calls, when the system does not understand after two tries, it should desirably bounce to a live operator, or take other action in an attempt to understand the consumer. For example, if a live operator is not available, the system may use a more sophisticated (but possibly slower) processing algorithm to understand the speech. For business subscriber calls, when they say,
"help" there should desirably be some automated help, and when they say "save me" they should desirably be connected to a customer service representative.
Various hardware and software configurations, computer program code constructs, procedures, and mechanisms, as well as a variety of different operating scenarios and protocols for accomplishing these and other goals are now described.
Having now described embodiments of the overall system as well as several operating scenarios, attention is now directed toward a description of two particular embodiments, an Internet web based embodiment and a voice processing based embodiment. Attention is first directed to a description of the voice processing based system including the voice processing hardware and software. Voice Recognition Based System And Method
FIG. 1 shows the top or high level system architecture 102 and consumer user (or caller) 101 access points for a preferred embodiment of the invention. A cellular phone 106 (or other wireless device), standard telephone 118, either an analog (POTS) or digital, can be connected to the architecture 102 using a standard telecommunication link 120, such as a standard telephone line 122, ISDN line 124, cable 126 or DSL line 128. For a cellular phone 106, it is understood that there is a cellular base station 108 and a cellular switch 110 interposed between the cellular telephone 106 and the PSTN 112. The incoming call 130 from the cell phone 106 or the telephone 118 goes through a PSTN 112 and telephone switch 114 and gets picked up by the Speech server 116, which is connected on one side 117 to the telephone switch 115, and on the other side 119 to a computer network 130 such as for example the Internet.
Access may be via the Internet, for example access using wireless devices using or compatible with the Wireless Access Protocol (WAP). Data may therefore be communicated in a WAP compatible format. WAP is a standard set of protocols for wireless Internet access which can deliver special Web pages to portable devices such as smartphones and palm-portables. In current versions it relies on the use of WML instead of HTML and so on. These systems generally support Java and Windows CE, among others. WAP Version 1.0 has been released and is hereby incoφorated by reference along with variants and extensions thereof. Other versions are under development.
The user (or caller) 101 can access the information or data 132 that resides in a database 134 within the Information Center 136 and the Web Site 138 through interactive voice commands 140 and/or through keypad presses 142 on the caller's 101 device, such as on the cellular telephone 106 or standard wired telephone 118. In a preferred embodiment of the invention, only the caller's voice commands are used. The caller's voice commands 140 are recognized and translated into one of the variations of Voice Extensible Markup Language (VXML, VoiceXML, or VOXML) commands 144 by the Speech Server 116 using a speech-to-text conversion engine 146 and once translated into VXML are used to retrieve the information 132 from the Information
Center 136 database 134. VXML is an extension or elaboration on the XML (Extensible Markup Language) standard known to workers in the art and not described in greater detail here. Information concerning the VXML Forum is available on the world- wide- web at http://www.vxmlforum.org/ and Version 1.0 of the VoiceXML specification dated 07 March 2000 which is hereby incoφorated by reference is available in Adobe Acrobat format at http://www.vxmlforum.org/specs/VoiceXML- 1 OO.pdf .
Once the data 132 is retrieved and transmitted back to the Speech Server 1 16, the text information from the data 132 is converted to speech using a text-to-speech conversion engine 148 within the speech server 1 16 and played back to the caller 101 using the caller's device 106, 118. Speech server 116 also generates and plays back (presents) pre-recorded or synthesized menu commands 150 to the caller. The system architecture connects 102 the information database 134 to the Internet 130 (or other local or global network of computers and/or information appliances) which can also be accessed with a display device 152 such as a personal computer (PC) equipped with a modem 154 (wired or wireless), a smart phone 156, other wireless phones or devices, a PDA or palmtop device 158 or any computer or other information appliance or device that can be connected to the Internet (or other local or global network) with the ability to display standard Hypertext Markup Language (HTML) pages or other formats inteφretable by the computer 152. Access to such wireless devices may use the Wireless Access Protocol (WAP) and/or other protocols or standards. It is noted that although reference is made to several current industry standard data and information formats and protocols, such as HTML, XML, and VXML, the inventive structure and method are not limited to these particular formats and/or protocols or to the versions of these protocols in existence at the time the invention was made as those workers having ordinary skill in the art will appreciate the capabilities and features provided by these formats and protocols may be provided in other ways and that future versions of these formats and protocols will also support the inventive structure and method.
Embodiments of the inventive system may desirably incoφorate and utilize natural language speech recognition. In such implementations, the user can naturally speak and the system inteφrets the user' s speech to extract the request or inquiry. The provides additional flexibility for a user as that user does not need to know any particular commands or request rules or syntax. Natural speech processing and artificial intelligence are known in the art and not described in greater detail here.
The system and/or method described herein may be deployed as a separate installation or deployed through application service providers (ASPs) who have the hardware and software deployed either nationally, or in the local areas. Teleron and
Voiceeo are examples of such providers.
FIG. 2 shows an embodiment of Speech Server 116, some of its functions and functional connectivity to receive a switched telephone call and to interact with the Internet 130. Speech Server 116 performs several tasks such as the task of providing a Network Interface 160 to the analog or digital phone network that provides the switched phone call 162, Automatic Speech Recognition (ASR) 146 or speech-to-text conversion (STT), Text-to- Speech conversion (TTS) 148, runs the application or application program 164 that control and manages the phone calls 162 and the Interactive Voice Response (IVR) 166. IVR refers to the interactive voice response which is conventionally a menu driven response provided in response to an input. A user is asked to say something (for example, "Press or say 1 for marketing, press or say 2 for research", etc.) However, the inventors are not aware of any such conventional systems that provide ASR or text-to-speech in connection with IVR. In one embodiment of the invention, the Speech Server 116 is a personal computer equipped with Dialogic Antares automatic speech recognition boards and other products.
Information regarding the Dialogic Antares boards are available from Dialogic Coφoration, 1515 Route Ten, Parsippany, NJ 07054-4596 USA and on their web site at http://www.dialogic.com products/indx_abp.htm.
Operation of the exemplary Speech Server in the system is now described. The incoming call 162 is answered by a network interface card 160, such as for example a
Dialogic network interface card (analog or digital). A prompt is played to the caller 101 over the caller's device 106, 118 asking the caller to say the selected item 170 from the available selections on a voice or audio menu. When the caller responds to the request, the application 164 passes the voice data to the auto speech recognition block 146, such as may be provided by a Dialogic Antares™ board loaded with an Automatic Speech
Recognition (ASR) software. ASR software is available from several sources, including for example from Lernout & Hauspie (L&H) (LERNOUT & HAUSPIE Burlington, MA, Phone: 1-781-203-5000, Fax: 1-781-238-0986, http://www.lhs.com) or Speech Works (Speech Works International, Inc., 695 Atlantic Avenue, Boston, MA 02111, Tel: 617.428.4444, Fax: 617.428.1122, http://www.speechworks.com). A Dialogic Antares board-based automatic speech recognizer (speech-to-text)
146 translates the voice data into ASCII text (or another code or symbols) that identifies the spoken words and returns a text or other symbolic representation of the results to the application 164. The application 164 accesses, via for example a T-l line or faster Internet connection, the database 134 ofthe Information Center 136. Real-time (or near- real-time), active vocabularies are generated at run-time using the database's 134 ASCII text or symbols. The application uses the ASCII text from the database 134, passes it to a second Antares board 148 running a text-to-speech (TTS) algorithm. The TTS algorithm generates the final voice or audio information that is played to the caller 101.
Voice processing hardware is now described relative to one practical embodiment of the invention, using a Dialogic Antares D41E printed circuit card configured for a Microsoft Windows based computer that supports up to four analog telephone lines and provides a telephony interface and voice processing functions such as playing and recording speech messages and collecting or dialing DTMF digits. The embodiment also utilizes a Dialogic Antares 2000/50 platform is used in one embodiment, and allows the Nuance Voice recognition system (or other voice recognition system with appropriate modification) to operate properly on the Dialogic platform by providing echo cancellation and end pointing services for the system). The installed card will support up to 16 channels of voice recognition.
The Antares 2000 family platform provides a standalone open digital signal processor platform for telecommunication applications. The Antares hardware platform provides four independent TI TMS320C31 32-bit floating-point DSPs running at 50 MHz, each with high-speed SRAM, enabling algorithms to be easily ported to the Antares™ board. Multiple memory options provide flexible solutions for different applications. For example, 512 KB (128 K words) or 2 MB (512 K words) local SRAM are available per DSP and 4 MB (1 M word) or 8 MB (2 M word) global DRAM memory are available per board. SCbus™ (or PEB™) connectivity allows standard access to off-the-shelf call processing products and provides the capability to build higher density systems. Up to 32-channel capability is available. Drivers are presently available for MS-DOS®, UNIX® (SCO®, UnixWare™, Solaris®, and AIX®), OS/2®, and Windows NT®. The Antares platform open development environment uses the Antares board and an ANSI C Compiler and Assembler/Linker for the DSPs. SPOX support includes the run-time SPOX DSP operating system and SPOX-KNL kernel development kit. Additional software includes the Antares kernel , which integrates the DSPs into the SCbus system and a down loader to download software to the board. A single Antares board can process up to 32 channels of digital voice and digital telephony information received from a network interface board via the SCbus. The SCbus bit stream from the network interface board can contain both audio (digitized voice) and telephony signaling information. The bit stream is applied to a SC2000 ASIC chip which has an internal switching matrix. Under firmware control, the SC2000 chip can connect any external bus time slot (1024 for the SCbus or 32 for the PEB) to any
TDM (time division multiplexed) bus time slot on the Antares board. The Antares board uses four C31 DSPs, each with its own Static RAM (SRAM) to separate and process data for any assigned time slot. Each DSP processes data based on downloaded firmware stored in its SRAM. After processing the incoming data, each DSP communicates with the host PC via global Dynamic RAM (DRAM). Kernel services are available on every DSP. The Antares local kernel provides all the necessary tools to read and write data to and from the TDM bus, which operates as a PCM (pulse code modulated) highway. By using the DSP's internal DMA (direct memory address) capability, the kernel can bring multiple frames of PCM data (up to 32 time slots each) into DSP SRAM from the TDM bus. Each DSP connects to the local TDM bus via a serial interface. The TDM bus is implemented with 32 time slots that are mapped into 1024 external SCbus time slots by the SC2000 chip. Each SRAM contains the downloaded firmware that controls DSP processing and also provides the DSP with dedicated RAM for its data processing operations. A control port on the Antares platform accepts operational commands from the host PC and monitors DSP-to-host PC interrupt status. This control port controls Antares platform operations such as: DSP reset; communications with and firmware downloading from the host PC via shared global DRAM read write operations; the transmission of interrupts, control, and status information to and from the Antares platform via the global resource circuits. When the system is initialized, firmware to control each DSP's processing is downloaded from the host PC to the DSP's SRAM.
This downloadable firmware enables feature enhancement, application changes, and upgrades.
Additional information on this item as well as on other products that may be used in conjunction with the inventive system may be obtained from Dialogic Coφoration is an Intel company, with world headquarters at 1515 Route Ten,
Parsippany, NJ 07054-4596 USA.
The above described Dialogic hardware and the computer system in which it is installed, inter-operates with an operating system (such as for example, a Microsoft Windows operating system, a Linux operating system, a Unix operating system) and application program software including device drivers.
In one embodiment, the Talk411 system is an integration of commercially available software applications and custom applications. These commercial software and custom software components include operation under a Windows NT Workstation (v4.0 Service Patch 4) operating system environment which is available from Microsoft Coφoration of Redmond, Washington, USA. The software may alternatively be adopted to interoperate with any ofthe other commercially available operating systems such as earlier and later versions of Microsoft Windows, such as for example Windows 2000 Professional; or non-Windows operating systems such as Linux, Unix, and others as are known in the art. The core voice processing application is Parity software's VOS Version 6W
(Version 6 for Windows). Parity VOS (V6W dated 07-1999) is available from Parity Software, Three Harbor Drive, Suite 110, Sausalito, CA 94965. Parity Software is a Dialogic company which is owned by Intel Coφoration of Santa Clara, California. Parity Active Data Objects RLL (ADO RLL) is also used and is available from Parity Software, Three Harbor Drive, Suite 110, Sausalito, CA 94965. The Nuance voice recognition support the voice recognition requirements. Nuance Speech Recognition Software (v6.2.1) is available from Nuance, 1005 Hamilton Court, Menlo Park, CA 94025. A VOS-Nuance integration Run-Time Link Library (RLL) provides an interface between the Parity VOS software and the Nuance speech recognition system. An embodiment of a VOS-Nuance Integration Run-Time Link Library (RLL) is available from TeleVoice, Inc., 1 1767 Katy Freeway - Suite 425, Houston, Texas 77079. These software and system features are described in greater detail below.
The Parity VOS system includes a VOS Script file or files which have been compiled to binary code which can be executed by the VOS runtime-engine. In Version 6W, VOS licenses are required both to run the system (a run time license) and to compile the VOS script (a developer license), both of which are available from Parity
Software. (A developer license can be used as a run-time license but a run-time license cannot be used as a developer license.) Licenses are controlled by a hardware-key ("software sentinel" or "dongle") that plugs into the parallel port ofthe system.
Additional functionality to the basic Parity VOS script language may be achieved by external modules known as Run-Time Link Libraries or RLLs. Parity
Software provides some RLLs with Parity VOS and other RLLs which may be desired to implement additional features are available from third-party vendors. RLLs are used for example, to enhance the VOS capabilities to include such features as voice recognition, text-to-speech, access to ODBC-compliant databases and convert numeric, date and currency values to speech.
The application specific program modules used for the inventive Talk411 system and method include a Nuance voice recognition interface module and a SpeechPro module which converts numeric, date and currency values to speech. Other providers ofthe broad class of text-to-speech converters (of which numerals-to-speech conversion is a simple subset) are known in the art. For example, Parity VOS provides a similar RLL called SmoothTalker with the VOS application. Those workers having ordinary skill in the art will appreciate that there are a number of competitive commercial products that may be utilized and that the rapid development in this technology area will likely result in a proliferation of software and hardware products that may be used in conjunction with the inventive system and method. In Windows environments (such as in Microsoft Windows NT, Windows 2000, or the like environments) , RLLs take the form of DLLs or Dynamic Link Libraries. RLLs are normally specified in a file (such as for example in a RLLS. TXT file) which must be referenced both when compiling and when executing. These programming and compilation conventions and procedures are known in the art and not described in greater detail herein.
By convention, script files used by the application are designated by a " . vs" (yos script) or ".vh" extension, and compiled VOS scripts are designated by a ".vx" extension. Exemplary embodiments of selected ones ofthe files are summarized below. Additional detail is available in programming manuals and user documentation from the vendor, either by mail at the address provided elsewhere herein or via an Internet web site provided by each ofthe vendors.
Table I. Exemplary VOS Configuration Files
(Note that ".def files are the source files, ".par" files are the compiled version ofthe ".def file) cpb.def Stores Dialogic specific configuration information (Channel
Parameter Block)
CPB.PAR Compiled cpb.def file hedge=2 Detect greeting on trailing edge for outbound calls nbrdna=6 Rings before no answer signal given on outbound intflg=8 Specifies to use full Call Progress Analysis for outbound calls init.def Stores VOS initialization settings
INIT.PAR Compiled VOS initialization settings sc_pcpa=l Use Call Progress Analysis sc_words=20 Maximum words in a phrase (used by SpeechPro) sc_gtdterm=l Let hang-up tones terminate any voice functions fil_buf=16 File copy buffer size (in Kilobytes) pr.def Defines indexed speech files
PR.PAR Compiled pr.def file tone. def Stores PBX specific Tone configurations (generated by Dialogic
PBXpert prog.)
TONE.PAR Compiled tone. def Parity VOS configuration files are listed and briefly described in Table I. Parity VOS Source and Executable code files are listed and briefly described in Table II. Table III lists and describes various Other VOS related files. Additional description is provided relative to files and procedures that have been developed to support the inventive system and method and are not part ofthe commercial software programs.
It is noted that although Parity VOS is used in the particular exemplary embodiment, the inventive system and method are not so limited and that those workers having ordinary skill in the art will appreciate that other programming language and methodological constructs may be used to achieve the invention, such as for example implementations using VoiceXML, C++, or other scripting or programming language or languages.
Table II. Aspects of VOS Source and Executable code
(Note that ".vs" and ".vh" are source code, and ".vx" is executable code) master. vs This file "kicks off the Talk411 application for each voice line master. vx Compiled "start" file
Spro.vh Header file for SpeechPro commands and constants spro.vs VOS source code for SpeechPro functions talk41 l.vs This is the main application file talk411. vx Compiled main application file
VrCmd.vh Header file for VR commands and constants
Table III. Other VOS Related Files directory2.mdb Microsoft Database file used to store demo business info.
LocalRlls.txt File used to specify VOS RLLs for compilation
Rlls.txt File used to specify VOS RLLs for Run-Time The master.vs file is the file specified when launching the Parity VOS run-time engine. It contains all of the code necessary to open up the database, initialize and configure the voice lines and the voice recognition channels and spawn or launch the Talk41 1 application for each voice line. The presence and general structure of master.vs (and it's compiled partner master. vx) are known to workers using Parity VOS and other such commercial programs. Master.vs for the talk41 1 system is a file that does some set up, such as, determining how many phone lines are installed, and launches one talk41 l.vs application for each ofthe installed lines. It is somewhat like a computer system startup file and initializes some variables, establishes connection to the database that stores the business/merchant/organization information that will be retrieved in response to caller's requests, and then spawns one copy of talk41 l.vx for each line that it detected as being installed. Master.vs is specific to the talk411 application but it is generic in that each application would usually have a master. Several of the specific tasks performed by master are now described in somewhat greater detail.
Master, vx first determines and defines the number of voice lines installed. This represents the number of simultaneous callers (usually use function that detects number of hardware channels of lines installed, but where the number is stable, this may be hard coded). (Restrictions may also be applied where the number of lines is greater than the number of software licenses.)
Master. vx then opens a connection to the database in master that is shared by the applications. This database can be a local database either within the same physical box, co-located in a separate box or device, or located remotely. In one embodiment, the database is a Microsoft Access (Access 97) compatible database, however other database types and formats may be used. Advantageously, Parity VOS includes database functionality, so that a database created in Access may be used by VOS without requiring Microsoft Access or separate database application program. In one embodiment, the same data base file (dictionary32.mdb) is used to store and retrieve all ofthe business, merchant, individual, or other organizational material for both the voice recognition based talk411 and the Internet based talk41 1. Master.vx code instructions then initialize the Nuance voice recognition that sets up the mapping between the inbound call line and the Nuance resources. It also maps (or sets up an association or routing of) the Nuance processing channels to the voice caller channels. Master. vx then spawns or launches one copy of talk411.vx application for every line installed (subject to the number of licenses). There is an iterative loop that increments for the number of lines available. Operation then continues under the control of each talk41 l .vx compiled application.
The Talk41 1.vs script defines how the actual call handling and processing happens. The general flow ofthe call from the talk41 1 code perspective is as follows.
First, a code procedure module "Initialize()" sets up the telephone line or communication channel specific parameters and prepares the line or channel to accept a call. This line or channel may be wired or wireless and so the term line or channel is used in its broadest sense. Next, a "WaitForCall()" procedure waits for a ring signal on the line and when it arrives, takes the line off-hook and prepares it for processing.
An "Initialize Voice()" procedure module sets up the voice recognition parameters for this line and prepares for a recognition.
A "MainMenu()" procedure is where the call originates after the above initialization. At this point we set the Nuance Engine to the ".MainMenu" grammar and wait for a result. It returns a series of "slots" each with a valid keyword. The call is then processed based on the result of this recognition. If the caller said the name of a business, we give them the business information. For example, if the caller spoke a "Type" of business, the caller is provided with information on business of that type. Alternatively, the caller may either have spoken a command (such as help, quit, member information, or the like), in which case the command is handled, or the spoken utterance may not have matched any options in the grammar. Failure to match may be due, for example to the intended spoken utterance not being defined in the grammar, or the quality ofthe received utterance may have precluded matching. Various programmatic options are available requesting the caller to re say the utterance or to provide a different command or utterance. The talk41 l.vx file is the compiled version of the talk411.vs file. A exemplary embodiment of the talk41 l .vs is now described. The talk41 l .vs script file and its compiled version talk41 1.vx is application program that controls how a telephone call to the talk411 service is processed. An interface exists which permits signals sent from any telephone handset or other telephony capable device to be received and recognized by a computer or more precisely by a circuit or PC board designed to handle and process such signals. Telephony-to-computer and computer-to- telephony products made by Dialogic that provide this signaling and call processing capability are described elsewhere herein. In the recent past, interaction between telephone systems and computers primarily relied on touch tone frequency signaling, such as may be accomplished by pressing a series telephone keypad buttons to generate readily recognizable signals, or equivalently to have a computer program generate such tones. More recently, new technology supports spoken voice command and data in a system that provides voice recognition to inteφret the electronic signal generated from the spoken voice, inteφret that signal, and convert that inteφreted signal into one or more commands or data items that can be processed by the computer. Effectively, voice is used to give or respond to an inquiry. These hardware and voice recognition components are known in the art and therefore are not described here in greater detail. References are also provided to companies and organizations providing this type of technology as standard off-the-shelf commercial products. The talk411.vs, talk41 1.vx, and other associated or related files provide an additional layer or layers on top ofthe conventional telephony hardware and software, the voice recognition software and possible optional hardware or DSPs (voice-to- computer), and computer-to-voice (also referred to as text- to- voice), that direct the system in handling a call in a particular way and to provide particular features. The inventive software and system configuration used in conjunction with existing or customized voice recognition features constitute a novel computing and information processing machine and method.
As stated elsewhere in this specification, the Parity Software VOS Version 6 for
Microsoft Windows is used as a foundation platform. In its native uncustomized form, it supports only touch tone signaling, and does not support voice processing or recognition in its native form. Nuance has provided a voice recognition software
(Version 6.2.1) available from Nuance at www.nuance.com. but this software is not directly compatible with the Parity VOS software. Therefore, a third major software component integrates the Nuance voice recognition software with the Parity VOS software so that the three components accept voice or speech input, inteφret it, create a character or a string of characters, and provide that character of character string in the manner of one or more computer keystrokes or other digital data stream that is readily inteφretable by the computer.
More specifically, the Parity VOS software handles the interface to the phone system (including how to answer a call when a phone ring indicator signal appears, how to play messages to the caller after it answers, and how to accept touch tone signals form the caller. It also handles branching to further alternative processing based on what action is selected by user, and handles hang-up or call termination by hanging up the line, and resetting the line and the system and software to receive the next call. In essence is it responsible for call control. In many applications it is the only application software that is needed, particularly if the system is a touch tone based systems. The Parity VOS software version 6W does not have native voice recognition built in. Parity
VOS version 7 is available, and it is anticipated that Version 8 will be released shortly. Version 8 may provide some additional voice processing or recognition capability, or provide an interface or other integration with Nuance Voice Recognition (and possibly including others, such as Speech Works software). The interface software is presently a piece of software that permits use of Nuance Voice Recognition software and returns a result from that software to VOS for processing.
Parity Software also provides a developers toolkit in the form of a programing language and compiler for that language, so that software code, including data structures and executable instructions, can be developed to tell the VOS software and other components of the system what to do. These developer tool kits and documentation for their use are available from Parity Software and are not described in greater detail here. The talk411.vs script is developed using this VOS toolkit and compiled into talk411.vx. This script and its compiled version executes within the hardware to provide control the particular manner in which call flow is handled, what inquiries and responses are permitted (in conjunction with various dictionary and grammar filed described elsewhere herein), how to inteφret specif commands and situations. In essence, the talk411.vs and talk411.vx files control the life of a call. Note that multiple phone lines and interactions on a shared personal computer or server are permitted. In one embodiment, twenty- four simultaneous calls on twenty- four different lines are supported. With additional hardware boards and an appropriate configured computer, additional lines and calls are supported. Portions of an exemplary talk411.vs file are set forth in the specification. Selected components ofthe talk411.vs script file are listed and described and it is expected that those workers having ordinary skill in the art in light of the description provided herein, including the description of the function and operation, the commercial availability of Dialogic, Parity and Nuance Software and Hardware, will understand how particular aspects of call handling are implemented without detailed exposition of all portions ofthe script, dictionary, and/or grammar source or executable files.
The exemplary talk411.vs file includes the following components or functions: main menu {MainMenu()} , business information {BusinessInfo()} , connect to business {ConnectToBusiness() } , more business information {MoreBusinessInfo() } , recommend others {RecommendOthers()}, type menu {TypeMenu()}, modifier menu
{ModifierMenu()}, rate business {RateBusiness()}, member entry {MemberEntryO}, play promotional message {PlayPromo(ID) } , play coupon message {PlayCoupon(ID)} , record promotional message {RecordPromoMessage()}, purchase coupons {PurchaseCouponsO} , record coupon message {RecordCouponMessage()} , check score {CheckScore()}, category sponsor {CategorySponsor()}, get credit card information
{GetCreditCardO}, confirm information {Confirm()}, get voice response {GetVoiceResponse(Grammar, FirstPrompt, SecondPrompt)}, parse input string {ParseString(InputString)}, process slot {ProcessSlot()}, remove input string spaces { (RemoveSpaces(InputString) } , remove input string dashes {RemoveDash(InputString) } , remove input string parenthesis
{RemoveParens(InputString)}, save recording {SaveRecordingO}, set end time in seconds {SetEndSeconds(seconds)}, initialize {Initialize()}, wait for incoming call {WaitForCall()} , initialize voice {Initialize Voice()}, terminate voice {Terminate Voice()}, play message {Play(MessageName)}, play message b {Playb(MessageName)}, speech process the phone number {SproPhone(Number)}, and, play speech message that can be inteφreted by the caller {Playl(Prompt)}. These functions are largely self explanatory by their name. For example, the MainMenu() function lets the caller select a business, a type of business or enter the member area. The BusinessInfo() function reads the business information from the database and speaks the business Name, Phone Number, and any optional features such as the BayHit Message, Promotional Message, and Coupon Message. The
ModifierMenu() menu function is used if the caller only said a modifier such as Coupons or BayHits. The Playl(Prompt) function will play a speech message that can be interrupted by the caller speaking, where the result of their speech is thrown away, and where the function returns true if the message was not interrupted and false if the message was interrupted.
By way of further example, the MainMenu() function portion ofthe talk41 1.vs file is duplicated in Table IV. Exemplary portions of talk41 1. grammar code are also providd in Tables V and VI. As will be appreciated from the syntax ofthe MainMenu function in the table, this function lets the caller select a business, a type of business or, entry to the member area. Variables used by the function are reset, a greeting is optionally played, and a voice response is received from the caller. A determination is then made as to whether they spook the name of a recognized business, or of a business type, and whether any business type modifiers were spoken. The function also monitors to determine if any commands were spoken, such as for example "Help", "Quit", or "Member".
In particular, the GetVoiceResponse(".MainMenu", Prompt, "MainMenu") syntax results in playing a prompt entitled "MainMenu" that speaks: "Thank you for calling talk 411, please speak the name of a business or a type of business and we will get the information for you." Note that in one embodiment ofthe invention, the words or phrases are played back from earlier recordings, and that in a different embodiment they are synthesized based on the words or text. Either approach may be utilized and some implementations may utilize a combination of the two. In general, the speech quality is better using pre-recorded speech and is preferred. Table IV. Exemplary embodiment of Main Menu from talk41 l.vs script file. func MainMenu ( )
# This function lets the caller select a business, a type of business or
# enter the member area #=________=___=________=________________=_-_=____=_____=_=____===__----- do voslog ("Line: ", Line, " Main Menu");
# Reset the VR Variables before getting the next VR Business-" " ;
Modifier-" " ; Type="" ; Command-" " ; if (GreetingPlayed)
Prompt = "MainMenu" ; else
Prompt = "Greeting";
GreetingPlayed = true; endif
# Get a Voice Response from the caller GetVoiceResponse ( " .MainMenu" , Prompt, "MainMenu" ) ; voslog ( "Business-" , Business) ; voslog ( "Modifier-" , Modifier) ; voslog ( "Type=" , Type) ; voslog ( "Command=" , Command) ;
# Did they speak the name of a business? if (Business strneq "")
Businesslnfo ( ) ; else
# Did they speak a business type? if (Type strneq "")
TypeMenu ( ) ; else if (Modifier strneq "") ModifierMenu ( ) ; else
# Did they speak a command switc (Command) case "Help" :
Play ("Help") ; case "Quit":
Play ("Goodbye") ; TerminateVoice ( ) ; case "Member" :
MemberEntry ( ) ; default:
Play ("Help") ; endswitch endif endif endif until (Command streq "Quit") endfunc Table V. Exemplary .MainMenu Grammar from talk411.grammar Nuance Voice Recognition Component
.MainMenu [
Business Type
(Modifier Type) (Type Modifier) Modifier Commands
Assuming the caller spoke something the system detect what they said. If they spoke the name of a business then the string returned would not be empty. If the string is not empty, it is processed as if the caller spoke the name of a business. If the type variable has been set, the system knows the caller spoke a type of business and we process that. The program and system parse through and analyze the results that come back from the voice recognition process (Nuance) and process it. Note that the voslog operation merely displays to a display screen. It is further noted that this exemplary main menu is taken from a simple example of a talk411 script file used for demonstration puφoses.
In connection with the get voice response (GetVoiceResponse) operation, the ".mainmenu" parameter is a reference to a Nuance voice recognition grammar file that will be used in processing the voice response. When programming Nuance, Nuance requires the identification or definition of each symbol, word, number, or phrase that a caller may say in response to the prompt. Many or even most common words and numbers, as well as common phrases are defined in files provided by Nuance; however, current versions of the software do not allow unlimited words such as might be supported by desktop voice recognition applications. That additional level of flexibility and sophistication invariably require prior training by the speaker so that the speaker's voice characteristics are understood by the speech or voice processing software.
Within the ".MainMenu" portion ofthe talk411. grammar file are six top-level grammars: Business, Type, Modifier Type, Type Modifier, Modifier, and Commands. Anything with a capital letter as the first letter, such as capitalized "Type" means that the item or type is defined later on in the file or in another file. Further down in the same talk411. grammar file appears a block of code pertaining to further definition of the main menu "Type" top-level grammar. In what follows are a set of definitions, including definitions for such groups as attorneys, auto parts, and the like as indicated in Table V.
With respect to the "<Type "Attorneys'^ in the right-hand column, there are three ways in this example that a caller might refer to attorneys, including "attorneys", "lawyers", or "ambulance chasers". Clearly other deserved or undeserved names might be included. These are all the ways someone might say attorneys. In similar manner, the <Type "Auto Repair & Service "> provides several different ways in which a caller may request this type of business, including: "auto repair", "auto service", "auto repair and service", "auto service and repair", "auto repair service", "auto service repair", and even "auto repair and repair", "auto repair repair", "auto service and service", or "auto service service" though the later four combinations are unlikely to occur. Note that in the Grammar Specification Language (GSL) in which the definitions are written, the square brackets "[...]" mean "or", the parentheses "(...)" mean and, and the "?" mean that the item is optional, have meaning such as "and" or "or" which is part of the Nuance "grammar specification language" GSL. The "?" means that the term is optional. For example, one can merely say the word "hamburger" (as well as
"hamburger restaurant") and the system will respond as if the caller said "hamburger restaurant".
Note that the "Businesses" are all included in another file called
"talk41 1 businesses. grammer", but it is defined here (under the .MainMenu). The talk411 business. grammer is a file the contains all of the actual business names with variations on the way a caller may refer to a business. For example, a caller may speak either "Cisco" or "Cisco Systems" and mean the same business.
The other elements such as "Modifier" are identified with or mapped in similar manner using GSL. Within "Modifier" the parenthetic lowercase "coupon" is the actual word coupon that is spoken. These are the things that can be used to modify business or type. One notes that in the main menu, provision is made such that a caller can say "coupons for hamburger restaurants" and the system would extract both the modifier and the business type. Understanding the way people speak and providing the alternative expressions in the system is beneficial as it encourages individual consumers to use the system.
Table VI. Exemplary "Type" grammar from talk411.grammar file for Nuance
Voice Recognition Component
Type r L
[attorneys lawyers (ambulance chasers) ] {<Type
"Attorneys" >}
(auto parts) {<Type "Auto Parts" >}
(auto [repair service] ?and ? [repair service] ) {<Type "Auto
Repair & service" >}
(Chinese ?food ?Restaurants) {<Type "Chinese
Restaurant" >}
(hamburger ?Restaurants) {<Type "Hamburger
Restaurant ">}
(italian ?food ?Restaurants) {<Type "Italian
Restaurant" >}
(Japanese ?food Restaurants) {<Type "Japanese
Restaurant" >}
(mexican ?food ?Restaurants) {<Type "Mexican
Restaurant" >}
( [physicians surgeons] ?and ? ^physicians surgeons]) {<Type
"Physicians & Surgeons" >}
(pizza ?Restaurants) {<Type "Pizza">}
( [steak seafood] ?and ? [steak seafood] ?restaurants) {<Type
"Steak & Seafood Restaurant ">} ]
The grammar files are ultimately compiled to create a grammar library that identifies all ofthe various permitted combinations and permutations that are matched to. One may appreciate that for a large vocabulary, this data structure can become quite complex. A library of spoken words and phrases are used that are used to match and recognize the permitted words and expressions. In one embodiment, the libraries are built using the spoken expressions of human speakers.
The talk41 1. dictionary file is a file that contains an explanation of words or phrases that are not in the Nuance engine. This file is needed when one compiles, however the file may be and frequently will have no entries. It contains the phonetic
GSL description of the words or phrases that are not known by Nuance.
The Nuance Voice Recognition System was developed using source files called "grammars" which define the range of speech a caller can say in response to a given prompt. Like VOS scripts, once a grammar has been defined, it too must be compiled using the "compile-grammar" command. A Nuance license has heretofore been required on a per-port basis and the Nuance license has been controlled by a software key.
Grammars may conveniently be characterized as either "Top-Level" grammars and "Sub" grammars. Top-Level grammars are initiated from the VOS script. Sub grammars are essentially the building blocks used to creating a Top-Level grammar.
Each Top-Level grammar defines all of the allowable responses a caller can say in response to a voice prompt. A simple example of a top level grammar would be ".Confirm". The ".Confirm" grammar may be used in response to a voice prompt such as "Is this correct?" The ".Confirm" would be constructed of two sub-grammars, "yes" and "no". Each sub-grammar would include all ofthe various ways a caller could say
"yes" or "no".
It is noted that these grammar files having all allowable responses may be seen as being somewhat limited given developments in speech processing and recognition. It should however be understood that the software and system are intended to recognize a response from some arbitrary speaker who may have never accessed the system before and has not trained the system or software with their voice. The circumstances may therefore differ from those present for various commercial voice or speech recognition products, such as those made by Lernout & Hauspie Speech Products N.V.; Flanders Languages Valley, 50; 8900 Leper, Belgium or Lernout & Hauspie Massachusetts Office, 52 Third Avenue, Burlington, MA 01803. International Business Machines also provides continuous speech recognition products. These products permit a user to train the system to recognize virtually any words, and take into account not only the speaker's voice but also the microphone, memory, processor, and program application characteristics of the overall system. It may be expected that in time, the inventive system and method would be compatible with such as system; however, as the number of possible inquiries and responses are limited and may be known in advance, the use of such grammar files does not present any practical limitation to the invention.
Grammar files are designated by a .grammar extension. At least two additional files are required before a grammar can be compiled: a .slot-definitions file and at least one .dictionary file. The .slot_definitions file defines slots for natural language recognition. The .dictionary file (or files), for example the talk411. dictionary file, defines words that are not in the Nuance grammar and thereby allows the total vocabulary which may be spoken and recognized to be extended.
Embodiments of the inventive system and method provide the following five vocabulary related files. The "Talk411. dictionary" file defines words that are specific to the talk411 system and therefore not in the Nuance grammar. (It is noted that future versions of Nuance may be modified to include talk411 words. The "Talk411.grammar" defines the Talk41 1 application grammar. A "talk411.missing" file is a file that stores the result ofthe compile and indicates words not in the Nuance grammar. The "Talk41 1.slot definitions" file defines "slots" for the natural language recognitions.
The talk411. grammar file is the main file that links any and all other grammar files provided by Nuance. Any file that is a ".grammar" file is defined by an include statement in the main talk411.grammar file. Most of these .grammar files are provided by Nuance, such as grammars for obtaining and processing commonly used credit card expiration dates, through their commercially available tool kits. Note that in addition to the talk41 1 specific grammars, the main talk41 1.grammar file includes an initial set of include statements that link the other grammars. A portion ofthe code is provided in Table VII so that this grammar structure and relationship is clearly set forth.
These "included" files handle, for example, common expressions, yes/no grammar, number grammar, thousands grammar, credit card number grammar, credit card expiration date grammar, and a larger talk41 1 business grammar that is a grammar containing the talk411 business, merchant, and organization names. In this particular embodiment, all ofthe grammars except for the talk411 business grammar are provided by Nuance, and are common grammars that are used for many different voice recognition applications.
Table VII. Exemplary talk411.grammar showing include statements to Nuance grammar files
Talk ll Nuance Grammar Definition File
Talk ll .Grammar
#include "common. grammar" #include "number .grammar" #include "yesno. rammar" #include "thousands .grammar" #include " CreditCardNumber . grammar" #include "CreditCardExpiration. grammar" #include "Talk llBusinesses .grammar" .MainMenu
[
Business Type
(Modifier Type) (Type Modifier) Modifier Commands
. TypeMenu
Business Commands (more ?businesses) { <Command "More " > }
]
(additional items not shown here follow) The "Talk41 IBusiness. grammar" lists all ofthe business in the system. In one embodiment, this Talk41 IBusiness. grammer is a dynamic grammar that can be added to or modified on the fly, that is as the system is up and running and being accessed by callers. In the other embodiments, such as developmental versions, this grammar is a static grammar meaning each business is added manually. In this manual version, the grammar is recompiled and the system restarted after making any changes to this file. An exemplary sample entry in the grammar is listed below:
(straw hat pizza) {<Business "Straw Hat Pizza">}
The entry on the left (straw hat pizza) is the actual phrase of set of words the caller must say or utter to select this business and the entry on the right is slot (Business) and the keyword (Straw Hat Pizza) that would be returned to the application for processing. Note that in this particular embodiment, the entry for the keyword must match the corresponding entry in the database exactly for a match to occur. Also recall that entries in the database and in the grammar may be established for variations ofthe phrase. For example, there may be an additional entry for "Straw Hat" that does not include the word "Pizza". Therefore the requirement for exact matching is not unreasonable given that all of the possible variations that may match are made available.
The talk41 1 business are just a list of business, such as straw hat pizza, Michael Italian foods, or other business names. These are just examples, and the names are put in a separate file for convenient administration. Ultimately, a business will be able to sign up without any human intervention using technology currently being developed. Under these systems, it will merely be a matter of the user typing in the name, and allowing the system to process it. If it can be recognized without help that will suffice, however in some instances it is anticipated that a user may have to say the new name one or more times. It is also possible to specify the phonetic characteristics using GSL or other phonetic descriptive language, however this skill would not generally be available to lay people using the system. Nuance also provides for "dynamic grammars" that permits adding entries to a running application on the fly. In this way, business subscription or membership may be implemented automatically without requiring system restart or the like. Nuance and Talk41 1 grammar files are listed with a description as to their function in Table VIII. Table VIII. Nuance and Talk 411 Grammar Files common.grammar Nuance provided grammar of common expressions
CreditCardExpiration.grammai Nuance provided grammar for credit card expiration
CreditCardNurnber.grammar Nuance provided grammar for credit card numbers
MAKEGRAM.BAT Batch file for compiling a grammar number, grammar Nuance provided grammar for numbers
Talk41 1.dictionary Talk41 1 dictionary file defines words not in the Nuance grammar
Talk411.grammar Talk411 application grammar talk411. missing Result of compile that shows words not in the Nuance grammar
Talk41 l.slot_definitions Defines "slots" for natural language recognitions
Talk41 lBusinesses.grammar Talk411 business thousands.grammar Nuance provided grammar for numbers in the thousands yesno.grammar Nuance provided grammar for yes/no responses
The inventive system and method also utilize a database file that stores Customer or Business information. In one embodiment, the database file (directory2.mdb) is created as a Microsoft Access 97 database file. Microsoft access was selected for this embodiment, as the other program modules making use of the database are able to store and retrieve information from the file without having Access program running. Other database structures or programs may alternatively be utilized.
In one embodiment, the database named "directory2.mdb" includes a table named "Customers" and Field Names: MemberlD, CompanyName, PhoneNumber, Address, City, Region, Category, Specialty, BayHitScore, BayHitRates, Coupons, and CouponMessage. These fields include entries which are clearly optional in character and pertain to certain specific features provided by the inventive system and method. For example, the BayHitScore, BayHitRates, Coupons, and CouponMessage fields provide novel informational features not present in existing systems or methods. In fact, nearly any database fields may be provided, it is merely a matter of identifying the type of information or content that is to be provided in response to a caller. It is anticipated that these informational or content items will include some identification for the business or entity (company name), telephone contact information (phone number), and address information (city and street address). It may also be desirable to provide some regional information separate from the city and street address (regions), and such region information may define a local neighborhood, or a multi-city or metropolitan area. Driving instructions from key recognized points of interest or locations may also be provided. Specialty may permit a member business to identify a particular product for which they are noted. For example, an Italian Restaurant may wish to identify themselves as specializing in Italian Vegetarian Cuisine, a night club as specializing in swing dance music, or a book store as specializing in first editions. There are a myriad of other fields that could be provided in the database and for response. It is noted that in one embodiment, the same database is used for the voice accessed and for the Internet accessed system. In the voice accessed system and method, the information may necessarily be limited to a set of words, short phrases, or expressions to simplify the voice recognition and text-to speech processes. It is also generally true that individuals accessing the inventive system from a telephone would prefer short to the point answers or information rather than lengthy explanations or pitches, particularly if such access is made while driving from a mobile telephone. Therefore, a single database may include entries that are common to voice and Internet accessed systems and those that are not common. For example, an additional set of fields may be provided that are served only when accessing over the Internet. Alternatively, two different databases may be provided which are optionally synchronized. One embodiment, maintains separate databases that are regional in character with the concept that a caller in New York City would not need to access the information for businesses or organizations in San Francisco.
Such a database is defined in a memory storage device (either in solid state memory or bulk storage device) on a computer system. The computer may for example be a server computer, many types of which are known in the art. The computer and any attached storage may be on the same machine or on a different machine. Structures and methods for building, maintaining, and accessing databases are known in the art and not described in greater detail here. Several fields for an exemplary customer data are listed in Table IX. These fields include the Company name and phone number, as well as optional but advantageously provided data fields for address, city, region, category, specialty, bay hit score, bay hits rating, coupons, coupon message. A member ID is also provided for accounting and identification puφoses.
Table IX. Exemplary Customer Database Fields
Field Names: MemberlD
CompanyName
PhoneNumber
Address
City
Region
Category
Specialty
BayHitScore
BayHitRates
Coupons
CouponMessage
Having described the software modules and procedures present in an embodiment of the inventive system, attention is now directed to operation of the system. In order for the system to be started, each software module must be launched in the proper order. In one embodiment, the order for launching and a description of each module is listed below. It is noted that this ordering applies the particular scripts, compiled scripts, programs, or procedures as they presently exist and that variations in this order may be anticipated for a different set of scripts, compiled scripts, programs, or procedures.
First, the "license.exe" is launched. This is the Nuance license engine that must be launched with a valid "license.txt" file which controls the available recognition sessions that are allowed on a given system. (This program file is executed as a condition of using the Nuance software and is not otherwise required for system operation.)
Next, the "resource-manager.exe" program file is launched. This is a Nuance program which is like the gatekeeper for controlling the Nuance sessions. The
"recognition-server.exe" is then launched and provides the Nuance grammar engine that is started with the grammar "package" defined for the application.
A VOS-Nuance interface "VRServer.exe" program is then launched to provide an interface between VOS and Nuance. Finally, the "VOS6W.exe" program file is launched. This VOS6W.exe file provides the Parity VOS application engine, and must have an identified valid .vx (compiled VOS script) file specified in order to start.
Conveniently, an executable file (StartVR.exe) is provided to facilitate this ordered launch and execution process. This executable performs the above tasks and monitors the program modules as they are running. One embodiment of a particular example ofthe general flow of new business user (new merchant) 201 interaction with the inventive system and method according to one embodiment of the invention is now described with general reference to FIGs. 1-4. A business user 201 is a user that is providing goods or services to consumers 101 where consumers also refer to the previous caller 101. The business user is desirous of having their goods and services made available to consumers over the inventive system and in promoting their goods and services to consumers.
Once the business user 201 calls, and gets identified as a new business user utilizing a business user registration procedure the business user is asked to say certain business registration information, including for example their name, name of the business, phone number, credit card number, and/or other pertinent business information. Once the registration information is obtained, the system 102 compares the information provided by the new business user 201 with the information that resides in the database 134. The database includes information regarding business so that the authenticity of the attempted registration can be verified with reasonable assurances. In the even that the information does not match, the system 102 may connect to other databases in an attempt to verify the authenticity or otherwise complete the registration. If the information matches, the user registration is completed. If the information does not match, the user is notified with a message providing the new business user with additional options or information, or to recommend trying to say the information again. For example, in one embodiment ofthe invention, the new business user is prompted with the audio message "Sorry, but the information you have provided does not seem to be correct, say 'again' to start over. You can also hang up and call again, say 'help', or register at our web site www.Talk41 l.com".
Once the new business user 201 is registered and a password is issued to the business user, then he or she is requested to record a short message that will be heard by the callers 101 who request the business user ' s phone number. Voice recognition can be used (in conjunction with an optionally previously stored authentic voice print of the business user) to authenticate the business user 201 in addition to or instead ofthe password 212 depending on the quality ofthe speech recognition technologies used and the quality ofthe line or other communication ling connecting the business user to the system at the time. So called "caller identification" available in some areas may also assist in verifying the identity ofthe business user where the business user would then be required to call from a registered telephone number.
Once the business user approves the short message just recorded, the recorded short message is published then he or she is requested to provide an additional longer message that may be or include a special promotion, directions to the business location, or any other information that will provide additional information to the callers. Both of these short message and long message are available for playback to callers and can also be viewed in text form by those who visit the web site and look up that particular business. The new business user can change either message completely over the phone, or edit it word by word on the PC connected to the web site. Opportunities to subscribe as a category sponsor, and to record and publish a category sponsor message, provide a service description, and to review or change messages are provided.
It is noted that the messages provided by the business may either be a representation ofthe business representatives own speech which is preferred so that the quality and character of the voice is maintained, or the message may be computer synthesized speech. The later being necessary if the business chooses to provide or later modify the message using text input on a computer. As maintaining original speech may be somewhat cumbersome, additional fees may be levied on the business for providing actual speech as compared to synthetic speech. Alternatively, the business user may be able to select from a set of available synthesized voice types so that the voice, even though not provided by the business directly, provides the intended feeling or emotion associated with the business. For example, a restaurant may wish to convey the feeling of romance.
In yet another alternative, the business user may choose a voice talent to record the message, or choose to have synthesized speech simulate a celebrity voice either living or deceased.
Having described the general operation of the system during a new business interaction, we now describe one particular exemplary embodiment ofthe new business interaction procedure. As illustrated in FIG. 3, the business user calls the decision is made as to whether the calling business user is a new user or in existing user. Once it is determined that a new account needs to be established, registration of the new business account proceeds as described above wherein the calling business user provides certain business registration information register the new account (Step 302). The registration information provided by the registering new business user must be verified (Step 304) before the new business user interaction can continue. If verification cannot be made, then the interaction is terminated (Step 306), otherwise the business user is prompted to record a short message (Step 308). The business user can then approve the recorded short message 214 or change the recorded short message until the business user is satisfied with the recorded short message an approves it for publication (Step 310) at which time the short message is published in a voice form and in text form on web site 134 (Step 312).
Business user 201 is then prompted to optional record either no message, and long message 218, or sponsor message 219 (Step 314). In the event that the business user to record no additional messages, the business user is thanked for providing the information (Step 334) and the interaction terminates (Step 336). If the business user chooses to record a long message than the business user records the long message (Step
316) and a given on opportunity to approve the recorded message or change that message until the business user is satisfied with the recorded long message (Step 318). The long message is published (Step 320) and the business user is again the given opportunity to learn about category sponsorship (Step 322). If the business user declines the opportunity to learn about category sponsorship, the business user is thanked for providing the information (Step 334) and the interaction terminates (Step
336). On the other hand, if the business user indicates a desire to learn about category sponsorship he or she is provided with the description ofthe sponsorship service (Step 324) and again asked if he or she wishes to subscribe to the category sponsorship service (Step 326). A category sponsorship message is a message will come up when the caller requests businesses in a category without a specific business name in mind.
Then the system will play back the message of the sponsors in that category in a pre-determined order, random order or a dynamically defined order based on some policy or membership level or category, or other rules. If the business user declines the opportunity to subscribe, the business user is thanked for providing the information (Step 334) and the interaction terminates (Step 336). If the business user indicates a desire to subscribe, he or she is given opportunity to record a category sponsorship message (Step 328) and further opportunities to either approve or change the message until he or she is satisfied with the recorded category sponsor message (Step 330). The category sponsor message is then published (Step 332) and the business user is thanked for providing the information (Step 334) and the interaction terminates (Step 336).
As illustrated in the flow chart diagram of FIG. 4, the procedures associated with the repeat or registered business user interaction are substantially the same as, though not identical to, those just described for a new business user interaction. The differences primarily concerned how the initial phase of the business user call to the system is handled. For in existing registered business user interaction, the system receives the business user call and determines if it is a new user or an existing registered user (Step 352). If the system determines that it is a new business user, then the procedure already described relative to FIG. 3 is executed. However, if the system determines that an existing registered business user is calling into the system, it presumed step the existing business user wishes to make changes to one or more ofthe items of registration information or to one or more ofthe recorded messages (Step 354). If the business user decides after placing the call that he or she does not wish to make changes than the interaction terminates (Step 356), otherwise the business user is asked whether he or she wishes to change the short message, the long message, or the sponsor message (Step 358) and given opportunity to change one or more of these messages. These messages are the Long, Short and Sponsor messages that he may have already input into the system via a phone or a personal computer of other information appliance.
The process for recording, changing, approving, and publishing each of these messages is similar to that already described relative to FIG. 3 and indicated in the FIG. 4 flow chart, and the business user it is similar given additional opportunities to learn about, subscribe to, and record messages pertinent to additional services provided by the system.
In order to make the user interface more satisfactory, additional steps can be introduced or some of the shown steps can be deleted from the interaction flow. For example, after the Business User makes changes to the short message, he can be prompted to see whether he wants to make any changes to the long or the sponsor messages. An example of the deletion of a step can be where the user is initially prompted to find out whether he wants to make changes and gets told that he can say anytime "make changes" and trigger the menu options. The amount of consolidation largely depends on the speech recognition technologies employed and the key words chosen for the speech recognition vocabulary. In addition, other embodiments of the invention may largely or entirely eliminate the particular command and data extraction procedure set forth in the above described procedures and replace them in all or in part by a natural language recognition and extraction procedure that either listens to the user's request in free form speech and extracts commands and/or data from the user's speech, or extracts the commands and/or data in part and intelligently asks additional questions of the user for any added information. In this sense, the inventive system and method provide logic for conducting a dialog or conversation with the caller. Essentially the same or substantially the same information is exchanged between the user and the system but with a more flexible interface that is more familiar and enjoyable to the user. Those workers having ordinary skill in the art in light of the description provided here will appreciate that the procedures described for existing registered users as well as for new business users may be modified to provide somewhat different options at each stage ofthe interaction or to provide different ordering ofthe options. Therefore, the interaction described here are merely exemplary ofthe type of business user to system interaction desirable in an implemented system, but does not limit the inventive system or method to these particular interaction schemes or procedures.
FIG. 5 shows an exemplary embodiment of the General User interaction 402 such as might occur when a consumer calls for information. Once the consumer user 101 calls, a greeting message 382 is played back (Step 404) such as "This is TALK411 , your best source for local information" followed by a sponsor message 384, such as for example "brought to you by Dialsurf, bringing the web to your phone" (Step 406). This sponsor message is typically a paid message by a sponsor. Then the voice menu 386 is played back to the caller (Step 408), such as "Please say your selection: Restaurants, Lawyers, Auto dealers, etc.". Once the consumer user says one ofthe menu items (Step 410), then he or she is prompted with a request message (Step 412), such as "Say the name ofthe business or say 'select'".
If the consumer user 101 says "select" or another word identified to the system that indicates to the system that he or she (the consumer user) should be prompted with a list of pre-selected business names, he or she is prompted with a request to specify selection criteria (Step 414). This criteria 388 is pre-determined and varies according to the type of business. In the case of restaurants for example, it may be "type of cuisine, city and zip code". In case of lawyers, it may be for example "type of practice, city and zip code". Once the user says the criteria 388, then the system 102 tries to match the requested category or criteria 388 to the closest category or criteria stored in the database 134. If the match is good (according to some predefined rules or decision algorithm or procedure), then the system will play back a number of business names pre-determined by the system (Step 418). These names can be picked from the database 134, in the requested category, in a pre-determined fashion, randomly or based on a dynamically changing criteria or some fixed set of rules.
The inventive system, method, and business model or operating method is applicable to a broad variety of business and merchant types including but not limited in any way to: restaurants, physicians and surgeons, auto parts, auto repair and service, pizza, auto dealers, department stores, attorneys/lawyers, dentists, hospitals, insurance, beauty salons, banks, plumbing contractors, florists, as well as many other types of businesses and services. The system may also be adopted to professionals, or other individuals or organizations other than those specifically qualifying as businesses or merchants.
One example of pre-determined way is for the subscribed businesses to pay the corresponding fees to be included in the top category (Category #1), second category (Category #2), and the like for a specific time period. An example of random procedure can be, as the name implies, based on a random number generator that picks a database record in the category requested. An example of dynamically changing criteria is when users rate the businesses on a real time or periodic basis and which ever business is rated highest gets to be heard as the #1 (first named), #2 (next named), #3 (third named), and the like down a hierarchical or other list.
After the pre-selections are played back, the caller is invited to say the 'number of the menu selection' or to say 'more" (Step 454), if the caller responds with the number ofthe selection, the number and a short message is played back (Step 450) and he or she gets prompted with a questions such as "Say connect or 'more' for additional information" (Step 452). If the caller says "connect", the caller is connected to the phone number that was found (Step 444). If the caller says "more" (Step 436) then the pre-recorded Long Message is played back (Step 438) with a question such as "Say connect or just hang up your phone" (Step 440). Based on the caller's selection, either the caller gets connected to the phone number (Step 444), or gets disconnected to the service (Step 446). Of course, different rules may be applied to permit the user to input different choices, however, in some situations it is desirable to have a user call in again when they have rethought their need rather than to tie up the connection for an extended period of time.
If the caller responds by saying 'more', then additional pre-selections are played back to give the caller more and different choices (Step 456). The caller may then either say the number of one ofthe new selections (Step 458) or terminate (Step 446). In some instances, the caller may be permitted to keep repeating the request for more choices until all choices available in the data base (or a predetermined number of such choices) have been presented to the caller. In either event, if the caller does not like or select one of the available choices, the call terminates (Step 446).
If there is no match (Step 420), an answer such as "sorry, but we cannot find this category in our list, try again" is played back (Step 422). After a predetermined number of tries (for example, after two tries) if there is no match, the system will say something like "sorry, we could not find a business that matches your request, please call us again" (Step 424) and terminate the call (Step 426). If there is a close match, the system will play back the match to verify the request for further action (Step 428).
Once the caller chooses the business by saying its name or menu number (Step 430), the number and a short message is played back (Step 450) and he or she gets prompted with a questions such as "Say connect or 'more' for additional information" (Step 452). If the caller says "connect", the caller is connected to the phone number that was found (Step 444). If the caller says "more" (Step 436) then the pre-recorded Long Message is played back (Step 438) with a question such as "Say connect or just hang up your phone" (Step 440). Based on the caller's selection, either the caller gets connected to the phone number (Step 444), or gets disconnected to the service (Step 446).
If the caller says the name of a particular business instead (Step 448), then the phone number and the Short Message (refer to FIG. 3) will be played back (Step 450) with an additional prompt (Step 452), such as "Say 'connect' or 'more' for additional business information (the Long Message per FIG. 3). Once the Long Message is played back (Step 438), the user will be prompted once more whether the connect or terminate the call (Step 440).
FIG. 6 shows an exemplary implementation of the Directory Service of the information center 136, where the Web Server 471 serves VXML or HTML/XML pages 470, the LDAP Server (Lightweight Directory Access Protocol Server) 472 runs over TCP/IP and provides quick response to high volume lookup to the Database 473. It is noted that this is merely one example of how an information center may be organized and that its computer or server based organization may be integrated with various types of call centers, service centers, and the like as are known in the telephony and server technologies. The Middleware 474 is the layer of software that integrates operations of the Web Server, LDAP Server, and the Database (or any additional software such as Transaction Server). LDAP Servers and operation are known in the art, and is described, for example as of 23 March 2000, at: http://www.umich.edU/~dirsvcv/ldap/doc/guides/slapd/l.html. An example of a Web Server for high volume application such as TALK411 is the Microsoft Internet Information Server (IIS). Microsoft IIS runs on Windows NT Server. LDAP also runs on Windows NT® 4.0 using Service Pack 4 or later, Windows® 2000, or Windows 95/98. All systems desirably have TCP/IP (or an equivalent capability) installed. Additional information relative to Microsoft products, including Microsoft IIS is available on their website as of 23 March 2000 at http://msdn.microsoft.corn/isapi/msdnlib.idc?theURL=/library/psdk/ldap/ld about 7euh.htm
Other optional but desirable features may also be provided. For example, one desirable promotional feature involves issuing an audio coupon (also referred to as voice coupon) to a consumer user of the inventive system. In one embodiment, a consumer user is issued an audio coupon entitling the user to a promotion. Typically, such promotion would entitle the user to a discount to be applied to the item or service purchased when the consumer user connects using the inventive system and method. This discount, for example 10 percent off, would only be available to the consumer user when using the inventive TALK411 system and is therefore an enticement for a consumer user to use the inventive system rather than dealing with the business through conventional means. Other promotions might involve a buy one get one free offer, of free drink with order of food type offer, or any other the other variety of promotional offers typically made in the retail trade between merchants and consumers.
The audio coupon may be provided in a variety of ways. For example, the business would become aware that the consumer user contacted the business using the inventive TALK411 system and automatically give the consumer user a discount (or other promotional item) when the your was placed. Alternatively, the consumer user might be given a coupon code which could only be available to a consumer user who ate utilize the inventive system, and a consumer user would provide this code to the merchant upon connection. This code might be generic to the business or particularized to that specific transaction. Therefore, in addition to be self promotion aspects ofthe business model, the optional use of audio coupons also provides considerable business advantages. In one embodiment of the invention, the system inserts a message to the merchant after the call has been connected to identify the caller as a valid service user and to validate the audio coupon. In a preferred embodiment, the use of audio coupons is integrated with the world wide web or Internet in that the audio coupons may be identified, stored to, retrieved from, or otherwise processed using the businesses or the inventive services web site. In this way, the consumer user is not limited to using the coupon at the time it was earned, but may instead be collected for later redemption. This also affords an opportunity to obtain a printed copy of the coupon for use at any later time.
In yet another embodiment, the audio coupon or a coupon derived from that coupon may be delivered to a personal data assistant or PDA (such as for example, an email enabled PALM VII) so that the PDA stores the coupon and serves as a medium for displaying the coupon the business, merchant, or organization. Independent of how the coupon is delivered, one aspect ofthe inventive system, method, and business model is to collect money or other revenue in what ever form for each coupon delivered. It is also advantageous to collect money or other revenue for each coupon redeemed either as a fixed amount per coupon or as a portion (such as a percentage) of the sale, or both. Collection of revenue for each coupon delivered is separate from collection of revenue for each redemption or sale.
A Professional Message Conversion Service and Method Embodiment is optionally provided and provides a capability for converting recorded or typed (character based) promotional messages to professionally recorded message. Once the user records the promotional message and saves it, the user is prompted to keep the message the way it is recorded, or to get the message re-recorded by a professional audio talent. Then the recorded message is sent to the mailbox ofthe audio talent. He or she records the message and saves it. An email is automatically sent to the user notifying that the message is recorded and will be activated once he/she reviews it.
In a further embodiment, the system has geographical context provided by a known location of the caller. For example, it is expected that mobile or cellular telephones will have capability to self locate, either using internal satellite-based Geographical Positioning System (GPS) means or by using various schemes known in the art for determining (or estimating) the location of a cellular telephone based on proximity to cellular base stations, hand-off s to base stations, and similar techniques. In any event, the inventive system provides for geographically-based recommendations, geographically-based promotions, as well as for geographically based audio coupon delivery. Here the geographic proximity may be established according to some set of rules which may for example depend upon the density of business establishments in the local area. However, in one embodiment the geographically directed audio coupons pertain to business within one to a few blocks of the callers location, in other embodiments to a mile or two, and in still other embodiments to the region ofthe city or town. Caller ID may alternatively be used when an address associated with the telephone number is available.
In yet another optional feature, consumer user's who call into the inventive service will be able to rate the particular business after they have utilized the businesses goods or services. For example, a consumer user having been referred to a restaurant using inventive system can later call in using a toll-free or free local phone number and provide feedback, such as in the form of a rating, relative to their experience. These ratings would then be compiled and made available to the local businesses. Hopefully such feedback would encourage the businesses to either maintain their high quality of service or to improve the quality of their service and/or goods in response to the consumer user's rating.
In another embodiment, these ratings were also serve as an additional information source for consumer user's and would be available either or telephone or on Internet based website. The business establishment having demonstrated a particularly high-level of goods or service based on these ratings would be placed into a category of highly rated businesses, such as "BayHits", would be available to the consumer user during his or her call into the system. So for example, when the user calls in to request "Italian restaurant in Palo Alto", if in one ofthe candidate restaurant played back to the caller happens to be a "BayHit" then that restaurant would be indemnified as such. For example, the caller might receive a message "II Fornaio - a
BayHit". Alternatively, consumer user may be able to request "BayHit Restaurants" and receive only a list of restaurants satisfying the BayHit criteria. In some embodiment of the invention, the rating or BayHit feature may be provided free to the businesses while in an alternative preferred embodiment businesses falling within the highly rated or "BayHit" category would be charged the nominal fee. Those workers having ordinary skill in the art in light of description provided here will appreciate that this rating and promotion scheme may be implement in a variety of ways and that the particular descriptions provided here are merely exemplary ofthe more general method. The ratings may alternatively or additionally be provided on an Internet website (such as http://www.bavhits.com so that information obtained from caller's using inventive system method would be available to other individuals and businesses as well. BayHits is the name for a promotional vehicle for local businesses to promote their products and services through the telephone and the web to the San Francisco Bay Area (or any other) population. Analogous names may be applied to other regional markets. Businesses offer a promotional discount of some sort in order to participate and consumers have to register and participate in the "ratings" in order to earn points redeemable towards subscribed discounts. For example, John Doe registers either at the web to be a BayHits Rater. He is immediately given some points (for example 500 points). He can spend these points in exchange for "deals" from participating businesses. Once he calls in and rates one of the participating vendors, he will earn points for each rating. Ratings will be made available to the participating vendors on the web site. Each participating vendor will be able to see the number of redemptions, ratings for their own business compared to the rest of their category without specific business names. This will help them understand where they stand against their competitors and give them a chance to change their promotional strategy. This is a win-win program, since participating consumers continue to win points, and hence discounts from participating vendors. Participating businesses win because they get increase in patronage and get real time feedback through the phone or the web. If businesses get high ratings, they will be promoted free of charge on the Talk41 1 phone and web services by the label "BayHit". For example, if ABC Pizza is rated high enough to be a BayHit, every time someone calls to find out info about ABC Pizza, it will be announced "ABC Pizza, a
BayHit". Furthermore, when callers call Talk411 , they will be able to ask for "BayHits" in a specific Category and get connected. In one business model participation in the BayHits program will be free until brand is recognized and momentum is built with participating vendors. Subscription to the Talk411 service is a prerequisite to become a participating business in the BayHit program.
Other optional services and procedures may also be provided. An optional Audio Talent brokerage portal provides a list of audio (voice) talents. In a reserved area of the web site a list of audio talents are provided to the business users. Once the business user clicks on the audio talent, a pre-recorded message is played back. The user selects to purchase the services ofthe audio talent of choice, and provides payment means, such as for example credit card or other account information necessary for the transaction.
An Audio Coupon redemption tracking mechanism by auto decrementing a total count of purchased coupons. Audio coupons are purchased through the phone service or through the web site. Once the coupon message is saved it is ready for delivery to users. Once the user gets connected to the business who has purchased the coupons, a coupon counter gets decremented by one. The balance of available coupons is updated in the business account information provided to the business on the phone or on the web site.
Promotional messages may be displayed in an Internet web-based search box. These messages may be rotated based on predetermined criteria, such as x many messages in y minutes. When the user hovers or clicks the mouse or other pointing device over the message box the message will get fixed. When the user hits return, a page from the Talk411 web site will be served with information about the business with the displayed message. If the user starts typing, the box will be cleared and accept the keystrokes of the user. Facsimile Business Information Input Service Embodiment provides for fax input of business information that gets scanned and converted to text and get played by TTS over the phone. In order to motivate and provide ease-of-use to the business users, a fax back service will be provided, where when the user calls in to the Talk411 service and wants to be a member, he/she will be faxed a form page with fields of information that need to be filled out and faxed back to Talk411. This information will be scanned, edited and accepted to the system. Then a fax will be sent back to the new Business User notifying that a membership is formed and that the user should call Talk411 to check and accept the information.
Some of the steps in this procedure can be changed, left out, or combined to make the user interaction to be a satisfying experience as will readily be understood by workers having ordinary skill in the art in light ofthe description provided here.
Exemplary Typical Call Flow for a Member Business Using Talk411 Service
Some typical representative call flow scenarios are now described. These are merely illustrative as both the content and ordering ofthe content may change to suit local or region speech patterns, the goals ofthe call service, the demographics ofthe caller to the extent they can be determined, and numerous other objective and subjective or cultural factors.
In this embodiment, a new business manager/owner calls in and registers to be a member. She/he is asked to provide a credit card number for business information verification and services. Then, she/he gets prompted to verify the existing information and subscribe 3 months free of charge to the basic services where she is told she can record her promotional or informational message at the prompt. Then she/he is given a chance to review the existing business information in the Talk411 database, edit information if not correct, review and publish. Then she is given a chance to review and publish her promotional or informational message and asked if she wants to learn about enhanced membership services. If she/he says yes, then she is briefly explained what those are and given a chance to subscribe. She/he is told she can always change her message, subscribe for additional services either on the telephone or on the Talk41 l.com web site.
First Exemplary Script for Consumer Call:
"Hello, welcome to Talk411, Bay Area's fresh information source" "Please speak clearly. You can cut me off at any time, I won't get offended" "If you are a member or want to become one, say 'Yes', if you want me to Find a business for you, say the name, or the type ofthe business" ABC Pizza
"ABC Pizza is at 408-255-0000, say "connect", or say "more" for more info" More "There is no additional info. Do you want me to connect now or would you like me to recommend others in the area?" Others "Choice Domino. Choice 2: Straw Hat with 10% off on any large Pizza, say your choice"
If answer is: Straw Hat "You will need the following number to get your discount. Write it down while I connect. It is: SHI. Again, it is SHI. Thank you for calling us" If answer is: Domino 's
"All of Domino's Pizzas are made with real cheese and guaranteed to be hot on delivery! Thank you for calling us."
Exemplary Script for Business Call: "Hello, welcome to Talk411, fresh info source for the Bay Area"
"Please speak clearly."
"If you are a member or want to become one say 'Yes', if you want me to find a business for you, say the name or the type of business" / want to become a Member "What is your business name or business phone number?"
ABC Pizza "I do have some info for your business. Please answer my questions so I can process your membership. Your first three months are free of charge. We do require a major credit card so we can continue to provide you with uninterrupted service and give you the ability to order audio promotions and coupons any time any where on the phone or at the Talk411 web site. We will not charge your card until your 3-month trial period is complete. Please provide a MasterCard or a VISA credit card number followed with the expiration date" xxxx xxxx xxxx xxxx and the expiration is 03/01 "What is the full name on the card?"
John Doe "What is the zip code for the billing address?"
94102 "Please wait a moment while we process your information" (may play an ad here) "Thank you. Your password is PassOO 1. Please write it down. Again it is PassOOl . Now you can dictate a short message that I will deliver to callers when they request your business information. You can change this message anytime here or at Talk411 web site by signing in with your password. Please dictate at the prompt or say Exit"
ABC Pizza is located right next to the Great Mall and we serve anywhere in the Bay Area
"This is how I understood your dictation: " ABC Pizza is located right next to the Great Mall and we serve anywhere in the Bay Area ". If this is not correct please repeat the message. If it is correct, say so" (after two tries, go to the Talk411 web site to type in your message). Correct
"Congratulations. Now all callers who request your business info will hear your message. I can promote your business in other ways as well such as Category Sponsorships and Audio Coupons. Please visit our web site at www.Talk411.com and give me a call again. When you are ready to purchase additional services you can say "Promote" any time and I will step you through it. Thank you for your business"
Exemplary Script for Experienced Consumer Call:
"Hello, wel...
Pizza Coupons
"What is your zip code?" 94122
"Say your choice any time to redeem your coupon and connect:
Domino's two for one for any large pizza
Straw Hat 10% all Medium size pizzas
Big John's 50% off the second pizza... Big John 's "You will need the following number to get your discount. Write it down while I connect. It is: BJ1. Again, it is BJ1. Thank you for calling us"
Exemplary Script for Experienced Business Call:
"Hello, wel... Member ABC Pizza
"Please say your password"
PassOOl "Do you want to change your message, or purchase new promotional services?" Promo "Extended Message, Category Sponsorship, General Sponsorship or Audio Coupon?"
Extended Message "This feature is an additional $20 per month. You will be able to provide the information most callers are requesting for your type of business by answering the questions I will ask, or you can leave a general message X seconds long. Say "Template" or "General message" to pick one"
Template "Do you deliver?"
Yes "Do you charge for delivery?" No
"Which zip codes or cities do you deliver to?"
San Jose, Milpitas "What is your house specialty pizza and how much does an 18" one cost?" ABC Supreme and it costs $15 "Do you offer per slice and if you do how much for a pepperoni/cheese slice?"
Yes, it is $2.50 per slice "What are the closest freeway and the name ofthe exit?"
101, exit is 1st Street "Do you take credit cards, if so which ones?" Yes, all major credit cards
"Do you promote your specials on Talk411 service and the web site?" Yes "What the most ordered Pizza?"
Supreme (if not understood invite to input this field at the web site, or try again later) "Thank you. All of this information will be displayed on the Talk41 1 web site and will be available to callers who are interested in listening." (Note: A feedback mechanism to verify the inputs are correctly recognized may also advantageously be provided.)
"Do you want to purchase Audio Coupons?" Yes
"Audio Coupons are sold in quantities of 100, 500, 1000, 5000, 10000. Price for 100 is $1 each, 500 is 90cents, 1000 is 80cents, 5000 is 75cents and 10000 is 70cents per coupon. Please pick a quantity" One thousand "Please dictate the discount message you want to communicate"
Purchase one large pizza and get a medium pizza free "Thank you. Callers who request to hear coupons or info about your business will hear your message. If you would like to be the first or second heard when callers ask for your category, please say "promo category sponsorship, otherwise say exit or hang up" Promo category sponsorship
"Do you want to check availability by date or by preference order?"
Preference order "#1 is available April 8 at $5000, #2 is available May 15 at $2500, and #3 is available now at $1000. All prices are valid for 3 months, and can be purchased for one, two or three months. Please say the pecking order you want to purchase followed by the number of months."
#1 for 3 months "Your total will be $15,000. Would you like us to charge this to your credit card on record or would you like to send a check?" Send a check
"You have till xx/00 to send your check. If not received by this date, your promotion will be cancelled, and a deposit may be required in the future. Please send your check for $15,000 to Talk41 1 Service, P.O. Box 10000, San Jose, CA 95131, again (repeat).... Thank you for your business. You can check the status of your account and order additional services through the Talk411.com web site. Thank you for your business." It is noted that for experience callers, the scripted message may be interrupted by the caller so that his or her response may be acted upon more quickly. It is not necessary for the caller to listen to the entire prompt before responding. This feature reduces or eliminates potential irritation at having to wait to respond.
Exemplary Script and Call Flow for Caller Interaction:
Caller interaction with the inventive system is important to the commercial success ofthe system, method, and business. In general, consumers do not tolerate an interaction that is overly rigid or inflexible, or that requires carefully formatted responses. FIG. 7 is a flow-chart diagram setting out the caller flow and interaction with the system for a method of interaction between the inventive system and a caller.
The flow-chart of FIG. 7 is broken into FIG. 7 A - 7E for convenience in presentation. The flow-chart includes call flow procedures as well as procedures for articulating messages to the caller and for receiving inputs from the caller and may serve as a basis for developing the call handling procedure and the talk41 l .vs script file. With reference to FIG. 8 (FIG. 8A-8B), an exemplary embodiment of a second different call handling procedure 700 is provided. This is another example of a call handling procedure which, like the others provided here, may form the basis for a VOS, voiceXML, or C++ based script implementation. It starts 702 with receipt of a telephone call 703. The call may be directed in one of two ways, to member services 704 or to information services 750. For the member services path 704, a determination is made as to whether it is an existing member or a new member 705. If it is not new, then the member may be either a member 706 or a business member 718. If the member is a business member 718, then the member's ID is obtained and verified 719. Next a check is made as whether they wish to check a rating or request information on, make or modify a promotion 720. If they choose to check their rating, they are given the requested rating information 721 and then asked if they wish to check their rating or promotion again 720. This allows them to sequentially address both rating and promotional issues without requiring a call back. The member is then permitted to select between sponsor and coupon paths 724. Should the business member choose coupons, they are provided with information concerning coupon options and/or coupon pricing 725 and then given an opportunity to buy coupons or obtain additional information 727. If they instead select sponsor, they are provided with information concerning sponsorship category options 726 and then permitted to buy a sponsorship or obtain additional information 727. Should the business member then chose to receive neither coupon nor sponsor information, the call terminates or ends 749.
In the caller chooses to receive additional information, they receive a benefits pitch 728 and are then given an opportunity to subscribe or verify 729, and if the caller business member chooses subscribe, a request is made for credit card information (or other account, billing, or payment information) 743. Credit card (or other account, billing, or payment information) is also collected if the earlier caller decision was to buy coupons or a category sponsorship 727. If the caller chooses to verify 729 and the information provided cannot be verified 730, then the caller is connected to customer service 731. Customer service may be an automated system, a live operator, or a combination of the two. If the verification can be made 730, then the caller is asked to select 744 if they wish to subscribe in which case the credit card information is obtained 743, end the call 745, or obtain more information (including at this point information that may only be available from a customer service representative 731 , but also from a benefits presentation or pitch 728. After credit card information has been obtained (and desirably verified) 743, the caller is asked whether they wish to record the promotional message or coupon over the phone at that time or using one of the available internet web based recording interfaces 747. Recall that the web based interfaces permit voice recording, typing the promotional item for automated text to speech conversion, utilizing a voice talent, using a synthesized celebrity or historical characters voice, or other option. If the caller chooses to record now, the caller is given an opportunity to record and verify the recording including opportunity to re-record until satisfied 748. After selecting the recording option, the caller is again asked to choose other options relating to sponsorship and coupons 724 as already described. It is noted that other options and call flow may be added or modifications made and that the particular choices available at any point may readily be altered. It is further noted with reference to the flow chart in FIG. 8, that after the caller selected member services 704 and identified themself as a new member 705, somewhat different information is required before being asked if they wish to subscribe or verify 729 or be connected to customer service 731 , where the afore described procedures are repeated for the new business. In particular, the system asks if they are a business 737, and if they identify themself as a business provide their business name and telephone number 740. It the matching business is found, they are asked to subscribe or verify 729 and which point they make a selection and proceed as already described for existing members. In on the other hand they are not a business 737, they are requested to provide their telephone number and password 738, and to chose between setup over the telephone and web based set up 739. If they choose web-based set up, set up will occur over the web at a later time, and the call ends. If they do not select web set up, the caller is given an opportunity to learn about specials and avail themself of specials as desired 713. The caller may then choose 714 to connect to a business or modify from a list 715. This opportunity is repeated so that the caller may select different available options. Once the caller chooses to connect to a business, the caller is connected 716, and the call to the service ends 717.
Referring back to the point in the procedure when the caller selected member services 704, and indicated that they were not a business member 718, but rather an other member 706, the members identity is obtained and verified 707. That caller member is then asked to choose between ratings, points check, or My 708. If the caller chooses My, the caller presented with specials 713 as previously described. If the caller selects ratings check the caller is advised ofthe business and rating 710, and if the caller selects check points, is advised ofthe current points earned 709. The rate/check/my loop iterates so that the caller can cycle through and obtain more than one type of information while connected.
Again referring to FIG. 8, the incoming call 703 may be a request to obtain information services from a non-member (or even from a member) caller 750. Usually the request will be for one of four possible information types, though other types may clearly be added, and some existing ones deleted. A request may be for a yellow pages (YP) type request where the caller desires to find matches in a category 752. When such request is made, a predetermined number of matches are provided, in this case three are provided at a time, with a total of up to nine if additional matches are requested 756. These numbers may readily be modified according to response and need. A different request may be to find coupons 753. Agin three are initially provided in the requested category, with up to fifteen being provided as requested 757. The request may also be a request to find BayHits 754, and in similar manner three matches are initially provided with up to nine total matches as requested 758. After any of these three options are exercised 752, 753, 754, the caller is given the opportunity to select one ofthe matches or request more 759. The caller may alternatively make more specific 411-type (but with added features) directory assistance type request for the name of a business 751 , optionally specifying the name and coupon number 755. In this situation as well as when one of the other options were selected 759, the caller is given an opportunity 760 to connect to the business 763 or to obtain more information about the business 761 before connecting. If the caller chooses to connect 763, the call is announced along with any promotion or coupon promise 764, at which time the call to the service ends 765. If the caller requested more information 761, such information is provided, and the caller is asked to choose between connecting to the business 763, ending the call 765, or returning to information services 750.
Internet Web-Based System And Method
The inventive system and method are also implemented from an Internet or other networked computing environment using conventional keyboard, mouse, and display interfaces or with the addition of voice input and speech-recognition capability. Therefore it should be understood that the system and method described relative to the voice recognition embodiment may be implemented from a personal computer or other information appliance that supports voice input, such as using a microphone and modem connection to connect over an analog or digital telephone or other radio- frequency, optical, cable, or wireless communication link. Alternatively, though with some sacrifice in ease of use, the voice recognition component may be substituted by a typed or character string based interface. In this light, rather than repeating the various embodiments ofthe invention for an Internet Web based implementation, this section shows various web screen shots and the manner in which web pages are organized and navigated to provide an easy efficient and enjoyable experience. It is further noted that certain options are provided in the voice-recognition implementation to conduct some ofthe more detailed and time consuming steps at a later time using the talk411 inventive system.
The Talk411 Internet application is developed using Microsoft's IIS (Web Server) and Active Server Page (ASP) technology. The Active Server Pages access the same database as the Voice system and deliver similar information. The Internet web-based Talk411 application includes both HTML and ASP files, both of which file types are well known in the art. Basically ASP files are HTMP files that include some active content. Exemplary html and asp code is provided for the getbusiness.asp file, a portion of which pertaining to "Dim Business" is duplicated in Table X. Angle brackets identify key words, and anything that starts with a bracket and a % sign is active server page code. The indicator "html" is a label for the html page. The portion of the code identified by "DIM BUSINESS".
For the Internet or world wide web based application All active server page files (.asp file) are html pages with a little code that can create some html on the fly. In this case, since we are reading from a database and want to display the name, pre-written html does not have database interaction, nor does it know what database item it will need to provide or display. Therefore, there is a need to use active server pages (.asp) where you can put in a little code that instruct to go to data base to get name and display the information that was retrieved. An example of a portion of active server page code is provided in Table XI.
Table X. ASP and HTML Files for Exemplary Internet Web Site
Default.htm Main menu page (601)
Business. asp Gives user a list of valid businesses to select from (602)
Types.asp Gives user a list of valid business types to select from (603)
BayHits. asp Displays business that meet the BayHits criteria (604)
Coupons. asp Gives user a list of businesses with coupons to select from (605)
Members, asp Member login form (606)
GetBusiness.asp Displays information for a given business (607)
GetTypes.asp Gives user a list of valid business for a specified type (608)
MemberMenu . asp Validates member ID and password, displays menu (609)
UpdateBusiness . asp Validates business information from MemberMenu.asp for update (610)
RateBusiness.asp Validates business rating and updates database (611)
Rate.asp Allows user to select a rating for a business (612)
InvalidRater.htm Displays if rater number is incorrect (613)
InvalidMember.htm Displays if member login is incorrect (614)
Partners.htm Displays Talk411 partner list (615)
Table XI. Exemplary portion of active server page and html page code for talk411 Internet application.
<%
Dim Business
Business = Request ( "Business" )
%>
<form name=TypeForm method=post action- "GetTypes .asp" >
<% Set conn = CreateObject ( "ADODB. Connection") %>
<% conn. Open "Driver-Microsof Access Driver
(*.mdb) ;DBQ=c:\ιnetpub\clιents\talk411\dιrectory2.Mdb" %>
<% Set rs = conn. Execute ("SELECT * FROM [Customers] WHERE [CompanyName] = '" <
Business & " ' ") %>
<hl><%= rs ("CompanyName" ) %></Hl>
<h2>Phone: (<%= Mid (rs ( "PhoneNumber") , 1, 3) %>) <%= Mid (rs ( "PhoneNumber" ), 4 , 3)
%>-<%= Mid (rs("PhoneNumber" ) ,7,4) %><BRxBR>
<%= rs( "Address") %><BR>
Figure imgf000079_0001
"Specialty" ) %></H3> Those workers in the art will appreciate the flexibility available in web page design and construction using html alone or in conjunction with asp, therefore only a few examples of the above listed web pages are illustrated to provide examples of a relatively simple graphical user interface. The use of various buttons, hot spots, pull- down menus, imbedded GIF files and the like are well known in the art and not described in any detail here.
FIG. 9 is an example ofthe Coupons.asp and gives user a list of businesses with coupons to select from (605). This particular embodiment of the coupons web page interface provides Find A Business, Search By Type, BayHits! , Coupons, and Members button or menu bar. The coupons web page also provides a scrollable list for
Businesses with Coupons.
FIG. 10 is an example of a GetBusiness.asp page (607) and provides information for a particular business identified by the user. The web page presents similar buttons to those on the Coupon web page so that the user can readily navigate between the pages using a common interface. The get a business page also provides a menu button for selecting another business in the same category. In this case, other auto repair and service. BayHits and a rate me menu button are also provided.
FIG. 11 provides an example of a member menu web page generated by a MemberMenu.asp file that validates member ID and password, displays menu (609). It provides a box for entry of a member ID number and another box for entry of the member Password. The user enters this information and clicks a submit button to submit the information and begin processing. Security measures as are known in the art may optionally be implemented.
FIG. 12 provides an example of a member menu for purchasing and editing a coupon message. Other embodiments ofthe web page for coupon generation provide templates for entering and arranging the information so that the member may try different alternatives for web based coupons that may be printed. It may also provide an interface for picking voices, sampling voice talent, selecting celebrity voices, and selecting simulated voices for text-to-speech conversion. These may be tested at the web site and made available for the telephone handset based implementation.
FIG. 13 is a diagrammatic illustration showing the relationships between the various web pages and web page asp files described in Table XL Various other paths between web pages are possible and may be used. Alternative web pages may be designed, and features from multiple web pages may be combined into a single or into fewer web pages.
Additional Description
The foregoing descriptions of specific embodiments of the present invention have been presented for puφoses of illustration and description. They are not intended to be exhaustive or to limit the invention to the precise forms disclosed, and obviously many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles ofthe invention and its practical application, to thereby enable others skilled in the art to best utilize the invention and various embodiments with various modifications as are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the claims appended hereto and their equivalents. All publications and patent applications cited in this specification are herein incoφorated by reference as if each individual publication or patent application were specifically and individually indicated to be incoφorated by reference.

Claims

I Claim:
1. An operating model for a telephone-based voice-interactive goods and services information and referral service having merchant self-promotion features, comprising: an information database provider storing merchant information, said information including a name, a telephone number, an address, and a promotional information item; a merchant interface for inputting merchant information into said database and for retrieving and editing said information, at least a component of said merchant interface comprising a voice-recognition interface and an internet interface; and a consumer interface for inputting voice commands and data having a voice recognition component and for receiving merchant information and processed information from said database in response to said input voice commands and data.
2. The operating model in claim 1, wherein said consumer interface comprises a telephone handset.
3. The operating model in claim 2, wherein said consumer also inputs non- voice commands and data from a keypad on said telephone handset.
4. The operating model in claim 2, wherein said telephone handset comprises a mobile telephone.
5. A data-base access system comprising: a voice recognition engine receiving voice input data and commands from an external device over a communication link and converting the voice input into digitally represented character-based commands and data; a data base storing a plurality of data items, said data items including a name, a telephone number associated with said name, an address associated with said name, a category associated with said name, and at least one promotional data item; a search engine searching the database for a particular data item in response to said converted command and data; a speech engine providing a speech-based representation of said particular data item identified in said database search, said speech-based representation being selected from the set consisting of a text-to- speech conversion engine, a prerecorded live voice, and combinations thereof; and a speech server for communicating said speech-based representation of said particular data item to said external device.
6. A system as in any of the preceding system claims, further comprising the external device, wherein said external device comprises a voice/speech input device.
7. A system as in any of the preceding system claims, further comprising the external device, wherein said external device comprises a telephone.
8. A system as in any of the preceding system claims, further comprising the external device, wherein said external device comprises a device selected from the group consisting of a personal computer, notebook computer, personal data assistant (PDA), information appliance, or combination thereof.
9. A system as in any ofthe preceding system claims, wherein said communication link comprises the Internet.
10. A system as in any of the preceding system claims, further comprising means for communicating and validating a promotional audio coupon.
11. A system as in any of the preceding system claims, further comprising means for receiving rating inputs from users and for providing processed ratings inputs to consumers.
12. A system as in any of the preceding system claims, wherein said name comprises the name of a business, and said at least one promotional item is selected from the set consisting of a voice coupon data item, a business rating information data item, a spoken self-promotion item associated with the name.
13. A system as in any of the preceding system claims, wherein said system provides audio coupons that operate as incentives for consumers to use the inventive system.
14. A system as in any of the preceding system claims, wherein said system provides means for submitting and retrieving ratings for goods and/or services.
15. A system as in any ofthe preceding system claims, wherein said system assists in providing directory driven wireless commerce.
16. A system as in any ofthe preceding system claims, wherein said system provide a promotion and advertising channel that has geographical and sociological reach and the speed needed in today's dynamic financial and commercial markets.
17. A system as in any ofthe preceding system claims, wherein said system provide a voice-interactive dynamic market place where individuals call to save and businesses call to publish sales promotions in real-time or near real-time.
18. A system as in any of the preceding system claims, wherein said system communicates information on an as requested basis that goes beyond the telephone number and includes address information and non-address and non-telephone number information.
19. A system as in any of the preceding system claims, wherein said system provides instant savings with voice coupons published by local merchants.
20. A system as in any ofthe preceding system claims, wherein said system provide an advertising free initial experience where voice or audio coupons are only heard attached to businesses that the caller has requested or searched for.
21. A system as in any ofthe preceding system claims, wherein said system provide for hands-free navigation with voice commands on any telephone or device supporting telephony.
22. A system as in any of the preceding system claims, wherein said system provides a low cost 411-type (or equivalent international operator or non-operator assisted) directory assistance or information with added informational features that is easy to access.
23. A system as in any of the preceding system claims, wherein said system provides benefits to merchants including but not limited to targeted reach, instant promotion, instant or near-term feedback, and an optional free Internet web presence.
24. A system as in any of the preceding system claims, wherein said system provides benefits to common carriers and telephone companies who save conventional
411 costs, process higher call volumes, and attract new customers.
25. A system as in any of the preceding system claims, wherein said system provides the closest locations for a particular requested category where the location of the caller is known from a caller location input, cellular signal triangulation, GPS position determination, or other position or proximity location means.
26. A system as in any of the preceding system claims, wherein said system provides means for a merchant to interact with the system using either voice or web interface and select templates for the type of social, economic, political, age, gender, profession, or other image the merchant wants to portray and the type of promotion message the merchant wants to publish with a small number of mouse clicks, key strokes, or voice commands and prompts.
27. A system as in any of the preceding system claims, wherein said system provides means for establishing user groups and communities based on lifestyles, usage patterns, interests and interest levels so that a registered user can subscribe to a group of multiple groups where merchant listings and other relevant information is given a priority.
28. A system as in any of the preceding system claims, wherein said system provides means for obtaining ratings, in which once a caller gets connected to the business through the talk411 service system, after a period of time measured in hours the service calls back the caller to ask for ratings or to collect feedback to improve service, where the caller has either registered to permit this inquiry or does not have caller ID blocked.
29. A system as in any ofthe preceding system claims, wherein said system posts customer testimonials so that future callers can hear these messages as a reference that may help make a choice of which merchant they want to be connected with, and optionally, as the service gets used callers can leave testimonial messages which the business can choose to post for other users access.
30. A system as in any of the preceding system claims, wherein said system provides means for merchants to post key words on the voice system or internet site which can be used as a search term by the caller and that will be used as a navigation pointer to the posting merchant.
31. A system as in any ofthe preceding system claims, wherein said system allows merchants to post their promotional message or company information in multiple ways including to record their own, select from a voice talent who would record the text the merchant put in the system using voice, a personal computer, or in any other way; or just type in an let a text-to-speech processor convert it.
32. A system as in any of the preceding system claims, wherein said system provides a coupon aggregation and translation engine and service allowing aggregation of different formatted coupons from online sites and reformat such coupons to a standard format that allows them to be played over the telephone.
33. A system as in any of the preceding system claims, wherein said system provides user call back to remind the user to rate a recently used local merchant service.
34. A system as in any of the preceding system claims, wherein said system provides means for an over the telephone offer for a user to become an instant member to a community or coupon distribution list using email or other communication means once the user asks for a specific category.
35. A system as in any of the preceding system claims, wherein said system provides merchants the ability to identify specials in their "additional information" or
"coupon message" that is used to find that merchant when those words are used over the phone in the key word search mode
36. A system as in any of the preceding system claims, wherein said system provides system and method for publishing secret words in local newspapers, Internet chat rooms and other community oriented online and offline boards, where they can be used on the phone or on our web site as a password to enter a sweepstake or win a prize, and to thereby provide a beneficial marketing tactic to increase sales of print papers and increase traffic to online portals.
37. A system as in any of the preceding system claims, wherein said system provides voice coupon targeting based on area code and prefix, city, geographically coded location, GPS location, zip code, cross streets, vicinity of a milestone, major tourist areas, major landmarks, aiφorts, night clubs, entertainment centers, shopping malls, restaurants, and the like.
38. A system as in any of the preceding system claims, wherein said system provides interface and connection for the participation of a live agent interaction on a random call basis to provide a suφrise element to enhance the user experience, where live agents can be celebrities.
39. A system as in any of the preceding system claims, wherein said system provides an interface for the user to choose or for automatic choice of voice of the Talk411 attendant based on gender, age, interest, and other selection criteria.
40. A system as in any of the preceding system claims, wherein said system provides user choice to select synthesized voice of a celebrity as the automated attendant.
41. A system as in any of the preceding system claims, wherein said system provides system and method for insertion of trivia questions where correct answer wins a prize from a local merchant.
42. A business model for a business in which consumers call into a service using an ordinary telephone and make requests in plain speech for information and positive referrals on goods and/or services, and the service provides responses to the request in plain speech in real-time over the same telephone.
43. The business model in claim 42, further including providing a facility for a business to communicate a self-promotion ofthe business to the requestor.
44. The business model in claim 42, further including providing an audio promotional coupon to a requestor when the requestor completes a call to a business using the service.
45. The business model in claim 42, further including providing an audio promotional coupon to a requestor when the requestor completes a call to a business using the service.
46. The business model in claim 42, further comprising a voice coupon feature in which voice coupons are sold for distribution, a monthly fee is charged for subscription to the basic service, and additional charges are levied and collected for business category sponsorship.
47. The business model in claim 12, further comprising partnership features in which the providing organization partners with a print yellow page or other business directory publisher and/or with direct marketing organizations to subscribe merchants, businesses, individual professionals, or other organizations.
48 . The business model in claim 12, further comprising including organization partners selected from the set consisting of yellow page providers, wireless providers, telephone companies, and conventional 411 call centers, to partner and generate call traffic and thereby increase revenue.
49. The business model in claim 12, further comprising replacing conventional 411 directory assistance providers by the inventive system to save carriers 411 costs and to offer a new shared revenue channel and business model.
50. The business model in claim 12, further comprising soliciting new customers and providing them with value added services.
51. The business model in claim 12, further comprising providing direct marketers with a new coupon or other promotion distribution channel.
52. The business model in claim 12, further comprising bringing a new business to direct marketing organizations through a voice channel ofthe inventive system.
53. The business model in claim 12, further comprising providing direct marketing organizations are with rapid marketing feedback for their clients and customers.
54. The business model in claim 12, further comprising providing direct marketing organizations with rapid marketing feedback within 24 hours for their clients and customers.
55. The business model in claim 12, further comprising providing increasing the revenues of print yellow page and print business directory publishers by increasing the yellow page or directory ad size sold by virtue of ad space required by a trademark and/or "voice coupon" icon or logo.
56. The business model in claim 12, further comprising providing a business model in which print yellow page or other print directory publishers are provided with increased usage of the yellow pages or directory to show which vendors provide immediate savings.
57. The business model in claim 12, further comprising providing a business model in which yellow page or other print directory publishers are provided with an enhancement in their market positioning by virtue of their providing an more complete and compelling offering.
58. The business model in claim 12, further comprising means for facilitating spread of secret words through the word of mouth initiating from someone in the company to provide access to privileged information, prizes, etc. to enhance the repeat user experience
59. The business model in claim 12, further comprising an information and/or 411 directory assistance call center including an added revenue stream to improve profit margins, to increase call volume by providing a desirable information service, to keep their current carrier customers with value added services, to decrease their operating costs and overhead by reducing the number of human employees, and by overcoming severe local competition on wireless carriers.
60. A data-base access method comprising: receiving a speech utterance from a user from a communication device; converting the speech utterance to a digital symbol representation understandable by a computer and extracting commands and optional data from said converted digital symbol representation; searching a database for a particular data item in response to said extracted command and data; generating a speech-based representation of said particular data item identified in said database search; and serving said speech-based representation of said particular data item to said communication device.
61. The method in claim 60, wherein said communication device comprises a voice/speech input device.
62. The method in claim 60, further comprising communicating and validating a promotional audio coupon to the user at the time said speech-based representation is served to said communication device.
63. The method in claim 60, further comprising receiving rating inputs from users and for providing processed ratings users in response to said speech utterances.
64. The method in claim 60, wherein said received speech utterance comprises natural human speech, and said converting comprises natural language speech processing to extract said commands and data.
65. The method in claim 60, wherein said received speech utterance comprises computer synthesized human speech, and said converting comprises voice recognition speech processing to extract said commands and data.
66. The method in claim 60, wherein said method provides audio coupons that operate as incentives for consumers to use the inventive method.
67. The method in claim 60, wherein said method provides means for submitting and retrieving ratings for goods and/or services.
68. The method in claim 60, wherein said method assists in providing directory driven wireless commerce.
69. The method in claim 60, wherein said method provide a promotion and advertising channel that has geographical and sociological reach and the speed needed in today's dynamic financial and commercial markets.
70. The method in claim 60, wherein said method provide a voice-interactive dynamic market place where individuals call to save and businesses call to publish sales promotions in real-time or near real-time.
71. The method in claim 60, wherein said method communicates information on an as requested basis that goes beyond the telephone number and includes address information and non-address and non-telephone number information.
72. The method in claim 60, wherein said method provides instant savings with voice coupons published by local merchants.
73. The method in claim 60, wherein said method provide an advertising free initial experience where voice or audio coupons are only heard attached to businesses that the caller has requested or searched for.
74. The method in claim 60, wherein said method provide for hands-free navigation with voice commands on any telephone or device supporting telephony.
75. The method in claim 60, wherein said method provides a low cost 411 -type (or equivalent international operator or non-operator assisted) directory assistance or information with added informational features that is easy to access.
76. The method in claim 60, wherein said method provides benefits to merchants including but not limited to targeted reach, instant promotion, instant or near-term feedback, and an optional free Internet web presence.
77. The method in claim 60, wherein said method provides benefits to common carriers and telephone companies who save conventional 411 costs, process higher call volumes, and attract new customers.
78. The method in claim 60, wherein said method provides the closest locations for a particular requested category where the location ofthe caller is known from a caller location input, cellular signal triangulation, GPS position determination, or other position or proximity location means.
79. The method in claim 60, wherein said method provides means for a merchant to interact with the method using either voice or web interface and select templates for the type of social, economic, political, age, gender, profession, or other image the merchant wants to portray and the type of promotion message the merchant wants to publish with a small number of mouse clicks, key strokes, or voice commands and prompts.
80. The method in claim 60, wherein said method provides means for establishing user groups and communities based on lifestyles, usage patterns, interests and interest levels so that a registered user can subscribe to a group of multiple groups where merchant listings and other relevant information is given a priority.
81. The method in claim 60, wherein said method provides means for obtaining ratings, in which once a caller gets connected to the business through the talk411 service method, after a period of time measured in hours the service calls back the caller to ask for ratings or to collect feedback to improve service, where the caller has either registered to permit this inquiry or does not have caller ID blocked.
82. The method in claim 60, wherein said method posts customer testimonials so that future callers can hear these messages as a reference that may help make a choice of which merchant they want to be connected with, and optionally, as the service gets used callers can leave testimonial messages which the business can choose to post for other users access.
83. The method in claim 60, wherein said method provides means for merchants to post key words on the voice system or internet site which can be used as a search term by the caller and that will be used as a navigation pointer to the posting merchant.
84. The method in claim 60, wherein said method allows merchants to post their promotional message or company information in multiple ways including to record their own, select from a voice talent who would record the text the merchant put in the system using voice, a personal computer, or in any other way; or just type in an let a text-to-speech processor convert it.
85. The method in claim 60, wherein said method provides a coupon aggregation and translation engine and service allowing aggregation of different formatted coupons from online sites and reformat such coupons to a standard format that allows them to be played over the telephone.
86. The method in claim 60, wherein said method provides user call back to remind the user to rate a recently used local merchant service.
87. The method in claim 60, wherein said method provides means for an over the telephone offer for a user to become an instant member to a community or coupon distribution list using email or other communication means once the user asks for a specific category.
88. The method in claim 60, wherein said method provides merchants the ability to identity specials in their "additional information" or "coupon message" that is used to find that merchant when those words are used over the phone in the key word search mode
89. The method in claim 60, wherein said method provides means for publishing secret words in local newspapers, Internet chat rooms and other community oriented online and offline boards, where they can be used on the phone or on our web site as a password to enter a sweepstake or win a prize, and to thereby provide a beneficial marketing tactic to increase sales of print papers and increase traffic to online portals.
90. The method in claim 60, wherein said method provides voice coupon targeting based on area code and prefix, city, geographically coded location, GPS location, zip code, cross streets, vicinity of a milestone, major tourist areas, major landmarks, aiφorts, night clubs, entertainment centers, shopping malls, restaurants, and the like.
91. The method in claim 60, wherein said method provides interface and connection for the participation of a live agent interaction on a random call basis to provide a suφrise element to enhance the user experience, where live agents can be celebrities.
92. The method in claim 60, wherein said method provides an interface for the user to choose or for automatic choice of voice of the Talk411 attendant based on gender, age, interest, and other selection criteria.
93. The method in claim 60, wherein said method provides user choice to select synthesized voice of a celebrity as the automated attendant.
94. The method in claim 60, wherein said method provides means for insertion of trivia questions where correct answer wins a prize from a local merchant.
95. A computer program product for use in conjunction with a computer system, the computer system including a processor and memory associated with said processor for executing instructions, said computer program product comprising a computer readable storage medium and a computer program mechanism embedded therein, the computer program mechanism providing a voice-recognition based data matching and retreival capability, the computer program mechanism comprising: a program module that directs the computer system, to function in a specified manner, the program module including instructions for: receiving a speech utterance from a user from a communication device; converting the speech utterance to a digital symbol representation understandable by a computer and extracting commands and optional data from said converted digital symbol representation; searching a database for a particular data item in response to said extracted command and data; generating a speech-based representation of said particular data item identified in said database search; and serving said speech-based representation of said particular data item to said communication device.
96. The computer program product of claim 95, wherein said computer system comprises a computer server.
97. The computer program product of claim 96, wherein said computer system comprises a hardware/software based voice recognition engine.
98. The computer program product of claim 97, wherein said program module further includes instructions for providing a voice coupon to a merchant coupled with a request and a match to the database.
99. The computer program product of claim 98, wherein said communication device comprises a telephone and said user communicates with voice to affect said communication.
100. The computer program product in claim 99, wherein said program module further includes instructions for identifying a geographical region of said user and for matching to businesses within said database matching a geographical proximity to the caller.
101. A voice-coupon data structure comprising: a business name data field stored in said data structure; a business phone number data field stored in said data structure; a business promotional data filed storing a voice- or voice renderable- promotional offer upon access to said business name and business telephone number.
102. The voice-coupon data structure in Claim 101, further comprising a computer receiving a request for said business name, and communication link for coupong said database to said computer.
PCT/US2001/009705 2000-03-24 2001-03-26 Voice-interactive marketplace providing time and money saving benefits and real-time promotion publishing and feedback WO2001073750A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
AU2001249478A AU2001249478A1 (en) 2000-03-24 2001-03-26 Voice-interactive marketplace providing time and money saving benefits and real-time promotion publishing and feedback
CA002404343A CA2404343A1 (en) 2000-03-24 2001-03-26 Voice-interactive marketplace providing time and money saving benefits and real-time promotion publishing and feedback

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US09/534,700 US6658389B1 (en) 2000-03-24 2000-03-24 System, method, and business model for speech-interactive information system having business self-promotion, audio coupon and rating features
US09/534,700 2000-03-24
US21907900P 2000-07-18 2000-07-18
US60/219,079 2000-07-18

Publications (2)

Publication Number Publication Date
WO2001073750A2 true WO2001073750A2 (en) 2001-10-04
WO2001073750A3 WO2001073750A3 (en) 2002-01-24

Family

ID=26913539

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2001/009705 WO2001073750A2 (en) 2000-03-24 2001-03-26 Voice-interactive marketplace providing time and money saving benefits and real-time promotion publishing and feedback

Country Status (3)

Country Link
AU (1) AU2001249478A1 (en)
CA (1) CA2404343A1 (en)
WO (1) WO2001073750A2 (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003036918A2 (en) * 2001-10-19 2003-05-01 Siemens Aktiengesellschaft Distributed interactive speech recognition system
GB2386289A (en) * 2002-03-04 2003-09-10 Rockwell Electronic Commerce Intelligent interactive voice response for call centres
WO2004021688A1 (en) * 2002-08-30 2004-03-11 Telefonaktiebolaget L M Ericsson Intelligent peripheral for speech recognition in networks
GB2393604A (en) * 2002-09-24 2004-03-31 Rockwell Electronic Commerce Translating media type of agent communication based upon agent preference
EP1619663A1 (en) * 2004-07-13 2006-01-25 Hewlett-Packard Development Company, L.P. Speech enabled computing system
US9239987B1 (en) 2015-06-01 2016-01-19 Accenture Global Services Limited Trigger repeat order notifications
US9773286B2 (en) 2012-03-14 2017-09-26 Accenture Global Services Limited System for providing extensible location-based services
US9799067B2 (en) 2008-02-11 2017-10-24 Accenture Global Services Limited Point of sale payment method
US9858614B2 (en) 2015-04-16 2018-01-02 Accenture Global Services Limited Future order throttling
US10146587B2 (en) 2006-08-24 2018-12-04 Accenture Global Services Limited Future locking of resources
US10176494B2 (en) 2004-02-27 2019-01-08 Accenture Global Services Limited System for individualized customer interaction
CN110992952A (en) * 2019-12-06 2020-04-10 安徽芯智科技有限公司 AI vehicle-mounted voice interaction system based on RTOS
US10650437B2 (en) 2015-06-01 2020-05-12 Accenture Global Services Limited User interface generation for transacting goods
US20210019787A1 (en) * 2011-05-10 2021-01-21 Soundhound, Inc. Query-specific targeted ad delivery

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113645124B (en) * 2021-08-09 2022-09-30 南京易米云通网络科技有限公司 System and method for pushing social platform information through intelligent voice interaction platform

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5884266A (en) * 1997-04-02 1999-03-16 Motorola, Inc. Audio interface for document based information resource navigation and method therefor
US5903652A (en) * 1996-11-25 1999-05-11 Microsoft Corporation System and apparatus for monitoring secure information in a computer network
US5991739A (en) * 1997-11-24 1999-11-23 Food.Com Internet online order method and apparatus
US6055513A (en) * 1998-03-11 2000-04-25 Telebuyer, Llc Methods and apparatus for intelligent selection of goods and services in telephonic and electronic commerce
US6134548A (en) * 1998-11-19 2000-10-17 Ac Properties B.V. System, method and article of manufacture for advanced mobile bargain shopping

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5903652A (en) * 1996-11-25 1999-05-11 Microsoft Corporation System and apparatus for monitoring secure information in a computer network
US5884266A (en) * 1997-04-02 1999-03-16 Motorola, Inc. Audio interface for document based information resource navigation and method therefor
US5991739A (en) * 1997-11-24 1999-11-23 Food.Com Internet online order method and apparatus
US6055513A (en) * 1998-03-11 2000-04-25 Telebuyer, Llc Methods and apparatus for intelligent selection of goods and services in telephonic and electronic commerce
US6134548A (en) * 1998-11-19 2000-10-17 Ac Properties B.V. System, method and article of manufacture for advanced mobile bargain shopping

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003036918A3 (en) * 2001-10-19 2003-09-18 Siemens Ag Distributed interactive speech recognition system
WO2003036918A2 (en) * 2001-10-19 2003-05-01 Siemens Aktiengesellschaft Distributed interactive speech recognition system
GB2386289B (en) * 2002-03-04 2005-06-22 Rockwell Electronic Commerce Intelligent interactive voice response unit
GB2386289A (en) * 2002-03-04 2003-09-10 Rockwell Electronic Commerce Intelligent interactive voice response for call centres
US7606713B2 (en) 2002-08-30 2009-10-20 Telefonaktiebolaget L M Ericsson (Publ) Intelligent peripheral for speech recognition in networks
GB2407737B (en) * 2002-08-30 2006-05-17 Ericsson Telefon Ab L M Intelligent peripheral for speech recognition in networks
WO2004021688A1 (en) * 2002-08-30 2004-03-11 Telefonaktiebolaget L M Ericsson Intelligent peripheral for speech recognition in networks
GB2393604A (en) * 2002-09-24 2004-03-31 Rockwell Electronic Commerce Translating media type of agent communication based upon agent preference
GB2393604B (en) * 2002-09-24 2005-11-09 Rockwell Electronic Commerce Media translator for transaction processing system
US7058578B2 (en) 2002-09-24 2006-06-06 Rockwell Electronic Commerce Technologies, L.L.C. Media translator for transaction processing system
US10176494B2 (en) 2004-02-27 2019-01-08 Accenture Global Services Limited System for individualized customer interaction
EP1619663A1 (en) * 2004-07-13 2006-01-25 Hewlett-Packard Development Company, L.P. Speech enabled computing system
US10146587B2 (en) 2006-08-24 2018-12-04 Accenture Global Services Limited Future locking of resources
US10089677B2 (en) 2008-02-11 2018-10-02 Accenture Global Services Limited Point of sale payment method
US9799067B2 (en) 2008-02-11 2017-10-24 Accenture Global Services Limited Point of sale payment method
US20210019787A1 (en) * 2011-05-10 2021-01-21 Soundhound, Inc. Query-specific targeted ad delivery
US9773286B2 (en) 2012-03-14 2017-09-26 Accenture Global Services Limited System for providing extensible location-based services
US9858614B2 (en) 2015-04-16 2018-01-02 Accenture Global Services Limited Future order throttling
US10007947B2 (en) 2015-04-16 2018-06-26 Accenture Global Services Limited Throttle-triggered suggestions
US9760833B2 (en) 2015-06-01 2017-09-12 Accenture Global Services Limited Trigger repeat order notifications
US9239987B1 (en) 2015-06-01 2016-01-19 Accenture Global Services Limited Trigger repeat order notifications
US10650437B2 (en) 2015-06-01 2020-05-12 Accenture Global Services Limited User interface generation for transacting goods
CN110992952A (en) * 2019-12-06 2020-04-10 安徽芯智科技有限公司 AI vehicle-mounted voice interaction system based on RTOS

Also Published As

Publication number Publication date
CA2404343A1 (en) 2001-10-04
WO2001073750A3 (en) 2002-01-24
AU2001249478A1 (en) 2001-10-08

Similar Documents

Publication Publication Date Title
US9002712B2 (en) Voice-interactive marketplace providing promotion and promotion tracking, loyalty reward and redemption, and other features
US20020035474A1 (en) Voice-interactive marketplace providing time and money saving benefits and real-time promotion publishing and feedback
US20030125958A1 (en) Voice-interactive marketplace providing time and money saving benefits and real-time promotion publishing and feedback
US6658389B1 (en) System, method, and business model for speech-interactive information system having business self-promotion, audio coupon and rating features
US6895084B1 (en) System and method for generating voice pages with included audio files for use in a voice page delivery system
US7457397B1 (en) Voice page directory system in a voice page creation and delivery system
US6707889B1 (en) Multiple voice network access provider system and method
US6501832B1 (en) Voice code registration system and method for registering voice codes for voice pages in a voice network access provider system
EP1269732B1 (en) Interacting with a data network using a telephonic device
US7376586B1 (en) Method and apparatus for electronic commerce using a telephone interface
US6400806B1 (en) System and method for providing and using universally accessible voice and speech data files
KR100870798B1 (en) Natural language processing for a location-based services system
US8122057B2 (en) System and method for the transformation and canonicalization of semantically structured data
US20020126813A1 (en) Phone based rewards programs method and apparatus prepared by tellme networks, Inc
KR100585347B1 (en) A method of providing location-based services and a location-based services system
KR20020093852A (en) System and method for voice access to internet-based information
WO2001073750A2 (en) Voice-interactive marketplace providing time and money saving benefits and real-time promotion publishing and feedback
US8131555B1 (en) System and method for funneling user responses in an internet voice portal system to determine a desired item or service
AU2001247456A1 (en) System and method for voice access to internet-based information
WO2001071609A2 (en) System and method for advertising with an internet voice portal
WO2001050453A2 (en) Interactive voice response system
Duggan Revenue Opportunities in the Voice Enabled Web
Duggan Strategies for Enterprise Voice Enabled Web Projects
JP2002279199A (en) Electronic reception system and method therefor

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWE Wipo information: entry into national phase

Ref document number: 2404343

Country of ref document: CA

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase in:

Ref country code: JP