United States Patent (10) Patent N0.: US 8,498,871 B2 Miglietta Et A]
Total Page:16
File Type:pdf, Size:1020Kb
US008498871B2 (12) United States Patent (10) Patent N0.: US 8,498,871 B2 Miglietta et a]. (45) Date of Patent: Jul. 30, 2013 (54) DYNAMIC SPEECH RECOGNITION AND (56) References Cited TRANSCRIPTION AMONG USERS HAVING HETEROGENEOUS PROTOCOLS U.S. PATENT DOCUMENTS 4,829,576 A 5/1989 Porter (75) Inventors: Joseph H. Miglietta, Scottsdale, AZ 4,914,704 A 4/1990 Cole et al. 5,031,113 A 7/1991 Hollerbauer (US); Michael K. Davis, Phoenix, AZ 5,220,611 A 6/1993 Nakamura et a1. (Us) 5,355,472 A 10/1994 Lewis 5,572,643 A 11/1996 Judson (73) Assignee: Advanced Voice Recognition Systems, (Continued) Inc., Scottsdale, AZ (US) FOREIGN PATENT DOCUMENTS ( * ) Notice: Subject to any disclaimer, the term of this EP 1 136 983 Al 9/2001 patent is extended or adjusted under 35 OTHER PUBLICATIONS U.S.C. 154(b) by 247 days. Notice of Publication issued Oct. 29, 2009 in US. Appl. No. (21) Appl. No.: 13/115,105 12/497,675. (Continued) Filed: May 24, 2011 (22) Primary Examiner * Vijay B ChaWan (65) Prior Publication Data (74) Attorney, Agent, or Firm * Lee G. Meyer, Esq.; Meyer & Associates, LLC US 2011/0224981A1 Sep. 15,2011 (57) ABSTRACT A system for facilitating free form dictation, including directed dictation and constrained recognition and/or struc Related US. Application Data tured transcription among users having heterogeneous proto (63) Continuation-in-part of application No. 12/497,675, cols for generating, transcribing, and exchanging recognized ?led on Jul. 5, 2009, noW Pat. No. 7,949,534, Which is and transcribed speech. The system includes a system trans a continuation of application No. 11/824,794, ?led on action manager having a “system protocol,” to receive a Jul. 3, 2007, noW Pat. No. 7,558,730, Which is a speech information request from an authorized user. The continuation of application No. 09/996,849, ?led on speech information request is generated using a user interface Nov. 27, 2001, noW abandoned. capable of bi-directional communication With the system transaction manager and supporting dictation applications. A (51) Int. Cl. speech recognition and/or transcription engine (ASR), in G01L 21/00 (2006.01) communication With the systems transaction manager, (52) US. Cl. receives the speech information request generates a tran USPC .......... .. 704/270.1; 704/9; 704/270; 704/254; scribed response, and transmits the response to the system 704/10; 704/257; 709/228; 709/230; 719/328 transaction manager. The system transaction manager routes (58) Field of Classi?cation Search the response to one or more of the users. In another embodi USPC ................ .. 704/235, 270, 270.1, 9, 243, 254, ment, the system employs a virtual sound driver for streaming 704/10, 272, 252, 257; 709/228, 230; 719/328 free form dictation to any ASR. See application ?le for complete search history. 17 Claims, 4 Drawing Sheets R 90 usEsaAPPucAnm U55 g0 SEEVICEADAPTER 96 f” 100 / ppzmpwLNEW J APPLICATION 22' PROGRAM INTERFACE I USER LEGACY ROUTING same PROTOCOL A9‘ ADAPTER INTERFACE 88 pAseiHToueu PROTOCOL INTERFACE 94 247$ 92 98 —102 a‘ 404 as?“ AUTHORIZATION _11 2 ABRAPPLICATION W110“ - smrezcoewnoue 116 SERWCEADAWER MANAGER will; 122 mmscmmoumeme 30, 1 10 LEGACY | l INTERFACEAEA m1" PROTOCOL NEW m APPLICATION 124 PROTOCOL wee/01 INTERFACE SPEECH UNJFORM 118 ASA SERVICE 5Y$TEH PASSTHIZOUGH ADAPTER PROTOCOL \NTEEFACE 32' 126 as 1 14 US 8,498,871 B2 Page 2 US. PATENT DOCUMENTS Information disclosure Statement ?led Jul. 20, 2007 in US . Appl. No. 11/824,794. 5,799,273 A 8/1998 Mitchell et al. Notice of Publication issued Oct. 25, 2007 in US. Appl. No. 5,960,447 A 9/1999 Holt et al. 6,167,395 A 12/2000 Becketal. 11/824,794. 6,170,011 B1 1/2001 MacleodBecket al. Final Rejection issued Dec. 27, 2007 in US. Appl. No. 11/824,794. 6,292,480 B1 9/2001 May Examiner Interview Summary issued Apr. 9, 2008 in US. Appl. No. 6,298,326 B1 10/2001 Feller 11/824,794. 6,311,159 B1 10/2001 Van Tichelen etal. Letter to Applicant issued Apr. 14, 2009 in US. Appl. No. 6,434,526 B1 8/2002 CilurZo 11/824,794. 6,449,588 B1* 9/2002 Bowman-Amuah .......... .. 703/21 Amendment under 37 CFR 1.111 ?led Sep. 15, 2008 in US. Appl. 6,748,361 B1 6/2004 Comerford et a1. No. 11/824,794. 7,137,126 B1* 11/2006 Coffman et al. ............ .. 719/328 Notice of Allowance issued Mar. 6, 2009 in US. Appl. No. 7,606,712 B1* 10/2009 Smith et al. .............. .. 704/2701 11/824,794. 2002/0143874 A1 10/2002 Marquette et al. Issue Noti?cation issued Jun. 17, 2009 in US. Appl. No. 11/824,794. 2002/0156900 A1 10/2002 Marquette et al. Information Disclosure Statement ?led Jun. 4, 2002 in US. Appl. No. 2002/0184373 A1* 12/2002 Maes .......................... .. 709/228 09/996,849. 2003/0088421 A1* 5/2003 Maes et al. 704/270.1 Non-Final rejection issued Jul. 15, 2004 in US. Appl. No. 2006/0041431 A1* 2/2006 Maes ........ .. 704/270.1 09/996,849. 2007/0043574 A1* 2/2007 Coffman et al. ............ .. 704/275 Amendment under 37 CFR 1.111 ?led Mar. 11, 2005 in US. Appl. OTHER PUBLICATIONS No. 09/996,849. Abandonment issued Oct. 6, 2005 in US. Appl. No. 09/996,849. Non-Final Rejection issued Aug. 23, 2010 in US. Appl. No. Petition to Revive ?led Oct. 12, 2005 in US. Appl. No. 09/996,849. 12/497,675. Petition Decision issued Nov. 16,2005 in US. Appl. No. 09/996,849. Final Rejection issued Feb. 14, 2006 in US. Appl. No. 09/996,849. Andrew S. Tanenbaum, Computer Networks, Vrije University, Amendment under 37 CFR 1.116 ?led May 15, 2006 in US. Appl. Amsterdam, The Netherlands, 1981 (Aug. 23, 2010 in US. Appl. No. No. 09/996,849. 12/497,675). Examiner Interview Summary issued May 26, 2006 in U. S. Appl. No. Internetworking Technologies Handbook, The X.25 Protocol Suite 09/996,849. (Aug. 23, 2010 in US. Appl. No. 12/497,675). Advisory Action issued Jun. 26, 2006 in US. Appl. No. 09/996,849. Marc E. FiucZynski,The Design and Implementation of IPv6/ Final Rejection issued Jul. 18, 2006 in US. Appl. No. 09/996,849. 1Pv4 . , USENIX Annual Tech. Conf., Jun. 1998(Aug. 23, 2010 in Amendment under 37 CFR 1.116 ?led Sep. 25, 2006 in US. Appl. US. Appl. No. 12/497,675). No. 09/996,849. Amendment under 37 CFR 1.111 ?led Dec. 22, 2010 in US. Appl. Advisory Action issued Oct. 20, 2006 in US. Appl. No. 09/996,849. No. 12/497,675. Request for Continued Examinaiton ?led Dec. 18, 2006 in U. S. Appl. Notice of Allowance issued Jan. 13, 2011 in US. Appl. No. No. 09/996,849. 12/497,675. Final Rejection issued Apr. 3, 2007 in US. Appl. No. 09/996,849. Issue Noti?cation issued May 4, 2011 in US. Appl. No. 12/497,675. Abandonment issued Nov. 16, 2007 in US. Appl. No. 09/996,849. Notice ofNew Revised Publication Date issued Jul. 19, 2007 in US. Appl. No. 11/824,794. * cited by examiner US. Patent Jul. 30, 2013 Sheet 1 of4 US 8,498,871 B2 SPEECH RECOGNITION & TRANSCRIPTION USER INTERFACE APPLICATION NATIVE APPLICATION PROTOCOL 1—54 NATIVE COMMUNICATION 1_56 1_ PROTOCOL APPLICATION SERVICE ADAPTER L, 8_2’ USER SERVICE ADAPTER UNIFORI'I SYSTEM PROTOCOL 158 TRANSPORT LAYER I—Q FIG. 4 US. Patent Jul. 30, 2013 Sheet 3 of4 US 8,498,871 B2 US. Patent Jul. 30, 2013 Sheet 4 of4 US 8,498,871 B2 GYGTEM TRANGAOTION MANAGER TRANGPORT LAYER L80 L32 uNIEORM GYGTEM PROTOOOL LAYER L94 APPLIOATION cORREcTIONIGT TRANGORIPTION L98 30., PORTAL PORTAL 186 PORTAL —— WORKFLOW cOMPoNENT 1—90 JOB LOGIO LAYER 1-92 9 1_96 DATA ACCESS LAYER H <3 DATA GOLIROE is TAGI< MANAGER I OuEuED JOB BINS ‘2&0 Q2 POGT PROOEGGING MANAGER POsT PROOEGG API LAYER -2-0—4 FIG 5 SPEECH RECOGNITION & TRANSCRIPTION GERvER JOB OuEuE g PIPELINE MANAGER L21 WORKFLOW PIPELINE % GPEEOH GERvIcE ADAPTER @ 220 TRANGPORT LAYER Q UNIFORM SYSTEM PROTOOOL LAYER 2 AUDIO PREPROOEGG sERvIOE ADAPTER @ vENDOR ,@ VENDOR @ AUDIO E INDEPENDENT DEPENDENT < > PRE/POGTPROOEGG APE INTERFAOE APE INTEREAOE ENGINE WORKFLOW OONTROLLER @ AGR APPLIOATION GERvIOE ADAPTER 84. VENDOR 24o VENDOR 242 _ SPEECH 2 INDEPENDENT — DEPENDENT _ < > iEJICjOGNITION AGR INTEREAOE AGR INTEREAOE TRANGORIPTION (AGR) ENGINE FIG. 6 US 8,498,871 B2 1 2 DYNAMIC SPEECH RECOGNITION AND tures for users of the system, are usually more ef?cient than a TRANSCRIPTION AMONG USERS HAVING netWork of mere direct, point-to-point links betWeen indi HETEROGENEOUS PROTOCOLS vidual users. But universally available recognition databases, including CROSS-REFERENCE TO RELATED vocabulary databases and dictionaries, suffer from signi?cant APPLICATIONS ine?iciencies in facilitating communications betWeen users of a more centraliZed database system, especially if the dic The present application is a Continuation-In-Part Applica tation to be transcribed is “free form” or dynamic. Even tion ofU.S. application Ser. No. 12/497,675 ?led Jul. 5, 2009 though a recognition engine is very accurate in spoken Word for “SPEECH RECOGNITION AND TRANSCRIPTION (speech) recognition, the transcription may be ?lled With transcribed material Which is “out of context,” misinterpreted AMONG USERS HAVING HETEROGENEOUS PROTO or not formatted correctly. Simply stated, “garbage in-gar COLS,” now US. Pat. No. 7,949,534, Which is a Continua bage out.” tionApplication of US. application Ser. No. 11/824,794 ?led Thus, even though engine providers advertise in terms of Jul.