United States Patent (10) Patent No.: US 9,620,105 B2 Mason (45) Date of Patent: Apr

United States Patent (10) Patent No.: US 9,620,105 B2 Mason (45) Date of Patent: Apr

US0096.20105B2 (12) United States Patent (10) Patent No.: US 9,620,105 B2 Mason (45) Date of Patent: Apr. 11, 2017 (54) ANALYZING AUDIO INPUT FOR EFFICIENT 3,704,345 A 11/1972 Coker et al. SPEECH AND MUSIC RECOGNITION 3,710,321. A 1/1973 Rubenstein 3,828,132 A 8/1974 Flanagan et al. (71) Applicant: Apple Inc., Cupertino, CA (US) 4,013,0853,979,557 A 3/19779, 1976 SchulmanWright et al. 4,081,631 A 3, 1978 Feder (72) Inventor: Henry Mason, San Francisco, CA (US) (Continued) (73) Assignee: Apple Inc., Cupertino, CA (US) FOREIGN PATENT DOCUMENTS (*) Notice: Subject to any disclaimer, the term of this CH 681573 A5 4, 1993 patent is extended or adjusted under 35 CN 1673939 A 9, 2005 U.S.C. 154(b) by 149 days. (Continued) (21) Appl. No.: 14/500,740 OTHER PUBLICATIONS (22) Filed: Sep. 29, 2014 “Top 10 Best Practices for Voice User Interface Design” available O O at <http://www.developer.com/voice/article.php/1567051/Top-10 (65) Prior Publication Data Best-Practices-for-Voice-UserInterface-Design.htm>, Nov. 1, 2002, US 2015/0332667 A1 Nov. 19, 2015 4 pages. Related U.S. Application Data (Continued)Continued (60) Provisional application No. 61/993,709, filed on May 15, 2014. Primary Examiner — Susan McFadden (74) Attorney, Agent, or Firm — Morrison & Foerster (51) Int. Cl. LLP GIOL 25/8 (2013.01) GOL 5/02 (2006.01) GIL 25/03 (2013.01) (57) ABSTRACT (52) U.S. Cl. CPC .............. G10L 15/02 (2013.01); G10L 25/03 E.It isResis (2013.01); G 10L 25/812015/025 (2013.01); (2013.O1 G 10L process, an audio input can be received. A determination can ( .01) be made as to whether the audio input includes music. In (58) Field of Classification Search addition, a determination can be made as to whether the CPC ................................ G10L 15/02; G 10L 25/81 audio input includes speech. In response to determining that USPC ...... 704/249 the audio input includes music, an acoustic fingerprint See application file for complete search history. representing a portion of the audio input that includes music is generated. In response to determining that the audio input (56) References Cited includes speech rather than music, an end-point of a speech U.S. PATENT DOCUMENTS utterance of the audio input is identified. 1,559,320 A 10, 1925 Hirsh 2,180,522 A 11/1939 Henne 60 Claims, 4 Drawing Sheets Process RECEIVEAUDONPUT 101 DETERMINE WHETHER AUDIONPUT DETERMINE WHETHERAUDIO (NPUT IMCLUDESMUSIC INCLUDESSPEECH 13 13 CEASETO GENERATE DETERMINE PROCESSAUDIO ACOUSTC WHETHERAUDIO NPUTFOR FINGERPRINT NPUTMCLUDES SPEECH SPEECH 11 109 IDENTIFy NFERREDUSER END-POINT INTENTFROMSPEECH OFSPEECH INCLUDESIDENTIFYING MUSICR118 13 CEASE TO CEASETOGENERATEACUSTIC RECEIVE FINGERPRINT AUDINPUT 2 118 US 9,620,105 B2 Page 2 (56) References Cited 4,862.504 8, 1989 Nomura 4,875,187 10, 1989 Smith U.S. PATENT DOCUMENTS 4,878,230 10, 1989 Murakami et al. 4,887.212 12, 1989 Zamora et al. 4,090,216 5, 1978 Constable 4,896.359 1, 1990 Yamamoto et al. 4,107,784 8, 1978 Van Bemmelen 4,903,305 2, 1990 Gillick et al. 4,108,211 8, 1978 Tanaka 4.905,163 2, 1990 Garber et al. 4,159,536 6, 1979 Kehoe et al. 4,908,867 3, 1990 Silverman 4,181,821 1, 1980 Pirz et al. 4.914,586 4, 1990 Swinehart et al. 4,204,089 5, 1980 Key et al. 4.914,590 4, 1990 Loatman et al. 4,241,286 12, 1980 Gordon 4,918,723 4, 1990 Iggulden et al. 4,253,477 3, 1981 Eichman 4,926,491 5, 1990 Maeda et al. 4.278,838 T. 1981 Antonov 4,928.307 5, 1990 Lynn 4,282.405 8, 1981 Taguchi 4,935,954 6, 1990 Thompson et al. 4,310,721 1, 1982 Manley et al. 4,939,639 7, 1990 Lee et al. 4,332,464 6, 1982 Bartulis et al. 4.941,488 7, 1990 Marxer et al. 4,348,553 9, 1982 Baker et al. 4,944,013 7, 1990 Gouvianakis et al. 4,384,169 5, 1983 Mozer et al. 4,945,504 7, 1990 Nakama et al. 4,386,345 5, 1983 Narveson et al. 4,953, 106 8, 1990 Gansner et al. 4.433,377 2, 1984 Eustis et al. 4,955,047 9, 1990 Morganstein et al. 4.451,849 5, 1984 Fuhrer 4,965,763 10, 1990 Zamora 4.485439 11, 1984 Rothstein 4,972.462 11, 1990 Shibata 4,495,644 1, 1985 Parks et al. 4,974,191 11, 1990 Amirghodsi et al. 4.513,379 4, 1985 Wilson et al. 4,975,975 12, 1990 Filipski 4,513.435 4, 1985 Sakoe et al. 4,977,598 12, 1990 Doddington et al. 4,542,525 9, 1985 Hopf 4,980,916 12, 1990 Zinser 4,555,775 11, 1985 Pike 4,985,924 1, 1991 Matsuura 4,577,343 3, 1986 Oura 4,992,972 2, 1991 Brooks et al. 4,586,158 4, 1986 Brandle 4,994,966 2, 1991 Hutchins 4,587,670 5, 1986 Levinson et al. 4,994,983 2, 1991 Landell et al. 4,589,022 5, 1986 Prince et al. 5,003,577 3, 1991 Ertz et al. 4,611,346 9, 1986 Bednar et al. 5,007,095 4, 1991 Nara et al. 4,615,081 10, 1986 Lindahl 5,007,098 4, 1991 Kumagai 4,618,984 10, 1986 Das et al. 5,010,574 4, 1991 Wang 4,642,790 2, 1987 Minshull et al. 5,016,002 5, 1991 Levanto 4,653,021 3, 1987 Takagi 5,020, 112 5, 1991 Chou 4,654,875 3, 1987 Srihari et al. 5,021,971 6, 1991 Lindsay 4,655,233 4, 1987 Laughlin 5,022,081 6, 1991 Hirose et al. 4,658.425 4, 1987 Julstrom 5,027,110 6, 1991 Chang et al. 4,670,848 6, 1987 Schramm 5,027.406 6, 1991 Roberts et al. 4,677,570 6, 1987 Taki 5,027,408 6, 1991 Kroeker et al. 4,680,429 7, 1987 Murdock et al. 5,029,211 7, 1991 Ozawa 4,680,805 7, 1987 Scott 5,031,217 7, 1991 Nishimura 4,688,195 8, 1987 Thompson et al. 5,032,989 7, 1991 Tornetta 4,692,941 9, 1987 Jacks et al. 5,033,087 7, 1991 Bahl et al. 4,698.625 10, 1987 McCaskill et al. 5,040.218 8, 1991 Vitale et al. 4,709,390 11, 1987 Atal et al. 5,046,099 9, 1991 Nishimura 4,713,775 12, 1987 Scott et al. 5,047,614 9, 1991 Bianco 4,718,094 1, 1988 Bahl et al. 5,050,215 9, 1991 Nishimura 4,724,542 2, 1988 Williford 5,053,758 10, 1991 Cornett et al. 4,726,065 2, 1988 Froessl 5,054,084 10, 1991 Tanaka et al. 4,727,354 2, 1988 Lindsay 5,057,915 10, 1991 Von Kohorn 4,736,296 4, 1988 Katayama et al. 5,067,158 11, 1991 Arjmand 4,750,122 6, 1988 Kaji et al. 5,067,503 11, 1991 Stile 4,754,489 6, 1988 Bokser 5,072,452 12, 1991 Brown et al. 4,755,811 T. 1988 Slavin et al. 5,075,896 12, 1991 Wilcox et al. 4,776,016 10, 1988 Hansen 5,079,723 1, 1992 Herceg et al. 4,783,804 11, 1988 Juang et al. 5,083,119 1, 1992 Trevett et al. 4,783,807 11, 1988 Marley 5,083,268 1, 1992 Hemphill et al. 4,785,413 11, 1988 Atsumi 5,086,792 2, 1992 Chodorow 4,790,028 12, 1988 Ramage 5,090,012 2, 1992 Kajiyama et al. 4,797,930 1, 1989 Goudie 5,091,790 2, 1992 Silverberg 4,802,223 1, 1989 Lin et al. 5,091,945 2, 1992 Kleijn 4,803,729 2, 1989 Baker 5,103,498 4, 1992 Lanier et al. 4,807,752 2, 1989 Chodorow 5,109,509 4, 1992 Katayama et al. 4,811,243 3, 1989 Racine 5,111,423 5, 1992 Kopec, Jr. et al. 4,813,074 3, 1989 Marcus 5,119,079 6, 1992 Hube et al. 4,819,271 4, 1989 Bahl et al. 5,122,951 6, 1992 Kamiya 4,827,518 5, 1989 Feustel et al. 5,123,103 6, 1992 Ohtaki et al. 4,827,520 5, 1989 Zeinstra 5,125,022 6, 1992 Hunt et al. 4,829,576 5, 1989 Porter 5,125,030 6, 1992 Nomura et al. 4,829,583 5, 1989 Monroe et al. 5,127,043 6, 1992 Hunt et al. 4,831,551 5, 1989 Schalk et al. 5,127,053 6, 1992 Koch 4,833,712 5, 1989 Bahl et al. 5,127,055 6, 1992 Larkey 4,833,718 5, 1989 Sprague 5,128,672 7, 1992 Kaehler 4,837,798 6, 1989 Cohen et al. 5,133,011 7, 1992 McKiel, Jr. 4,837,831 6, 1989 Gillick et al. 5,133,023 7, 1992 Bokser 4,839,853 6, 1989 Deerwester et al. 5,142.584 8, 1992 Ozawa 4,852,168 7, 1989 Sprague 5,148,541 9, 1992 Lee et al. US 9,620,105 B2 Page 3 (56) References Cited 5,333,266 T/1994 Boaz et al. 5,333,275 T/1994 Wheatley et al. U.S. PATENT DOCUMENTS 5,335,011 8, 1994 Addeo et al.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    57 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us