the newsletter of the International Computer Science Institute

The ICSI GAZETTE

volume two | issue two | march 2004

featured alum: Dan Jurafsky by John Tortorice

Dan Jurafsky, outstanding ICSI alum, has had many triumphs - but not everything works out in science, not even for him. His first setback in technology came before he, or many others, had even set eyes on a computer. “When I was about 12 years old (in the 1970s), I read a newspaper story about how IBM had invented the Talking Typewriter, and the speech problem was completely solved,” said Jurafsky. “I was upset! That’s the work I wanted to do. I’m not sure where I got the idea, probably watching TV shows like the Jetsons in this or Lost in Space.” Jurafsky, an associate professor of linguistics at issue: , made the newspapers himself when he was among the 2002 winners of the MacArthur Fellowship, commonly known as the “genius grant.” page 2 The prestigious fellowship is awarded to individuals As I See It across the sciences, arts and public policy whose work by Nelson Morgan, Director has demonstrated singular vision and makes a unique contribution to their field. page 3 News Briefs The MacArthur Foundation selection committee cited Jurafsky for his work establishing “the foundations page 4 for developing systems that use natural language to interact with people.” Jurafsky’s research is in the

EARS Diane Starr field of computational linguistics and focuses on using computers to model how people use language and Dan Jurafsky page 6 on getting computers to understand humans. He has Visiting Scholars worked to identify patterns in syntax that provide clues “The early 90s were discouraging,” Jurafsky recalls. “All to the underlying semantic structure of communication. of a sudden everything I knew went out the window. page 7 Natural language processing changed dramatically. Publications The MacArthur Foundation was not the only Before that it was based on logic, and after that it organization to acknowledge his work. Jurafsky, 40, became based on statistics. So I had all the statistics and has already received a Career Award from the National probability theory to learn.” Science Foundation and published a highly respected textbook on the subject of computational linguistics. One option was to take a break and teach. Jurafsky had been thinking about Hong Kong. Jerry Feldman, then No one, Jurafsky included, could have predicted this ICSI director, had other ideas. He saw a keen intellect level of achievement in the early 1990s, when he found and a great breadth of scientific work in the thesis. himself mired in the post-doc blues after earning his PhD in computer science from UC-Berkeley. With a “I was a reader on his thesis committee and he had some provocative, but admittedly “messy,” dissertation behind remarkably good things in it,” says Feldman. “I was very him, Jurafsky saw his field suddenly and dramatically impressed. It was deep in both CS and psycholinguistics. take a detour. He understood it as well as anyone.” Continued on Page 5 Howell Shaw as i see it by Nelson Morgan, Director news briefs

It’s New Year’s Day as I write this column; an systems for pattern recognition, and we needed to artificial boundary, to be sure, but nonetheless a choose an application area. I had worked in speech good time to reflect on the past and coming year. I as well, primarily in speech analysis and synthesis. was struck recently by the good spirit that is evident Hervé was filled with enthusiasm, both for the topic in small things around here – as in the fun and of statistically based , and for a games that the financial administrators put together set of new ideas about using connectionist systems for this year’s Winterfest party, the foosball games to generate probabilities that could be used in that I’ve seen the students and visitors play in the recognition systems. He and I began working together hall, the weekly movie night, and the Kapla blocks on this topic, and ultimately developed a number of in our reception area that the Networking folks use related approaches. Just as importantly, we developed so much (and which the Speech researchers seem to a strong working relationship that was the core of the periodically knock down). As is often observed in partnership that exists today between ICSI and the basic research environments, great science can be Institute he now directs (IDIAP in Switzerland). accompanied by seemingly unrelated fun. I thought about the spirit of ICSI that results from these two A few years after Hervé’s initial visit, a key contributor parts as I put together this blend of newsworthy to the development of speech science and technology items and reminiscences. at ICSI was Dan Jurafsky, the subject of our Alumnus Interview in this issue. Dan was a primary designer ICSI’s character changes slowly, despite the more (with Chuck Wooters and Gary Tajchman) for the rapid shifts in visitor population Berkeley Restaurant Project (BeRP), a mixed initiative and specific topics of study. The spoken query system allowing users to find restaurants “As is often observed study of Information Society in Berkeley. At the time he was a postdoctoral Fellow (covered by the new BCIS at ICSI, but soon moved on to a faculty position at in basic research group) expanded to a range the University of Colorado. There he developed what of projects in 2003. As noted many consider to be the best textbook for natural environments, great in previous newsletters, this language processing. Last year, as noted in this science can be group’s work marks a radical Gazette, he received the MacArthur Foundation award change for ICSI, particularly (the so-called “Genius Grant”). He has continued to accompanied by great as its focus is at least as much work with ICSI researchers since his move to Colorado. fun.” societal as it is technological. This month he joined the Stanford faculty, and we are However, as in previous years, looking forward to many more collaborations with most shifts in 2003 were within him. groups that already existed. For instance, in the last few months, the Networking group began a In other ICSI news, this year we are welcoming new project in cybersecurity, and the Speech group several new members to our Board. Scott Shenker, began working on speaker identification. And in our Networking group leader and corporation Vice 2003, the Speech group also ramped up to a major President, has agreed to be a Trustee. Additionally, effort in the DARPA-funded Effective Affordable Shankar Sastry, current Chairman of the EECS Reusable Speech (EARS) program, the focus project Department of UC Berkeley, will join us, and has in this issue of the Gazette. We will also feature an already had a number of useful discussions with me interview with Dan Jurafsky, who had a key role in about improving the campus-ICSI relationship. Finally, earlier successes for the ICSI spoken language effort. we are very pleased to have recruited David Nagel to I want to say a few words about Dan, but thinking our Board. Dave was formerly the President of AT&T about his effect on us started me wandering a little Labs and CTO of AT&T, and currently is the CEO of bit further down ICSI’s memory lane … PalmSource, Inc., the software spin-off of Palm. He funded our networking work while at AT&T, and Speech processing (and particularly recognition) also has a strong understanding of human-machine has been a major research area at ICSI since its interface research, having led human factors research official inauguration in 1988. The catalyst for this at NASA in an earlier role. In addition to these new was Hervé Bourlard’s visit that year. At the time, my members, Trustee Cliff Higgerson of ComVentures has own focus was on the realization of connectionist agreed to be the new Chairman of our Board.

ICSI Gazette Page 2 ICSI Gazette Page 3 Howell Shaw as i see it by Nelson Morgan, Director news briefs

focused entirely on FrameNet, features HAYDEN GEORGE HODSON was born at 4:22 Researchers in the SPEECH GROUP as well p.m. on Friday, October 24, 2003 weighing as BCIS will be participating in a UC several articles by current and former ICSI in at 8.1 pounds and Berkeley project called “Information and researchers. 21 inches long. The Communication Technology for Billions” SVEN BEHNKE KLAUS WEHRLE proud parents are ICSI’s which aims to provide affordable computer ICSI alumni and XORP researcher, Orion technology for third world countries. were two of seven recipients of the Action Hodson and his wife, Plan in Computer Science award from the DFG, a German national research Heather. Hayden is the MICHAEL ELLIS has joined the ICSI SysAdmin foundation. The grants will facilitate the couple’s first child. Hayden Hodson Staff as its new Manager. DAVID JOHNSON will be moving to a new role leading foundation of a junior research group in Germany. infrastructure projects. SCOTT MCCOMAS also The Speech Group recently received started as a full time SysAdmin. a grant from NSF entitled “Modeling ICSI welcomed three new board members Idiosyncrasies in Speaking Behavior.” in 2004. SCOTT SHENKER, DAVID NAGEL AND JACI CONSIDINE joined the Admin staff as the BARBARA PESKIN will head a project to SHANKAR SASTRY new person at ICSI’s front desk. all started their service to improve speaker recognition. our board in January. CLIFF HIGGERSON has taken over Chairman of the Board duties JENNY NGUYEN joined the Accounting Researchers in ICSI’s Networking Group, ELWYN department and will be taking the place as well, as our former Chairman led by VERN PAXSON, will be participating BERLEKAMP has retired. of MARY PENILLA while Mary is out on in a $5.3 million NSF grant to fund maternity leave. cybersecurity research. They will be ICSI hosted an OPEN HOUSE on February collaborating with UC Davis, Pennsylvania ICSI has several new permanent staff 27 that included posters, demonstrations State University, and Purdue University. presented by researchers. NIKKI MIRGHAFORI, a former Ph.D. student at ICSI, has returned to work with ICSI Research VERN PAXSON has Staff, a brief the Speech Group. JOHN MOODY is working also been funded overview of with the Algorithms Group. Finally, MARK by NSF for a the Institute by ALLMAN is working with the ICSI Center for project entitled Internet Research (ICIR). We look forward director Nelson “Viable Network Morgan, and a to the contributions our new researchers Leah Hiitchcock Defense for will bring to ICSI’s research projects. lecture by Vern FrameNet Demonstration Scientific Research Paxson called “The Threat of Internet Institutions.” This Worms”. ICSI’s new Faculty Associate, ANNALEE project will focus SAXENIAN, of UCB’s City and Regional on improving RICHARD KARP Planning Department, has been appointed , leader of ICSI’s Algorithms

the Bro intrusion Diane Starr dean of UC Berkeley’s School of Group, has been selected to receive detection system. Vern Paxson Information Management and Systems. the 2004 Benjamin Franklin Medal in Computer and Cognitive Science for his The Artificial Intelligence Group has BFOIT, the Berkeley Foundation for contributions to the understanding of received a grant from DARPA to continue Opportunities in Information Technology, computational complexity. work on the FRAMENET project. has been awarded a grant from the SCOTT SHENKER Elizabeth and Stephen Bechtel, Jr. , leader of ICSI’s Networking SRINI NARAYANAN of the Artificial Foundation. Group, has been selected as an ACM Intelligence Group has been awarded a Fellow for his achievements in the field of subcontract from University of Texas information technology. ICSI’s FRAMENET project was featured at Dallas for work on an ARDA project in the September 2003 issue of the “Causal and Temporal Reasoning for LIZ SHRIBERG International Journal of Lexicography Speech Researcher in the news: , Question Answering Systems.” (Volume 16, Number 3). The publication, a senior researcher in ICSI’s Speech Group

Continued on Page 6

ICSI Gazette Page 2 ICSI Gazette Page 3 EARS Novel Approaches Staff (Left to Right): Nelson Morgan, Dave Gelbart, Qifeng Zhu, Barry Chen Leah Hitchcock Effective Affordable Reusable

Howell Shaw Speech-to-text: EARS

In 2002, DARPA initiated a major speech research program called EARS (Effective Affordable Reus- able Speech-to-text), which aims to radically improve speech recognition by machine using Novel Approaches (new and innovative methods) and Rich Transcription (improving word error rates and increasing readability and value to downstream processing through automatic punctuation and other forms of markup). Researchers at ICSI are working on both major components of the program.

Nelson Morgan is the Principal Investigator of the Novel Approaches frequency than is done in short-term spectral analysis, ICSI Novel Approaches project. The goal of the Novel Approaches aspect of EARS is and this is likely to contribute to the robustness Andreas Stolcke, of ICSI and SRI, is the Principal that is observed in human recognition of speech. In Investigator of the Rich Transcription Project. to examine closely the standard methods currently used for speech recognition, and to try to improve particular, ICSI is developing methods inspired by the Barbara Peskin is the site PI for the Rich apparent capabilities of the human system to take Transcription team and the co-lead of the upon them by figuring out which aspects of the Metadata effort. technology are limiting its improvement, and to advantage of temporal properties of speech, which are quite complementary to spectral measures. This is Liz Shriberg is the co-lead of the Metadata effort develop new methods to either replace the old ways with particular focus on sentence segmentation or use in conjunction with them. primarily being done in the context of improving the and disfluency detection, and incorporation of recognition of large vocabulary conversational speech, prosodic features. ICSI, in conjunction with SRI, University of which is one of the main goals of EARS. Chuck Wooters is working on diarization, which is the segmentation of the signal into individual Washington, Columbia University, and IDIAP, is speakers as well as other acoustic sources, and taking a perception-based approach to solving this Rich Transcription labeling these elements consistently. problem. Current speech recognition systems The Rich Transcription EARS project aims to make Nikki Mirghafori recently joined the Rich analyze very short (10-25ms) chunks of sound, transcripts generated by automatic speech recognition Transcription team and will be studying adaptation strategies for broadcast news data. computing some simple function of a short-term systems readable and provide extra value to both spectrum. From this analysis statistical models human and machine users of the data. There are Barry Chen is working on new front end processing techniques for automatic speech recognition, generate probabilities for what speech unit is being several specific areas for improvement. At the end involving using discriminative temporal patterns uttered at that time. Sequences of these probabilities of three years, word error rates (WER) for broadcast over half a second in narrow frequency channels. are combined according to possible hypotheses of news are targeted to improve from around 20% to Dave Gelbart is investigating feature selection what might have been said, along with probabilities 5%, and after five years, the WER for telephone for multi-stream speech recognition, to improve performance by taking into account the presence of each word given a hypothesized word history. All speech is targeted to improve from around 30% to of multiple streams during the feature selection together these probabilities are used to determine 5%. Additionally, the rate of transcription needs to process. the most likely word sequence. be improved so that it works in real time on 1 cpu (a Qifeng Zhu is working on the development of standard personal computer). plausible features to use for testing the new speech recognizers. While this is a logical approach that has led to many successes, it is also limited. In particular, Simply having an accurate string of words does not Yan Huang is working on speech recognition. In particular, she developed a broadcast news the short-term spectrum is extremely fragile with make a transcript readable, however. Some form recognizer for the Mandarin language, and is respect to sources of variability that have nothing of markup is needed to mark pauses, disfluencies, currently working on language modeling. to do with the linguistic message. ICSI is working punctuation, who is speaking, etc. All of the non-word Yang Liu and Jeremy Ang are working on metadata to transcend this limitation by looking at the features providing auxiliary information to make extraction. In particular, Yang is working on sentence segmentation and disfluency detection, problem from the point of view of how humans the transcript readable are called Metadata. ICSI’s and Jeremy is providing data analysis. recognize speech. The window of time that current participation in the SRI-led Rich Transcription project,

Sebastien Coquoz is working on using feedback recognizers use for analysis is much shorter than a collaboration between SRI, ICSI, and the University from metadata extraction to improve word error anything that the human ear can hear and make of Washington, is focusing primarily on metadata rates, and vice versa. sense of. Humans recognize entire syllables and extraction, though it includes speech transcription even words all at once; therefore, a key part of the work as well. Metadata extraction includes cues for ICSI Novel Approach is to use longer chunks of speaker segmentation and tracking, disfluency and time that correspond to about 1-2 syllables (about sentence boundary detection, and incorporation of half a second). More generally, the human auditory other prosodic cues. system appears to compute a much broader range of functions of the input signal over time and Rich Transcription has the added challenge of adapting the technology for use in other languages, EARS Rich Transcription Staff including Mandarin and Arabic. Back row (Left to Right): Chuck Wooters, Liz Shriberg, Jeremy Ang, Yan Huang, Yang Liu, Sebastien Coquoz Front row (Left to Right): Nikki Mirghafori, Andreas Stolcke, Barbara Peskin Adam Janin

ICSI Gazette Page 4 ICSI Gazette Page 5 featured alum: Dan Jurafsky

Continued from Page 1

The invitation to join ICSI was a turning point. MacArthur Fellowship notwithstanding, Jurafsky “Jerry said, ‘Take a postdoc at ICSI for a year, and faces the same challenges as any college teacher. “I you can teach in Hong Kong next year,’” Jurafsky taught my first freshman course this semester and remembers. The postdoc lasted for three years and it was intense, dealing with teenagers. They are was pivotal for Jurafsky and his field of study. “My focused on fairness. ‘If it’s not fair, it’s your fault. time at ICSI was very important. This was the key Why didn’t you tell us that question was going to intellectual learning experience for me. Half of what be on the test?’ I wasn’t prepared for that.” I need to know in my field I learned at ICSI doing my postdoc.” Nelson Morgan noted, “while doing Not every student is destined for a stellar science his postdoc at ICSI, Dan was a key contributor in career and dealing with that realization can be the Berkeley Restaurant Project (called “BeRP”), difficult. “The worst part of the job is dealing with which produced a spoken dialog system for getting students who aren’t doing well. Grad students restaurant information in the city of Berkeley. Dan having trouble with their dissertation, undergrads impressed us all immensely despite his modesty.” not getting it. They’re doing something they’re bad at. They don’t like what they’re doing.” Jurafsky has been working at the intersection of computer science and linguistics for some time. But the flip side, nurturing understanding in “My first computer actually was a programmable promising minds, calculator my math teacher had when I was in 7th can make up for grade. The first real computer I worked on was a those difficult times. PDP-8. DEC was giving away computers to high “The best part of the schools and everyone worked on them. They used job is also working “When I was about 12 paper-tape.” with students. They years old, I read a newspa- come to you and per story about how IBM While he was in high school a friend of his was say, ‘Remember that had invented the Talking interested in linguistics and Dan caught the bug, problem I was having?’ reading all the books on the subject he could find. and they show you Typewriter, and the speech He also studied languages, including German and how they solved it. problem was completely French in high school, and Hebrew as part of his And you’re happy solved. I was upset! That’s religious training. and they’re happy the work I wanted to do. because something has I’m not sure where I got the With feet firmly implanted in both camps, Jurafsky dawned.” has some insights into the different approaches idea, probably watching TV favored by linguists and computer scientists. Universities across shows like the Jetsons or the world are full Lost In Space.” “Linguists like to work away with the data. Which of talented teachers is good. They try to get at the really deep answer and researchers. about what’s really going on. But sometimes What makes Jurafsky it’s hard to see the forest for the trees. They different? Feldman sees get the exact answer to a small problem, rather three things. First, there is the breadth of his work, than finding an approximate answer to a larger which eclipses academic boundaries. Next, there is a problem.” seriousness to his science. Instead of just finding a solution, Jurafsky goes after a deeper explanation. “Computer scientists never want to look at the data. ‘I’m going to build an algorithm and if it doesn’t And lastly, “… he’s extremely smart,” said Feldman. work, I’m going to tweak it.’ But the math is good. “As a postdoc, he took the lead in the language and They understand the probabilistic model.” speech group, without any background in speech.”

ICSI Gazette Page 4 ICSI Gazette Page 5 visiting scholars

Since its inception, ICSI has had a strong inter- From Switzerland national program, consisting primarily of ties with Sebastien Coquoz (Speech) specific countries. Vincenzo Pallotta (FrameNet)

Current formal agreements continue with Finland, In addition, we often have visitors associated Germany, Spain, and Switzerland. with specific research and projects. From France From Finland Brigitte Bigi (Speech) Andrei Gurtov (Networking) Pekka Himanen (BCIS) From Israel Konsta Koppinen (Speech) Robi Krauthgamer (Algorithms)

Leah Hitchcock Vesa-Matti Lahti (BCIS) Sebastien Coquoz Ron Shamir (Algorithms) Olli Leppanen (BCIS) Roded Sharan (Algorithms) Kira Lopperi (Haas) Maarit Makinen (BCIS) ICSI is also involved in collaborative research with col- Tuomo Pirinen (Speech) leagues at other American institutions. Jusso Rantala (Haas) From Boston University Alberto Medina (ICIR) From Germany Sven Behnke (Speech) From UCB Jens Gramm (Algorithms) Larissa Muller (BCIS) Till Nierhoff (Algorithms) Joyojeet Pal (BCIS) Robin Sommer (Networking)

Diane Starr Kannan Ramchandran (Qualcomm) Till Tantau (Algorithms) Vincenzo Pallotta Oliver Wendt (Algorithms) From UCSD Britta Wrede (Speech) Sriram Rambhadran (ICIR)

From Spain From Stanford Javier Cardona (Networking) Balaji Prabhakar (Algorithms) Chema Gonzalez (Networking) Devavrat Shah (Algorithms) Javier Macias (Speech) Sira Palazuelos (FrameNet) Diane Starr Brigitte Bigi

News Briefs: (cont.) LIZ SHRIBERG and ANDREAS STOLCKE hosted a meeting for a new ITR project on January 16, 2004 at ICSI.

was quoted in the New York Times on January 3 NICK WEAVER, of the Networking Group, was quoted in regarding disfluencies in spontaneous speech. the New York Times magazine on February 8, 2004 regarding his research on Internet Worms. The ICSI MEETING CORPUS has been released by the Linguistic Data Consortium. For more information on ICSI News Briefs, see: http://www.icsi.berkeley.edu/Events/newsbriefs.html February 2004 marks the 10th anniversary of ICSI’S WEBSITE . Diane Starr Nick Weaver

ICSI Gazette Page 6 ICSI Gazette Page 7 publications listing

J. AJMERA AND C. WOOTERS. A Robust Speaker Glossary, International Journal of Lexicography, Traffic, Proc. IEEE Symposium on Security and Clustering Algorithm. Proc. IEEE Speech Vol 16.3: 359-362. 2003. Privacy, May 2003. Recognition and Understanding Workshop, St. Thomas, U.S. Virgin Islands, Dec. 2003. S. FLOYD. Limited Slow-Start for TCP with Large R. SHARAN, T. IDEKER, B. KELLEY, R. SHAMIR AND R. M. Congestion Windows. Internet draft, work in KARP. Identification of Protein Complexes by M. ALLMAN, W. EDDY, AND S. OSTERMANN. Estimating progress, July 2003. Comparative Analysis of Yeast and Bacterial Loss Rates With TCP. ACM Performance Protein Interaction Data. ICSI Technical Evaluation Review, 31(3), December 2003. S. FLOYD AND J. KEMPF, Editors. IAB Concerns Report, 2003. Regarding Congestion Control for Voice M. ALLMAN. On the Performance of Middleboxes. Traffic in the Internet. Internet draft, work in E. SHRIBERG AND A. STOLCKE. Direct Modeling ACM SIGCOMM/Usenix Internet Measurement progress, October 2003. of Prosody: An Overview of Applications in Conference, Miami, FL, USA, October 2003. Automatic Speech Processing. To appear in S. FLOYD AND R. MAHAJAN. Router Primitives for Proc. International Conference on Speech S. ATKINS, M. RUNDELL AND H. SATO. The Protection Against High-Bandwidth Flows Prosody 2004, Nara, Japan. Contribution of FrameNet to Practical and Aggregates. Internet-draft-to-be, work in Lexicography. International Journal of progress, April 2003. E. SHRIBERG AND A. STOLCKE. Prosody Modeling Lexicography, Vol 16.3: 333-357, 2003. for Automatic Speech Recognition and S. GREENBERG, H. CARVEY, L. HITCHCOCK AND S. CHANG. Understanding. To appear in Mathematical S. ATKINS, C. J. FILLMORE AND C. R. JOHNSON. Temporal properties of spontaneous speech- Foundations of Speech and Language Lexicographic Relevance: Selecting Information a syllable-centric perspective. Journal of Modeling, M. Johnson, M. Ostendorf, S. From Corpus Evidence, International Journal of Phonetics, 31, 2003. Khudanpur, R. Rosenfeld (eds.), Volume 138 Lexicography, Vol 16.3: 251-280, 2003. in IMA Volumes in Mathematics and its A. GURTOV AND S. FLOYD. Modeling Wireless Links Applications, pp. 105-114, Springer-Verlag. C. F. BAKER, C. J. FILLMORE AND B. CRONIN. for Transport Protocols. November 2003, The Structure of the FrameNet Database, under submission. P. SOMERVUO, B. CHEN AND Q. ZHU. Feature International Journal of Lexicography, Vol 16.3: Transformations and Combinations for 281-296, 2003. R. HUEBSCH, J. M. HELLERSTEIN, N. LANHAM, B. T. LOO, Improving ASR Performance. EUROSPEECH S. SHENKER AND I. STOICA. Querying the Internet 2003, Geneva, September 2003. C. F. BAKER AND H. SATO. The FrameNet Data with PIER. VLDB, 2003. and Software. Poster and Demonstration at R. SOMMER AND V. PAXSON. Enhancing Byte-Level Association for Computational Linguistics, T. KELLY, S. FLOYD AND S. SHENKER. Patterns of Network Intrusion Detection Signatures with Sapporo, Japan, 2003. Congestion Collapse. Under submission, June Context, Proc. ACM CCS 2003. 2003. B. CHEN, S. CHANG AND S. SIVADAS. Learning N. WEAVER, J. HAUSER, AND J. WAWRZYNEK. “The Discriminative Temporal Patterns in Speech: D. MOORE, V. PAXSON, S. SAVAGE, C. SHANNON, S. SFRA, a Corner-Turn FPGA Architecture”, Development of Novel TRAPS-Like Classifiers. STANIFORD AND N. WEAVER. Inside the Slammer Proceedings of the ACM Workshop on Field EUROSPEECH 2003, Geneva, September 2003. Worm. Security and Privacy, July/August 2003. Programmable Gate Arrays (FPGA), Feb 2004.

R. DHILLON, S. BHAGAT, H. CARVEY AND E. SHRIBERG. N. MORGAN, B. Y. CHEN, Q. ZHU AND A. STOLCKE. N. WEAVER, V. PAXSON, S. STANIFORD AND R. Meeting Recorder Project: Dialog Act Labeling Scaling Up: Learning Large-scale Recognition CUNNINGHAM. A Taxonomy of Computer Worms, Guide. ICSI Technical Report. Methods from Small-scale Recognition Tasks. Proc. ACM CCS Workshop on Rapid Malcode, ICSI Technical Report, 2003. October 2003. L. DOCIO-FERNANDEZ, D. GELBART AND N. MORGAN. Far-field ASR on Inexpensive Microphones. S. NARAYANAN, S. MCILRAITH. Analysis and N. WEAVER, V. PAXSON, S. STANIFORD AND R. EUROSPEECH 2003, Geneva, September 2003. Simulation of Web Services. Computer CUNNINGHAM. Large Scale Malicious Code: A Networks. 2003. Research Agenda. DARPA-sponsored report, A. FARIA. Pitch-based Vocal Tract Length 2003. Normalization. ICSI Technical Report, 2003. R. PANG AND V. PAXSON. A High-level Programming Environment for Packet Trace B. WREDE AND E. SHRIBERG. The Relationship J. FELDMAN AND S. NARAYANAN. EMBODIMENT IN A Anonymization and Transformation, Proc. Between Dialogue Acts and Hot Spots in NEURAL THEORY OF LANGAUGE. Brain and Language, ACM SIGCOMM 2003, August 2003. Meetings. Proc. IEEE Speech Recognition and 2003. Understanding Workshop, St. Thomas, U.S. Virgin E. SEGAL AND R. SHARAN. A Discriminative Model Islands, Dec. 2003. C. J. FILLMORE, C. R. JOHNSON AND M.R.L. PETRUCK. for Identifying Spatial cis-Regulatory Modules. Background to FrameNet, International Journal ICSI Technical Report, 2003. B. WREDE AND E. SHRIBERG. Spotting “Hot Spots” of Lexicography, Vol 16.3: 235-250, 2003. in Meetings: Human Judgements and Prosodic U. SHANKAR AND V. PAXSON. Active Mapping: Cues. EUROSPEECH 2003, Geneva, September C. J. FILLMORE AND M.R.L. PETRUCK. FrameNet Resisting NIDS Evasion Without Altering 2003.

ICSI Gazette Page 6 ICSI Gazette Page 7 � � � � � � � � � � � � � THE INTERNATIONAL COMPUTER intelligence, particularly for applications The ICSI Gazette is published twice a � � � � � � � � � � � � � � � SCIENCE INSTITUTE (ICSI) is the only to natural language understanding; and year by the International Computer Sci- � � � � � � � � � independent, non-profit US lab conduct- natural speech processing. ence Institute (ICSI). All material in The ing open, non-proprietary, pre-competi- ICSI Gazette is copyrighted by ICSI and tive research in computer science. Af- BOARD OF TRUSTEES should not be copied without explicit filiated with the University of California Charlie Bass prior written permission. campus in Berkeley, ICSI’s mission is Hervé Bourlard to further research in computer science Beth Burnside EDITOR-IN-CHIEF through international collaboration, Jordan Cohen Nelson Morgan and further international collaboration Adele Goldberg SENIOR EDITOR through research in computer science. Greg Heinzinger Maria Quintana ICSI provides a haven for computer sci- Clifford Higgerson DESIGN ence researchers to conduct concentrated Richard Karp envision ID efforts towards long-term goals without Pedro Lizcano LAYOUT commercial limitations and with few Jitendra Malik Leah Hitchcock faculty pressures. ICSI has significant Nelson Morgan COPYWRITERS efforts in four major research areas: David Nagel John Tortorice Internet research, including Internet ar- Ilpo Reitmaa Leah Hitchcock chitecture, related theoretical questions, Shankar Sastry PHOTOGRAPHY and network services and applications; Scott Shenker Howell Shaw theoretical computer science, including Wolfgang Wahlster Diane Starr applications to bioinformatics; artificial

INTERNATIONAL COMPUTER SCIENCE INSTITUTE

1947 Center Street Suite 600 Berkeley, CA 94704 USA v 510 666 2900 www.icsi.berkeley.edu