Java Generics Adoption: How New Features Are Introduced, Championed, Or Ignored

Java Generics Adoption: How New Features are Introduced, Championed, or Ignored Chris Parnin Christian Bird Emerson Murphy-Hill College of Computing Microsoft Research Dept. of Computer Science Georgia Institute of Redmond, Washington North Carolina %tate Technology [email protected] University Atlanta, Georgia Raleigh, North Carolina [email protected] [email protected] #ar too often, greatly heralded claims and visions of new language features fail to hold or persist in practice. Discus- sions of the costs and benefits of language features can easily 1. ABSTRACT devolve into a religious war with both sides armed with little Support for generic programming was added to the Java more than anecdotes (13]. .mpirical evidence about the language in 2004, representing perhaps the most significant adoption and use of past language features should inform change to one of the most widely used programming lan- and encourage a more rational discussion when designing guages today. Researchers and language designers antici- language features and considering how they should be de- pated this addition would relieve many long-standing prob- ployed. Collecting this evidence is not just sensible but a lems plaguing developers, but surprisingly, no one has yet responsibility of our community. measured whether generics actually provide such relief. In In this paper, we examine the adoption and use of generics, this paper, we report on the first empirical investigation into which were introduced into the Java language in 2004. When how Java generics have been integrated into open source Sun introduced generics, they claimed that the language software by automatically mining the history of 20 popular feature was 5a long-awaited enhancement to the type system” open source Java programs, traversing more than 500 million that 5eliminates the drudgery of casting.” Sun recommended lines of code in the process. "e evaluate five hypotheses, that programmers 5should use generics everywhere (they) can. each based on assertions made by prior researchers, about The extra e7orts in generifying code is well worth the gains how Java developers use generics. #or e$ample, our results in clarity and type safety.”1 But is it? suggest that generics do not significantly reduce the number Here, we take the first look at how features of Java gener- of type casts and that generics are usually adopted by a ics, such as type declarations, type-safe collections, generic single champion in a project, rather than all committers. methods, and wildcards, have been introduced and used in real programs. With the benefit of six years of hindsight, Categories and Subject Descriptors we investigate how the predictions, assertions, and claims D.3.' (Language Constructs and Features)* Data types that were initially made by both research and industry have and structures+ F.3.3 (Studies of Program Constructs)* played out in the wild. #urther, we investigate the course and ,ype structure timeline of adoption: what happens to old code, who buys in, how soon are features adopted, and how many projects and people ignore new features8 The results allow us to adjust General Terms our expectations about how developers will adopt future Languages, Experimentation language features. "e make the following contributions in this paper: Keywords we enumerate the assumptions and claims made in the • past about Java generics (Section 4); /enerics, Java, languages, post-mortem analysis we investigate how 20 open source projects have used < • and have not used < Java generics (Section = to >;+ and 2. INTRODUCTIO! we discuss the implications of the adoption and usage Programming languages and tools evolve to match industry • patterns of generics (Section 9). trends, revolutionary shifts, or refined developer tastes. 1ut not all evolutions are successes; the technology landscape is poc2ed with examples of evolutionary dead-ends and dead- 3. AN "VER% &' "( GENERICS on-arrival concepts. In this section we brie@y describe the motivation and use of Permission to make digital or hard copies of all or part of this work for generics. In an effort to maintain consistent terminology, we Permissionpersonal or to classroom make digital use is or granted hard copies without of fee allprovided or part of that this copies work are for present in bold the terms that we use in this paper, drawing personal or classroom use is granted without fee provided that copies are not made or distributed forfor profit or commercial adadvantagevantage and that that copies copies from standard terminology where possible. Readers who are bear this notice and the fullfull citation on the first pagpage.e. To copy copy otherwise, otherwise, to to familiar with Java generics may safely skip this section. republish,republish, to to post on serversservers or to redistribute to list lists,s, requires requires prior prior specific specific permission and/or a fee. permission and/or a fee. 1 MSR’11, ’11, May May 21–22, 21U22," 2011, 2011, Waikiki, %aikiki, Honolulu, &onolulu, HI, HI, USA #() http://download.oracle.com/javase/1.5.0/docs/guide/language/ Copyright 2011 ACM)* 978-1-4503-0574-7/11/05+,8-1-4503-0574-7/11/05 ...$10.00...$10.00. generics.html 3 3.1 Moti*ation +or Generics return (T)internal.get(index); ... In programming languages such as Java, type systems can } ensure that certain kinds of runtime errors do not occur. #or In this code, the T is called the formal type parameter. example, consider the following Java code: The programmer can use her MyList class by instantiating List l = getList(); the formal type parameter by using a type argument [4], System.out.println(l .get(10)); such as Integer or File in the following examples: This code will print the value of the 10th element of the list. MyList<Integer> intList = new MyList<Integer>(); The type system ensures that whatever object getList() MyList<File> leList = new MyList<File>(); returns, it will understand the get message, and no runtime Each place where a generic type declaration is invoked :in type error will occur when invoking that method. In this this example, there are four) is known as a parameterized way, the type system provides safety guarantees at compile type (!]. En the first line, the programmer has declared time so that bugs do not manifest at run time. the type of the intList object so that the compiler knows Now suppose we want to take the example a step further; that it contains objects of type Integer, and thus that the suppose that we 2now that l contains objects of type File, expression intList.get(10) will be of type Integer. The and we would li2e to 2now whether the tenth file in the List result is that the client code is both type safe and clearly is a directory. "e might naturally (and incorrectly) write: expresses the programmer’s intent. The programmer can List l = getList(); also use generic type declarations without taking advantage System.out.println(l .get(10). isDirectory ()); of generics by using them as raw types, such as MyList ob- jectList, in which case the e$pression objectList.get(10) Unfortunately, this leads to a compile-time error, because the will be of type Object. return type of the get method is specified at compile-time In addition to creating their own generic type declara- as Object. The type chec2er gives an error because it does tions, programmers can use generic type declarations from not know what types of objects are actually in the list. libraries. #or example, software developers at Sun gener- In early Java, programmers had two ways to solve this ified [5], or migrated to use generics, the Java collections problem, the first is casting, and the second we call home- classes. #or instance, the List class was parameterized, so grown data structures. If the programmer implements that the previous problem could also be solved li2e so: the casting solution, her code would look like this: List<File> l = getList(); List l = getList(); System.out.println(l .get(10). isDirectory ()); System.out.println((( File)l .get(10)). isDirectory ()); In addition to using generics in type declarations, generics The cast is the (File) part, which forces the compiler to can also be applied to individual methods to create generic recogniCe that the expression l.get(10) actually evaluates methods, like so: to the File type. While this solves one problem, it causes <!> ! bigHead(List<!> as1$ List<!> as%) another; suppose that a programmer at some point later { forgets that the list was intended to hold Files, and inadver- return as1.get(0)> as2.get(0) & as1.get( 0) ' as2.get(0); tently puts a String into the List. Then when this code is } executed, a runtime e$ception will be thrown at the cast. D In this code, the programmer can pass to the bigHead method related problem is that the code is not as clear as it could be, two generic lists containing any type, and the type chec2er because nowhere does the program e$plicitly specify what will assure that those two types are the same. kind of objects the list returned by getList() contains. If the programmer instead implements the home-grown data structure solution, the code will look li2e this* 4. RELATED '"RK In this section, we discuss previous claims about and studies FileList l = getList() ; of generics. System.out.println(l .get(10). isDirectory ()); Additionally, the programmer would need to create a FileList 4.1 Claims Regarding Generics class. ,his solution also introduces new problems. 0erhaps There have been a number of papers and books that have the most significant is the code explosion problem+ for each extolled the benefits of using generics in several contexts. "e and every list that contains a different type, the program- list here a sample of such claims.

Load more