An Empirical Study of Function Overloading in C++

Eighth IEEE International Working Conference on Source Code Analysis and Manipulation An Empirical Study of Function Overloading in C++ Cheng Wang and Daqing Hou Clarkson University Potsdam, NY USA 13699 [email protected] Abstract of example code that demonstrate the use of a particular language feature, it is challenging for the programmers to The usefulness and usability of programming tools decide what features to learn that can solve their problems (for example, languages, libraries, and frameworks) may at hand in the best possible way. It would be ideal if for each greatly impact programmer productivity and software qual- language feature, there is a methodology defined that guides ity. Ideally, these tools should be designed to be both useful the programmers on what kind of a problem the feature is and usable. But in reality, there always exist some tools useful for and how to solve the problem properly with the or features whose essential characteristics can be fully un- feature [11]. derstood only after they have been extensively used. The The formulation of such a methodology could be best study described in this paper is focused on discovering how done by examining and reflecting on as much as possible C++’s function overloading is used in production code us- empirical evidence from the serious use of the particular ing an instrumented g++ compiler. Our principal finding language feature of interest. While language designers tend for the system studied is that the most ‘advanced’ subset of to do their best in guiding their design process with practi- function overloading tends to be defined in only a few utility cal experience and user feedback [2, 11, 14], not all design modules, which are probably developed and maintained by decisions are, or can be, made based on solid empirical ev- a small number of programmers, the majority of application idence. Sometimes, there simply is not a sufficient amount modules use only the ‘easy’ subset of function overloading of experience available at the time a particular design de- when overloading names, and most overloaded names are cision is made [14]. For sophisticated features like over- used locally within rather than across module interfaces. loading and templates, while there is no doubt that such a We recommend these as guidelines to software designers. feature as a whole is useful, its interactions with other features can be numerous and the details can be subtle to pin down. Sometimes, decisions can be made in an arbitrary 1 Introduction and ad hoc fashion, or based on particular requirements that are controversial (for example, the decision for C++ to sup- Programming languages are one of the many important port implicit narrowing in overloading resolution in order tools that programmers use in their daily problem solving. to maintain compatibility with C). For all of these reasons, With the ever increasing size and complexity of the prob- there is always a need to examine how language features are lems tackled by computer software rises the sophistication actually used in the wild in order to understand the conse- of programming tools. This is a bad news for the aver- quences of such decisions, to identify typical use cases of age programmers because to do their jobs properly, they language features, and to develop guidelines. must master their tools first, which will require more ef- One way to gather such empirical evidence is by ana- fort when the tools grow bigger and become more complex. lyzing the source code of large-scale systems. We chose to In particular, current programming languages are providing study the use of the compilation-time overloading feature programmers with an increasingly large set of features to in C++ because it involves a large set of rules that many support them in solving diverse problems. Consequently, programmers would not be confident with. Furthermore, mastering a whole programming language demands serious C++ overloading interacts with many other features of the investment on the part of the programmers. 1 While it is rel- language, for example, namespaces, inheritance, and tem- atively easy to learn how to write and compile small pieces some of the complexity to external libraries and APIs. However, we be- 1Recognizing this problem of increasing language scale, some lan- lieve that learning such a minimal language can still be a challenge for the guage designers are trying to develop a minimal core language and shift average programmers. 978-0-7695-3353-7/08 $25.00 © 2008 IEEE 47 DOI 10.1109/SCAM.2008.25 class Y; class X f ture. That is, given an expression that applies an overloaded public: name, a compiler needs to choose a single candidate that operator char() const freturn ’a’;g best matches the expression (overloading resolution). The void foo (int); //f1 void foo (char);//f2 compiler thus needs to implement a set of rules that govern void foo (double);//f3 void foo (X);//f4 1. how to identify from the surrounding scopes of the ex- void foo (Y&);//f5 pression the set of candidate operations with the same g; name as what is used by the expression (candidate set), class Y: public X f g; 2. how to convert from argument types appearing in the void foo (double); //f6 calling expression to the parameter types in the defini- void foo (int);//f7 tion of a candidate operation and rank the conversions, void bar(Y & aY)f //C=ff6,f7g, V=ff6,f7g best: 3. how to determine the set of operations to whose param- //f7<cr promotion:ck std> eter types there exist feasible conversions from types in foo(’c’); the expression (viable set), and //C=ff6,f7g,V=ff6,f7g best: //f7 <cr user:ck user->ck std> foo(aY); 4. how to determine the best candidate from the viable set //C=ff1...f5g,V=ff1,f2,f3g best: as the final result that the expression should be linked //f2 <cr std:ck ptr,cr identity:ck identity> to. If no such candidate can be found, the process fails aY.foo(’a’); //C=ff1...f5g,V=ff1...f5g best: and a compilation error is given to the programmer. //f5 <cr std:ck ptr,cr identity:ck ref bind> aY.foo(aY);g Table 1 summarizes the implicit type conversion rules of C++. A rank on top of the table is considered better than ones below it. Also note that a rank may contain sub- Figure 1. Example of function overloading categories known as kind. and the process of overloading resolution. C Figure 1 shows a simple example designed to illustrate stands for candidate set and V for viable set. the process of overloading resolution in C++. For each overloaded call in bar(), its candidate set and viable set, plates. Thus we felt that overloading would be a complex the best candidate, and the conversion sequences from ar- feature for which a methodology is needed to guide its use. guments to parameter s are given in the comment above the There is also doubt on the usefulness of compilation-time call. For example, foo(’c’) has a candidate set that con- overloading [11]. We hoped that our study could shed some tains f6 and f7. Its viable set also contains f6 and f7 light on this controversy as well. since there are conversions from char to both double In order to gather data about the use of overloading in (for f6) and int (for f7). The conversion from char to practical systems, the GNU g++ compiler 2 is modified and int has a rank of cr promotion and a kind of ck std. used to compile a Mozilla browser. 3 In this paper, we ana- which is represented as <cr promotion:ck std> in lyze the data and report our findings about the use of over- the comment. Note that in this example, foo(aY) has loading in the Mozilla code base. Analysis of data extracted a conversion sequence of length 2, and the conversion se- from another system (MySQL 4) is currently under way. quences of all other calls are of length 1. The rest of this paper is organized as follows. Section 2 Overloading, when used appropriately, can increase pro- provides a brief overview of C++ overloading. Section 3 gram readability and clarity [10]. Without this feature, it describes the study method, how g++ is instrumented, and would be rather difficult to designate different but meaning- what kind of data are gathered. Section 4 presents the re- ful names to the several foo’s in Figure 1 when the opera- sults of our analysis. Section 5 presents related work. Sec- tions have similar behavior but different parameter types. tion 6 concludes the paper and anticipates future work. On the other hand, the size of Table 1 indicates that C++ overloading rules are rather complex to understand. But this 2 Overloading in C++ is the result of a careful design that accommodates multiple design requirements of C++ [14]. Each conversion rule has a reason. For example, ck std results from the design de- Overloading allows a single name to be used to repre- cision to maintain compatibility with C’s basic types and sent more than one operation. It is a compile-time fea- conversions (narrowing), and ck ptr from the interaction with inheritance. cr user is designed to make it possi- 2http://gcc.gnu.org/ 3http://www.mozilla.org/ ble for user-defined types to be used in the same way as 4www.mysql.com primitive types. In particular, with the addition of operator 48 rank kind an overloading call among the candidates (for exam- cr identity ck identity (two identical ple, by designing the candidates such that they have types), ck lvalue (e.g., array to obviously different parameters).

An Empirical Study of Function Overloading in C++

CSE 307: Principles of Programming Languages Classes and Inheritance

Polymorphism

C/C++ Program Design CS205 Week 10

CHAPTER-3 Function Overloading SHORT ANSWER QUESTIONS 1

Comparative Studies of 10 Programming Languages Within 10 Diverse Criteria Revision 1.0

C++ Programming Fundamentals Elvis C

Virtual Base Class, Overloading and Overriding Design Pattern in Object Oriented Programming with C++

The C++ Language C++ Features

Extras - Other Bits of Fortran

Guidelines for the Use of the C++14 Language in Critical and Safety-Related Systems AUTOSAR AP Release 18-03

Modern Fortran OO Features

External Procedures, Triggers, and User-Defined Function on DB2 for I