Early Development of the Corpus

These publications were based on the corpus before its organisation into a form for external distribution. They are listed in order the studies were carried out, rather than publication order (which due to the vagarities of publication schedules can be quite different), in order to provide a view of the growth of the corpus.
  1. Hayden Melton and Ewan Tempero. Identifying Refactoring Opportunities by Identifying Dependency Cycles. In Proceedings of the Twenty-Ninth Australasian Computer Science Conference, Hobart, Australia, January 2006. [Publisher]
    This paper discussed Azureus, Tomcat, Eclipse, and Netbeans, although a few more (probably about 10 in total) systems had been gathered, using the Purdue Benchmark Suite as the starting point. At this point only sources were being gathered.
  2. Hayden Melton and Ewan Tempero. The CRSS Metric for Package Design Quality. In Proceedings of the Thirtieth Australasian Computer Science Conference, Ballarat, Australia, January 2007. [Publisher]
    This study was on the source of 21 systems.
  3. Homan Ma, Robert Amor and Ewan Tempero. Usage Patterns of the Java Standard API. In Proceedings of the 13th Asia Pacific Software Engineering Conference (APSEC'06), Bangalore, India. [DOI]
    This study was on the source code of the latest versions we had of 76 systems.
  4. Homan Ma, Robert Amor and Ewan Tempero Indexing the Java API Using Source Code Australian Software Engineering Conference (ASWEC), Perth, Australia. March 2008. pp. 451-460.
  5. This study was on the source code of the latest versions we had of 76 systems.
  6. Hayden Melton and Ewan Tempero. An Empirical Study of Cycles among Classes in Java. In Empirical Software Engineering: An International Journal, vol. 12, num. 4, August 2007. [DOI]
    This study was on the bytecode of 78 systems, 8 of which were commercial. It was around the time of this study that we began deliberately gathering multiple versions of systems for longitudinal studies.
  7. Hong Yul Yang and Ewan Tempero. Measuring the Strength of Indirect Coupling. In Proceedings of 2007 Australian Software Engineering Conference (ASWEC'07), Melbourne, Australia. [DOI]
    This study was on the bytecode of about 30 systems.
  8. Gareth Baxter, Marcus Frean, James Noble, Mark Rickerby, Hayden Smith, Matt Visser, Hayden Melton and Ewan Tempero. Understanding the Shape of Java Software. In OOPSLA 2006 Proceedings, Portland, Oregon, USA, October 2006.
    This study was on the bytecode of 56 systems.
  9. Hayden Melton and Ewan Tempero. JooJ: Real-time Support for Avoiding Cyclic Dependencies. In Proceedings of the Thirtieth Australasian Computer Science Conference, Ballarat, Australia, January 2007.
    This used systems in the corpus to test the performance (space and time) of the tool the paper reports on.
  10. Hayden Melton and Ewan Tempero. Static Members and Cycles in Java Software. In Proceedings of the First International Symposium on Empirical Software Engineering and Measurement, Madrid, Spain, September 2007.
    This study was on the bytecode of the latest versions we had of 81 systems.
  11. Richard Barker and Ewan Tempero. A Large-Scale Empirical Comparison of Object-Oriented Cohesion Metrics. In Proceedings of 14th Asia-Pacific Software Engineering Conference (APSEC'07), Nagoya, Japan.
    This study was on the source code of the latest versions we had of 92 systems, 80 open- source and 12 closed-source.
  12. Hong Yul Yang, Hayden Melton and Ewan Tempero 'An Empirical Study into Use of Dependency Injection in Java' 19th Australian Software Engineering Conference (ASWEC), Software Engineering Research Report, University of Auckland Perth, Australia. March 2008. pp. 239-247. [DOI]
    This study was on the bytecode of 34 systems.
  13. Ewan Tempero, James Noble and Hayden Melton 'How do Java Programs Use Inheritance? An Empirical Study of Inheritance in Java Software' 22nd European Conference on Object-Oriented Programming (ECOOP), Springer Berlin / Heidelberg Paphos, Cyprus. July 2008. pp. 667-691. [DOI]

    This study was on the bytecode of 93 open-source systems, however it included longitudinal studies of all systems for which we had multiple versions (although only some appear in the paper).