About Us


This web site is a joint effort of several entities that have teamed up to offer curated, privacy-safe datasets for software engineers and researchers.

For requests about a specific dataset, please follow the link to the original authors or forge and submit a request there.

For any other request on the web site itself or on the initiative, please contact Boris Baldassari from Castalia Solutions.




Castalia Solutions

Castalia Solutions is a consulting and service provider that offers professional expertise on software quality and software-related open data.

Castalia Solutions hosts the web site, and provides the technical expertise for the retrieval, processing and analysis of data.

Web site: http://castalia.solutions  
Contact: https://castalia.camp/contact  




Crossminer: Developer-Centric Knowledge Mining from Large Open-Source Software Repositories

Crossminer enables the monitoring, in-depth analysis and evidence-based selection of open source components, and facilitates knowledge extraction from large open-source software repositories. The project receives funding under the European Union's Horizon 2020 Research and Innovation Programme under grant agreement No. 732223.

Web site: https://crossminer.org  
Contact: https://www.crossminer.org/contact  




Eclipse Foundation

The Eclipse Foundation is an open source community of Tools, Projects and
Collaborative Working Groups. It is also one of the major forges for open-source sofware, with a focus on industry-grade solutions.

The Eclipse Foundation has put an important effort to make the forge's data available through a set of API endpoints and is directly involved in all Eclipse-related datasets.

Web site: https://eclipse.org  
Contact: https://www.eclipse.org/org/foundation/contact.php