Go to main content

Open data format

Open, non-proprietary format for the exchange of research data and metadata for use with popular statistical programs.

The “Open Data Format” offers an innovative solution for data processing and data exchange in research. The aim of the project is to develop an open, non-proprietary, multilingual data format that is enriched with additional information and can be used in common statistical programs. The enriched metadata contributes to optimizing the research process and fulfilling the requirements of the FAIR principles.

For Research Data Centers (RDC)

The open data format provides an efficient way to create a unified data format that can be used by a wide range of users, facilitated by easily available and installable import filters. This avoids the necessity for data producers to create multiple formats for different user requirements, consequently guaranteeing optimized data processing. Moreover, there’s the possibility to enrich the data with supplementary information, which was previously difficult to realize due to software-specific limitations, thereby enhancing data documentation. The format also offers benefits for long-term archiving, as the data can be used independently of proprietary software.

For Researchers

The “Open Data Format” enables data users to process and analyze data in various software environments, offering efficient and flexible workflows without depending on proprietary software. Data users do not need to change their usage habits. Additionally, more information than usually provided through conventional data formats is available through the Open Data Format, such as links to data portals that can be directly accessed via statistical software. With the adoption of the Open Data Format, data users gain new opportunities to access a broader range of datasets.

A first version of the specification (https://git.soep.de/opendata/specification) is available since 2022.

More information