Skip to main content

Manage data during research

Best practices for file formats

Before starting a project, it is important to think about file formats as this may have implications for the life of the data. Follow these best practices to reduce the chances of data loss from software or data obsolescence.

File formats when collecting data

  • Open and non-proprietary data are preferable
  • If data must be in a proprietary format, ensure that it can easily be converted to an open, non-proprietary format
  • Select formats commonly used by the research community

Further reading: Best practices for choosing a file format for acquisition (U.S. Geological Survey)

File formats when sharing / preserving data

  • Format should be open, non-proprietary, and machine-readable
  • Share multiple formats if the format used by research community is typically proprietary (e.g., MonaLisa_v1.psd AND MonaLisa_v1.tiff)
  • For proprietary files, indicate (using a README file) software/hardware needed to open files
  • If compression is necessary, use a lossless format

Further reading: Best practices for public data release formats (U.S. Geological Survey)

Recommended formats for sharing, reuse, and preservation

Type of data Recommended formats Acceptable formats
Tabular data
(with extensive metadata)

variable labels, code labels, defined missing values
.por (SPSS portable format) sav, .dta, .mdb,.accdb
Tabular data
(with minimal metadata)

column headings, variable names
.csv, .tab .txt, .xls, .xlsx, .mdb, .accdb, .dbf, .ods
Geospatial data
vector & raster data
.shp, .shx, .dbf, .prj, .sbx, .sbn, .tif, .tfw, .dwg, .gml .mdb, .mif, .kml, .dxf, .svg
Textual data .rtf, .txt, .xml .html, .doc, .docx
Image data .tif (TIFF 6.0) .jpeg, .jpg, .jp2, .gif, .tif, .tiff, .raw, .psd, .bmp, .png, .pdf
Audio data .flac .mp3, .aif, .wav
Video data .mp4, ogv, .ogg, .mj2 .avchd
Documentation and scripts .rtf, .pdf, .xhtml, .htm, .odt ..txt, .doc, .docx, .xls, .xlsx, .xml
Chemistry data
spectroscopy
.jdx (JCAMP)  

Sources: Recommended formats (UK Data Service); Data types & file formats (Oregon State University Libraries)

Further reading: Consult the annually updated Library of Congress Recommended Formats Statement for more information on recommended file formats.

Help and resources

Research data management consultations are available for Concordia faculty, students, and staff. Find out more about how librarians on the Library's RDM team can provide guidance. This service is part of Concordia's Institutional Research Data Management Strategy.

Back to top

© Concordia University