Skip to Main Content

Research Data Management (RDM)

File Naming

There are two general rules for file organization: Be consistent and Be descriptive. You want to make sure you and your colleagues can find anything you and they are looking for quickly. You'll need to figure out which specifics make the most sense to you and document your convention in a place everyone in your research group can follow. Here are some guidelines to include in your convention:

  • Choose 2-3 descriptors to identify the project or collection the item belongs to and what the specific item is. Have a standard for your research group so things can easily be found and shared.
  • Use capitals (camel case) or underscores instead of periods or spaces.
    • Example: surveyResponseData.csv or survey_response_data.csv
  • Use leading 0's when numbering your files. (000, 001...990, 991)
  • Use no more than 30 characters whenever possible
  • Use date format ISO 8601: YYYY-MM-DD
    • The year first format makes it easy to find newest/oldest files. Wikipedia's ISO 8601 page provides additional information on the date and time standard.
  • Avoid special characters in a file name. Common things to avoid are using spaces or ampersands (&).
  • Document your naming convention so you remember what it is and your project collaborators know what it is.

Penn Libraries at the University of Pennsylvania has created a  File Naming Formula Template to help you and your team create meaningful file names that follow best practices. Once you fill it out, print it out and put it in a prominent place in your workplace. If you work on collaborative files, appoint someone to hold your team accountable for their file naming practices.

Data Versioning

Version control is the strategy of tracking of changes and edits to files and directories. This allows you to revert to previous versions if you make a mistake or even delete something! This can be a key practice for success on complicated projects and on collaborative teams. 

Even if you're tracking changes with the software you're using, you should always keep a copy of the original unedited data available and save a new version when substantial changes are made. It's like saving your progress in a video game along the way so you don't have to go back to the beginning after coming across an unexpected challenge. 

There are two main versions of conducting version control: 

  • Manual Version Control - the process of personally saving versions of your files along the research process. This is good option for those who do not have files that cooperate with software version control (such as rich text files or media files). The file storage system (Box, One Drive) or software (MS Word) you use may have some built in version control, but that is not the main purpose of the tool. Be consistent about when you save another version and how you keep track of your system. 
  • Software Version Control Systems- systems specifically designed to version control code. These are more complex than manual version control, but are more powerful and integrate into your research process easier. These systems save just the changes to your files instead of a new copy for each version.  


Grateful acknowledgement to the University of Pennsylvania Penn Libraries for their permission to use and modify their template: Data Management Resources

MCW Libraries
8701 Watertown Plank Road
Milwaukee, WI 53226
(414) 955-8300

Contact Us
Locations & Hours
Send Us Your Comments