How do I split data sets?

Relative Insight includes a suite of tools in the Customer Dashboard to help you split data sets.

Written by Trish Pencarska
Updated today

Often, all of your data sits in one master file and needs to be split before you can build comparisons. Once you've uploaded your master data file to the platform, you can create subsets of your data (splits) one at a time or in bulk via the Dashboard.


The importance of metadata

While the text component of a data set will always be the central object of the analysis, associated metadata points (e.g. demographic details of survey respondents) can be used to split out a master data file.

This will allow you to build multiple comparisons using the same data, to get a 360° view of your audience.

In the example below, we can build at least four unique comparisons using this single data set. In each case, the text being analyzed is the same, but it is grouped differently, based on gender, age, or location.

Creating Instant Splits from Data Discovery

The easiest way to create segments is by using the Instant Splits feature. Using metadata charts in Data Discovery, you can create your target audience by selecting multiple characteristics.

  1. Head to Data Discovery and select the Interactive Metadata tab.

  2. Click on the part of the chart representing the characteristic you want to analyze (e.g. age range). The selection will appear in the Segment sidebar.

  3. On a different chart, select the second characteristic that corresponds to your target audience (e.g. country).

  4. Depending on the size of your data, you may want to select additional metadata points to narrow down your audience.

  5. Once you are happy with your selection, in the Segment sidebar, name your split and click ‘Create & apply’.

Creating splits in bulk

E.g. creating a unique data set for each region from which you've received survey responses.

Bulk splitting can be used to create a maximum of 20 data sets at once.

  1. In the Customer Dashboard, within the relevant project folder, select the tick box next to the data set you want to split.

  2. Select the advanced split icon from the bulk action menu at the top of your data list.

  3. On the next screen, select which pieces of metadata you want to use as the basis for creating your splits. For each metadata point selected, a unique data subset will be created that only includes text associated with that attribute.

  4. Ensure the setting at the top of the right-hand pane is set to 'Split all results' and, if desired, rename the new data sets before clicking 'Split' to finish.

  Tip: To keep your data organized, store your splits in relevant folders. Click 'Create new folder', give it a name and click 'Create.' You can then use the checkboxes to select your data, click the three dots icon, select Move and choose the correct folder.
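For readers who think in code, the 'Split all results' behavior can be sketched roughly as follows. This is an illustration only, not the platform's implementation: the function name `split_all`, the row layout and the sample responses are all hypothetical, and only the limit of 20 data sets comes from this article.

```python
# Illustrative sketch only: split_all and the sample rows are hypothetical,
# not part of the Relative Insight platform.
MAX_SPLITS = 20  # the article states bulk splitting creates at most 20 data sets at once


def split_all(rows, field, selected_values):
    """Return one subset of text per selected metadata value."""
    selected = list(dict.fromkeys(selected_values))  # de-duplicate, keep order
    if len(selected) > MAX_SPLITS:
        raise ValueError(f"at most {MAX_SPLITS} splits can be created at once")
    subsets = {value: [] for value in selected}
    for row in rows:
        value = row.get(field)
        if value in subsets:
            # Each subset only includes text associated with that attribute.
            subsets[value].append(row["text"])
    return subsets


responses = [
    {"text": "Delivery was quick", "region": "North"},
    {"text": "Prices are too high", "region": "South"},
    {"text": "Friendly staff", "region": "North"},
    {"text": "Hard to find parking", "region": "East"},
]

splits = split_all(responses, "region", ["North", "South"])
for name, texts in splits.items():
    print(name, texts)
```

Here the "East" response is left out because that metadata point was not selected, which mirrors choosing only some attributes in step 3.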

Combining metadata points into a single split

E.g. group together survey responses to create wider segments, such as

In some situations, you might want to group together several metadata points and their associated text.

  1. Follow the bulk-splitting steps above, but select only the metadata points you want to group together.

  2. Once you've done this, change the setting at the top of the right-hand panel to 'Combine all' before clicking 'Split' to complete the action.

  3. To rename the combined data set, click the three dots icon, select Edit, give it a new name and click 'Save.'
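Conceptually, 'Combine all' pools the text from every selected metadata point into one data set instead of creating one data set per point. A minimal sketch, assuming a made-up row layout and a hypothetical helper named `combine_all` (neither is part of the platform):

```python
# Illustrative sketch only: combine_all and the sample rows are hypothetical.
def combine_all(rows, field, selected_values):
    """Return a single list of text for rows matching any selected value."""
    wanted = set(selected_values)
    return [row["text"] for row in rows if row.get(field) in wanted]


responses = [
    {"text": "Delivery was quick", "region": "North"},
    {"text": "Prices are too high", "region": "South"},
    {"text": "Friendly staff", "region": "North"},
    {"text": "Hard to find parking", "region": "East"},
]

# One wider segment built from two metadata points.
combined = combine_all(responses, "region", ["North", "East"])
print(combined)
```

The difference from 'Split all results' is only in the output shape: the same rows are selected, but they end up in one combined data set.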

Multi-condition splits

E.g. creating a subset of your data that only includes survey responses from respondents who are female and based in London.

This operation currently supports 'AND' logic only. If you define multiple conditions, only the pieces of text that meet all of them will be included in your new data set.

  1. Hover over the applicable data set and select ‘Split’ from the three-dot menu.

  2. Use the dropdown menu to define a rule for your split then click 'Add Rule.'

  3. Add additional rules by clicking 'Add another'. A preview of the data set size will be presented in the bottom-right corner of the modal. When you're done, click 'Finish it.'

  4. Give the new data set a name and click 'Create the split.'
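The 'AND' logic described above can be pictured with a short sketch. The helper `multi_condition_split`, the field names and the sample rows are assumptions made for illustration; the platform applies the rules through the Split modal, not through code.

```python
# Illustrative sketch only: multi_condition_split and the sample rows are hypothetical.
def multi_condition_split(rows, rules):
    """Keep text only from rows that satisfy every rule (AND logic).

    rules maps a metadata field to the value it must equal.
    With no rules at all, every row is kept.
    """
    return [
        row["text"]
        for row in rows
        if all(row.get(field) == value for field, value in rules.items())
    ]


responses = [
    {"text": "Great service", "location": "London", "gender": "Female"},
    {"text": "Too expensive", "location": "London", "gender": "Male"},
    {"text": "Easy to use", "location": "Leeds", "gender": "Female"},
    {"text": "Would recommend", "location": "London", "gender": "Female"},
]

london_women = multi_condition_split(
    responses, {"location": "London", "gender": "Female"}
)
print(london_women)
```

Each added rule can only keep the subset the same size or shrink it, which is why the data set size preview in the modal is worth checking as you add conditions.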
