Often, all of your data will be in one file and will need to be split out to build comparisons. Once you've uploaded your master data file into the platform, you can create subsets of your data (splits) one at a time or in bulk via the Dashboard.
The importance of metadata
While the text component of a data set will always be the central object of the analysis, associated metadata points (e.g. demographic details of survey respondents) can be used to split out a master data file.
This will allow you to build multiple comparisons using the same data, to get a 360° view of your audience.
In the example below, we can build at least four unique comparisons using this single data set. In each case, the text being analyzed is the same, but it is grouped differently, based on gender, age, or location.
Creating splits in bulk
E.g. creating a unique data set for each region from which you've received survey responses.
Bulk splitting can be used to create a maximum of 20 data sets at once.
In the Customer Dashboard, within the relevant project folder, select the tick box next to the data set you want to split.
Select the advanced split icon from the bulk action menu at the top of your data list.
On the next screen, select which pieces of metadata you want to use as the basis for creating your splits. For each metadata point selected, a unique data subset will be created that only includes text associated with that attribute.
Ensure the setting at the top of the right-hand pane is set to 'Split all results' and, if desired, rename the new data sets before clicking 'Split' to finish.
Tip: To keep your data organized, you can keep the splits in relevant folders. To do that, click 'Create new folder' then give it a name and click 'Create.' You can then use the checkboxes to select your data, click the three dots icon, select Move and choose the correct folder.
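For readers who think in code, the effect of 'Split all results' can be pictured as grouping text by each distinct metadata value. The sketch below is illustrative only and is not the platform's implementation: the row layout, the column names and the `split_all` helper are all hypothetical. The limit of 20 is taken from the maximum stated above.

```python
from collections import defaultdict

MAX_SPLITS = 20  # bulk-split limit stated in this article

# Hypothetical master data: free text plus metadata columns.
rows = [
    {"text": "Great service", "region": "North", "gender": "female"},
    {"text": "Too slow", "region": "South", "gender": "male"},
    {"text": "Friendly staff", "region": "North", "gender": "male"},
]

def split_all(rows, column):
    """Return one list of texts per distinct value found in `column`."""
    subsets = defaultdict(list)
    for row in rows:
        subsets[row[column]].append(row["text"])
    if len(subsets) > MAX_SPLITS:
        raise ValueError(f"{len(subsets)} subsets exceeds the limit of {MAX_SPLITS}")
    return dict(subsets)

# One subset per region, each containing only the text tagged with that region.
by_region = split_all(rows, "region")
print(by_region)
```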
Combining metadata points into a single split
E.g. group together survey responses to create wider segments, such as
In some situations, you might want to group together several metadata points and their associated text.
Follow the above approach but select only the metadata points you want to group together.
Once you've done this, change the setting at the top of the right-hand panel to 'Combine all' before clicking 'Split' to complete the action.
To rename, simply click the three dots icon, select Edit, give it a new name and click 'Save.'
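Conceptually, 'Combine all' pools the text from every selected metadata point into one data set instead of producing one data set per point. A minimal sketch of that idea follows, assuming a hypothetical row layout, hypothetical age bands and a hypothetical `combine_all` helper; it is not how the platform itself is built.

```python
# Hypothetical master data with an age-band metadata column.
rows = [
    {"text": "Love the app", "age": "18-24"},
    {"text": "Hard to navigate", "age": "25-34"},
    {"text": "Good value", "age": "55+"},
]

def combine_all(rows, column, values):
    """Return a single list of texts whose `column` value is any of `values`."""
    wanted = set(values)
    return [row["text"] for row in rows if row[column] in wanted]

# Two age bands merged into one wider segment.
under_35 = combine_all(rows, "age", ["18-24", "25-34"])
print(under_35)
```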
E.g. creating a subset of your data that only includes survey responses from respondents who are based in London and are female.
This operation currently supports 'AND' logic only. If you define multiple conditions, only the pieces of text that meet all of them will be included in your new data set.
Hover over the applicable data set and select 'Split' from the three-dot menu.
Use the dropdown menu to define a rule for your split then click 'Add Rule.'
Add additional rules by clicking 'Add another'. A preview of the data set size will be presented in the bottom-right corner of the modal. When you're done, click 'Finish it.'
Give the new data set a name and click 'Create the split.'
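The 'AND' behavior of multiple rules can be sketched as a filter that keeps a piece of text only when every rule matches. This is an illustration under assumptions, not the platform's code: the rows, the column names and the `split_with_rules` helper are hypothetical, and each rule is modeled as a simple (column, value) equality check.

```python
# Hypothetical master data with location and gender metadata.
rows = [
    {"text": "Great commute", "location": "London", "gender": "female"},
    {"text": "Rent is high", "location": "London", "gender": "male"},
    {"text": "Quiet streets", "location": "Leeds", "gender": "female"},
]

def split_with_rules(rows, rules):
    """Keep texts from rows that satisfy every (column, value) rule."""
    return [
        row["text"]
        for row in rows
        if all(row.get(column) == value for column, value in rules)
    ]

# Both rules must hold, so only the London AND female response is kept.
london_women = split_with_rules(
    rows, [("location", "London"), ("gender", "female")]
)
print(london_women)
```

Adding a rule can only keep the result the same size or shrink it, which is why the size preview in the modal is worth checking as you build up rules.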