Data cleaning allows you to exclude certain elements of the language set that may otherwise distort the analysis.
The available options will vary depending on the data source you are uploading but may include the removal of:
Spam and promotions 🗑️
Duplicates and similar posts
Posts from bots, public figures and organisations 🤖🏢
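To illustrate what removing "duplicates and similar posts" can mean in practice, here is a minimal Python sketch. It is not the platform's own implementation: the function names, the 0.9 similarity threshold and the sample posts are all assumptions made for illustration.

```python
from difflib import SequenceMatcher


def normalise(text):
    """Lower-case the text and collapse runs of whitespace."""
    return " ".join(text.lower().split())


def drop_duplicates_and_similar(posts, threshold=0.9):
    """Keep the first occurrence of each post and drop later posts whose
    normalised text is at least `threshold` similar to a kept post.

    Hypothetical helper: the threshold value is an assumption.
    """
    kept = []
    for post in posts:
        candidate = normalise(post)
        is_similar = any(
            SequenceMatcher(None, candidate, normalise(existing)).ratio() >= threshold
            for existing in kept
        )
        if not is_similar:
            kept.append(post)
    return kept


# Made-up sample posts for illustration only.
posts = [
    "Great service, would recommend!",
    "great service,   would recommend!",
    "Great service, would recommend!!",
    "Delivery took two weeks.",
]
cleaned = drop_duplicates_and_similar(posts)
print(cleaned)
```

Each post is compared with every post already kept, which is fine for a small sample but would be slow on a large data set.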
Data cleaning can be enabled at the time of upload, or afterwards from within the Data Library.
At the time of upload
Select the applicable data source/format
Use the small arrow next to 'Show cleaning options' to display the available options
Select the cleaning options you wish to enable
From the Data Library
Use the checkboxes to select the data set(s) for which you want to enable or change cleaning options
Select the washing machine icon at the top of the screen
Select the desired options
Note: any comparisons built using a language set whose cleaning options you have changed will be updated automatically.