Simple text file obtained from manually scraping the web for the question "What is Machine Learning?".
The files contain the first paragraph/ page on the website's approach to answer the question. This data is not used for commercial purposes and is available to all.
This data is used in TAES to show how it can be used for plagiarism checking. The text files (*.txt) contain plain text and need no preprocessing to use. Simply read the file and assign the data to a string object.