public class DocumentsGenerator
extends Object
Use this program to generate HTML documents with given characteristics.
The program takes a set of input documents and according to the given parameters
creates "plagiated" documents using them. The program works just as plagiators do,
it takes different parts of different documents and copy-pastes parts of them together,
slightly modifying them, optionally adding some original parts of text.
Usage:
java DocumentsGenerator input-dir output-dir documents paragraphs sentences words plagiarism-ratio word-changes
[paragraphs-deviation sentences-deviation words-deviation plagiarism-ratio-deviation word-changes-deviation] [-noprompt]
input-dir: specifies a path to the directory where input documents will be searched
output-dir: specifies a path to the directory where output documents will be placed
documents: number of output documents
paragraphs: mean value of paragraphs in output documents
paragraphs-deviation: standard deviation of paragraphs in output documents
sentences: mean value of sentences per paragraph in output documents
sentences-deviation: standard deviation of sentences per paragraph in output documents
words: mean value of words per sentence in output documents
words-deviation: standard deviation of words per sentence in output documents
plagiarism-ratio: 0-100 number meaning the mean value of plagiarised text amount in output documents
or, the number of text unit on the actual duplicity checking level that are plagiarised
ignored for the document level
word-changes-ratio: 0-100 number meaning the mean value of single word changes in a plagiarised text unit
noprompt: the application does not prompt the user for anything
- Author:
- Kateřina Dufková