Package | Description |
---|---|
org.egothor.core |
This package concentrates the core data objects and interfaces.
|
org.egothor.duplicity.algorithm |
This package contains top-level classes that implement the duplicity checking algorithm.
|
org.egothor.duplicity.datastructure |
This package contains datastructures needed in the duplicity checking algorithm.
|
org.egothor.duplicity.file |
This package contains implementation of the files needed the duplicity checking algorithm.
|
org.egothor.duplicity.visualization |
This package contains classes implementing the visualization of the duplicities found in a document by the duplicity checking algorithm.
|
Modifier and Type | Method and Description |
---|---|
DocumentUnitID |
DocumentData.getDocumentUnitID()
Get DocumentUnitID object representing this DocumentData.
|
Modifier and Type | Method and Description |
---|---|
Set<DocumentUnitID> |
TankerImplDuplicityChecker.append(BarrelReader barrelReader,
boolean omitDuplicates,
boolean visualizeDuplicities,
boolean printDuplicitiesToCsv)
Appends a reader to this tanker's duplicity checker.
|
Set<DocumentUnitID> |
DuplicityChecker.append(BarrelReader br,
boolean omitDuplicates,
boolean visualizeDuplicities,
boolean printDuplicitiesToCsv) |
Modifier and Type | Method and Description |
---|---|
static DocumentUnitID |
DocumentUnitID.getInstance(long documentUID,
int documentDBRevision)
This is the recommended way to create an instance.
|
static DocumentUnitID |
DocumentUnitID.load(DataInputStream dis) |
Modifier and Type | Method and Description |
---|---|
boolean |
JaccardCoeficient.isRelatedTo(Set<DocumentUnitID> set)
Returns true, if one of the documents related to this Jaccard coeficient
is present in the given set of documents.
|
boolean |
UnitPair.isRelatedTo(Set<DocumentUnitID> set)
Returns true, if one of the documents from this pair
is present in the given set of documents.
|
boolean |
UnitPermutatedMin.isRelatedTo(Set<DocumentUnitID> set)
Returns true, if the document is present in the given set of documents.
|
Modifier and Type | Method and Description |
---|---|
Map<DocumentUnitID,Double> |
JaccardCoeficientsFile.markDuplicates(List<DocumentData> docs) |
Modifier and Type | Method and Description |
---|---|
Map<TextUnitID,JaccardCoeficient> |
JaccardCoeficientsFile.filterRelevantForDocument(DocumentUnitID doc)
Filter from the file only the entries relevant for given document.
|
Modifier and Type | Method and Description |
---|---|
abstract boolean |
DuplicityCheckingFileElement.isRelatedTo(Set<DocumentUnitID> set) |
void |
CommonSimilarUnitPairsFile.remove(Set<DocumentUnitID> toRemove)
Removes all occurences of documents given in the set from the file.
|
void |
JaccardCoeficientsFile.remove(Set<DocumentUnitID> toRemove)
Removes all occurences of documents given in the set from the file.
|
void |
PermutatedMinsFile.remove(Set<DocumentUnitID> toRemove)
Removes all occurences of documents given in the set from the file.
|
Modifier and Type | Method and Description |
---|---|
static DocumentDuplicities |
DocumentDuplicities.createNew(DocumentUnitID docID,
DocumentData docMeta,
JaccardCoeficientsFile jcf,
TankerImplSecure tanker)
The recommended way to create new instance of DocumentDuplicities child class.
|
Constructor and Description |
---|
DocumentDuplicitiesDocumentLevel(DocumentUnitID docID,
DocumentData docMeta,
JaccardCoeficientsFile jcf,
TankerImplSecure tanker) |
DocumentDuplicitiesParagraphLevel(DocumentUnitID docID,
DocumentData docMeta,
JaccardCoeficientsFile jcf,
TankerImplSecure tanker) |
DocumentDuplicitiesSentenceLevel(DocumentUnitID docID,
DocumentData docMeta,
JaccardCoeficientsFile jcf,
TankerImplSecure tanker) |
Copyright © 2016 Egothor. All Rights Reserved.