public class DuplicityChecker extends Object implements Status
The method
createDuplicityCheckingFilesForBarrel(org.egothor.core.BarrelReader, java.lang.String)
computes the duplicity checking files for a single barrel. The method
mergeDuplicityCheckingFiles(java.lang.String, java.lang.String, java.util.Set<org.egothor.duplicity.datastructure.DocumentUnitID>)
merges the duplicity checking files of two barrels.
The structure of the duplicity checking algorithm files on the filesystem is as follows:
dupliticity (directory)
|--> temp (directory) - contains temporary files for the classes SimilarUnitPairsTempFile and AllSimilarUnitPairsFile
|--> fdpiX files - correspond to PermutatedMinsFile
|--> fhpiX files - correspond to SimilarUnitPairsFile
     - these files are no longer computed
This class has a state consisting of the seed of the PermutatedMinsFiller object and the empty flag.
Constructor and Description |
---|
DuplicityChecker() |
Modifier and Type | Method and Description |
---|---|
Set<DocumentUnitID> | append(BarrelReader br, boolean omitDuplicates, boolean visualizeDuplicities, boolean printDuplicitiesToCsv) |
void | close() |
void | commit() |
void | destroy() |
String | getDuplicityCheckingMainDir() |
String | getDuplicityCheckingReportDir() |
String | getDuplicityCheckingTempDir() |
Properties | getState(Properties p) Return the state of this object. |
void | initialize(String indexMainDir, TankerImplSecure tanker) |
boolean | isEmpty() |
void | removeDoc(long uid, int docDBRevision) |
void | setState(Properties state) Set this object with the given properties. |
public boolean isEmpty()
public String getDuplicityCheckingMainDir()
public String getDuplicityCheckingReportDir()
public String getDuplicityCheckingTempDir()
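The directory layout described in the class documentation can be sketched with plain path arithmetic. This is a minimal illustration, not Egothor code: the class `DuplicityLayoutSketch` and its methods are hypothetical, and it assumes that the `dupliticity` directory sits directly under the index directory and that the `X` in `fdpiX` is a non-negative integer. The location of the report directory is not described in the source, so it is left out.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper, not part of Egothor: it only mirrors the directory
// layout described in the class documentation.
final class DuplicityLayoutSketch {

    // Directory names copied verbatim from the class description.
    static final String MAIN_DIR_NAME = "dupliticity";
    static final String TEMP_DIR_NAME = "temp";

    // Assumption: the main directory sits directly under the index directory.
    static Path mainDir(String indexMainDir) {
        return Paths.get(indexMainDir, MAIN_DIR_NAME);
    }

    // The temp directory is a child of the main directory.
    static Path tempDir(String indexMainDir) {
        return mainDir(indexMainDir).resolve(TEMP_DIR_NAME);
    }

    // Assumption: the X in "fdpiX" is a non-negative integer index.
    static Path permutatedMinsFile(String indexMainDir, int x) {
        if (x < 0) {
            throw new IllegalArgumentException("x must be non-negative");
        }
        return mainDir(indexMainDir).resolve("fdpi" + x);
    }

    public static void main(String[] args) {
        System.out.println(tempDir("index"));
        System.out.println(permutatedMinsFile("index", 0));
    }
}
```

The real getters may compute these paths differently; the sketch only makes the documented tree concrete.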
public void initialize(String indexMainDir, TankerImplSecure tanker) throws IOException
Parameters: indexMainDir, tanker
Throws: IOException
public void commit()
public void close()
public Properties getState(Properties p)
public void setState(Properties state)
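The class description says the state consists of a seed and an empty flag, and `getState`/`setState` exchange it through a `Properties` object. A round trip of that kind might look like the sketch below. Everything in it is an assumption made for illustration: the class `CheckerStateSketch`, the property keys, and the use of `long` for the seed are not taken from Egothor.

```java
import java.util.Properties;

// Hypothetical stand-in, not the Egothor implementation.
final class CheckerStateSketch {

    // Assumed property keys; the real key names are not documented.
    static final String KEY_SEED = "duplicity.seed";
    static final String KEY_EMPTY = "duplicity.empty";

    private long seed;      // assumed to be a long
    private boolean empty;  // true while nothing has been appended

    CheckerStateSketch(long seed, boolean empty) {
        this.seed = seed;
        this.empty = empty;
    }

    // Writes the state into the given Properties and returns it.
    Properties getState(Properties p) {
        p.setProperty(KEY_SEED, Long.toString(seed));
        p.setProperty(KEY_EMPTY, Boolean.toString(empty));
        return p;
    }

    // Restores the state; missing keys fall back to seed 0 and empty = true.
    void setState(Properties state) {
        seed = Long.parseLong(state.getProperty(KEY_SEED, "0"));
        empty = Boolean.parseBoolean(state.getProperty(KEY_EMPTY, "true"));
    }

    long seed() {
        return seed;
    }

    boolean isEmpty() {
        return empty;
    }
}
```

Storing both values as strings keeps the `Properties` object compatible with `Properties.store` and `Properties.load`, which only handle string entries.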
public Set<DocumentUnitID> append(BarrelReader br, boolean omitDuplicates, boolean visualizeDuplicities, boolean printDuplicitiesToCsv) throws DuplicityCheckingException
Parameters: br, omitDuplicates, visualizeDuplicities, printDuplicitiesToCsv
Throws: DuplicityCheckingException
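The methods documented so far suggest a lifecycle, which the following sketch makes explicit. The call order (initialize, append, commit, then close in a `finally` block) is an assumption; the source does not document it, nor what the set returned by `append` contains. The `Checker` interface is a hypothetical stand-in that mirrors the documented signatures, with `Object` replacing the Egothor types and `Exception` replacing `DuplicityCheckingException`, so that the example compiles on its own.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Set;

final class LifecycleSketch {

    // Hypothetical stand-in for the documented methods; Object replaces the
    // Egothor types (BarrelReader, TankerImplSecure, DocumentUnitID).
    interface Checker {
        void initialize(String indexMainDir, Object tanker) throws IOException;

        Set<Object> append(Object barrelReader, boolean omitDuplicates,
                           boolean visualizeDuplicities, boolean printDuplicitiesToCsv)
                throws Exception; // stands in for DuplicityCheckingException

        void commit();

        void close();
    }

    // Assumed call order: initialize, append, commit, and close in all cases.
    static Set<Object> run(Checker checker, String indexMainDir,
                           Object tanker, Object barrel) throws Exception {
        checker.initialize(indexMainDir, tanker);
        try {
            Set<Object> result = checker.append(barrel, true, false, false);
            checker.commit();
            return result;
        } finally {
            checker.close();
        }
    }

    // Recording fake used only to demonstrate the call order.
    static final class RecordingChecker implements Checker {
        final List<String> calls = new ArrayList<>();

        public void initialize(String indexMainDir, Object tanker) {
            calls.add("initialize");
        }

        public Set<Object> append(Object barrelReader, boolean omitDuplicates,
                                  boolean visualizeDuplicities, boolean printDuplicitiesToCsv) {
            calls.add("append");
            return Collections.emptySet();
        }

        public void commit() {
            calls.add("commit");
        }

        public void close() {
            calls.add("close");
        }
    }
}
```

Closing in a `finally` block means resources are released even when `append` fails; whether `commit` should be skipped on failure in the real class is not stated in the source.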
public void destroy()
public void removeDoc(long uid, int docDBRevision)
Parameters: uid, docDBRevision
Copyright © 2016 Egothor. All Rights Reserved.