pl.edu.icm.yadda.analysis.relations.general2sesame.auxil
Class FeatureOccurenceCounter_OnSameSurnames
java.lang.Object
pl.edu.icm.yadda.analysis.relations.PersonDirectoryCreator
pl.edu.icm.yadda.analysis.relations.general2sesame.auxil.FeatureOccurenceCounter_OnSameSurnames
public class FeatureOccurenceCounter_OnSameSurnames
- extends PersonDirectoryCreator
Creates a person directory using a number of so-called Disambiguator
s.
- Author:
- Lukasz Bolikowski (bolo@icm.edu.pl)
Method Summary |
protected java.util.ArrayList<java.lang.String> |
cachedOccurenceCounter(DisambiguationInterpreter di,
java.util.List<java.lang.String> contributionIds,
int size)
|
protected java.util.ArrayList<java.lang.String> |
calculateOccurences(DisambiguationInterpreter di,
java.util.List<java.lang.String> contributionIds,
int size)
|
void |
createOccurenceCountsCSVFile()
Creates a person directory. |
void |
createPersonDirectory(java.lang.Object[] objects)
Creates a person directory. |
static java.lang.String |
generateAffinityId(java.util.List<java.lang.String> strings)
|
static java.lang.String |
generatePersonId(java.util.List<java.lang.String> contributionIds)
Generates a person identifier based on a given list of contributions. |
protected java.util.ArrayList<java.lang.String> |
occurenceCounter(DisambiguationInterpreter di,
java.util.List<java.lang.String> contributionIds,
int size)
|
protected void |
persistResults(java.util.Map<java.lang.Integer,java.util.List<java.lang.String>> clusters,
java.lang.Object[] objects)
|
protected java.util.ArrayList<java.lang.String> |
regularOccurenceCounter(DisambiguationInterpreter di,
java.util.List<java.lang.String> contributionIds,
int size)
|
Methods inherited from class pl.edu.icm.yadda.analysis.relations.PersonDirectoryCreator |
calculateAffinity, createPersonDirectory, getBackend, getClusterizer, getThreshold, getWeighedDisambiguators, main, setBackend, setClusterizer, setThreshold, setWeighedDisambiguators |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
FeatureOccurenceCounter_OnSameSurnames
public FeatureOccurenceCounter_OnSameSurnames()
createOccurenceCountsCSVFile
public void createOccurenceCountsCSVFile()
throws java.lang.Exception
- Creates a person directory. Processes contributions group-by-group.
Grouping is provided by the person directory backend. For each group,
calls the configured
Disambiguator
s in order to assess similarity
of contributions. Next, calls the configured Clusterizer
to
identify persons. Finally, persons are stored in the directory via the
configured PersonDirectoryBackend
.
- Throws:
java.lang.Exception
createPersonDirectory
public void createPersonDirectory(java.lang.Object[] objects)
throws java.lang.Exception
- Creates a person directory. Processes contributions group-by-group.
Grouping is provided by the person directory backend. For each group,
calls the configured
Disambiguator
s in order to assess similarity
of contributions. Next, calls the configured Clusterizer
to
identify persons. Finally, persons are stored in the directory via the
configured PersonDirectoryBackend
.
- Overrides:
createPersonDirectory
in class PersonDirectoryCreator
- Throws:
java.lang.Exception
occurenceCounter
protected java.util.ArrayList<java.lang.String> occurenceCounter(DisambiguationInterpreter di,
java.util.List<java.lang.String> contributionIds,
int size)
regularOccurenceCounter
protected java.util.ArrayList<java.lang.String> regularOccurenceCounter(DisambiguationInterpreter di,
java.util.List<java.lang.String> contributionIds,
int size)
throws org.openrdf.repository.RepositoryException,
org.openrdf.query.QueryEvaluationException,
org.openrdf.query.MalformedQueryException
- Throws:
org.openrdf.repository.RepositoryException
org.openrdf.query.QueryEvaluationException
org.openrdf.query.MalformedQueryException
cachedOccurenceCounter
protected java.util.ArrayList<java.lang.String> cachedOccurenceCounter(DisambiguationInterpreter di,
java.util.List<java.lang.String> contributionIds,
int size)
throws org.openrdf.repository.RepositoryException,
org.openrdf.query.MalformedQueryException,
org.openrdf.query.QueryEvaluationException
- Throws:
org.openrdf.repository.RepositoryException
org.openrdf.query.MalformedQueryException
org.openrdf.query.QueryEvaluationException
calculateOccurences
protected java.util.ArrayList<java.lang.String> calculateOccurences(DisambiguationInterpreter di,
java.util.List<java.lang.String> contributionIds,
int size)
throws org.openrdf.repository.RepositoryException,
org.openrdf.query.MalformedQueryException,
org.openrdf.query.QueryEvaluationException
- Throws:
org.openrdf.repository.RepositoryException
org.openrdf.query.MalformedQueryException
org.openrdf.query.QueryEvaluationException
persistResults
protected void persistResults(java.util.Map<java.lang.Integer,java.util.List<java.lang.String>> clusters,
java.lang.Object[] objects)
throws java.lang.Exception
- Overrides:
persistResults
in class PersonDirectoryCreator
- Throws:
java.lang.Exception
generatePersonId
public static java.lang.String generatePersonId(java.util.List<java.lang.String> contributionIds)
- Generates a person identifier based on a given list of contributions.
- Parameters:
contributionIds
- List of contributions by a given person.
- Returns:
- Person identifier.
generateAffinityId
public static java.lang.String generateAffinityId(java.util.List<java.lang.String> strings)
Copyright © 2002-2012 ICM, Warsaw University. All Rights Reserved.