pl.edu.icm.yadda.analysis.relations.general2sesame.auxil
Class FeatureOccurenceCounter_OnSameSurnames

java.lang.Object
  extended by pl.edu.icm.yadda.analysis.relations.PersonDirectoryCreator
      extended by pl.edu.icm.yadda.analysis.relations.general2sesame.auxil.FeatureOccurenceCounter_OnSameSurnames

public class FeatureOccurenceCounter_OnSameSurnames
extends PersonDirectoryCreator

Creates a person directory using a number of so-called Disambiguator s.

Author:
Lukasz Bolikowski (bolo@icm.edu.pl)

Field Summary
 
Fields inherited from class pl.edu.icm.yadda.analysis.relations.PersonDirectoryCreator
backend, regularWeighedDisambiguators, threshold
 
Constructor Summary
FeatureOccurenceCounter_OnSameSurnames()
           
 
Method Summary
protected  ArrayList<String> cachedOccurenceCounter(DisambiguationInterpreter di, List<String> contributionIds, int size)
           
protected  ArrayList<String> calculateOccurences(DisambiguationInterpreter di, List<String> contributionIds, int size)
           
 void createOccurenceCountsCSVFile()
          Creates a person directory.
 void createPersonDirectory(Object[] objects)
          Creates a person directory.
static String generateAffinityId(List<String> strings)
           
static String generatePersonId(List<String> contributionIds)
          Generates a person identifier based on a given list of contributions.
protected  ArrayList<String> occurenceCounter(DisambiguationInterpreter di, List<String> contributionIds, int size)
           
protected  void persistResults(Map<Integer,List<String>> clusters, Object[] objects)
           
protected  ArrayList<String> regularOccurenceCounter(DisambiguationInterpreter di, List<String> contributionIds, int size)
           
 
Methods inherited from class pl.edu.icm.yadda.analysis.relations.PersonDirectoryCreator
calculateAffinity, createPersonDirectory, getBackend, getClusterizer, getThreshold, getWeighedDisambiguators, main, setBackend, setClusterizer, setThreshold, setWeighedDisambiguators
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

FeatureOccurenceCounter_OnSameSurnames

public FeatureOccurenceCounter_OnSameSurnames()
Method Detail

createOccurenceCountsCSVFile

public void createOccurenceCountsCSVFile()
                                  throws Exception
Creates a person directory. Processes contributions group-by-group. Grouping is provided by the person directory backend. For each group, calls the configured Disambiguators in order to assess similarity of contributions. Next, calls the configured Clusterizer to identify persons. Finally, persons are stored in the directory via the configured PersonDirectoryBackend.

Throws:
Exception

createPersonDirectory

public void createPersonDirectory(Object[] objects)
                           throws Exception
Creates a person directory. Processes contributions group-by-group. Grouping is provided by the person directory backend. For each group, calls the configured Disambiguators in order to assess similarity of contributions. Next, calls the configured Clusterizer to identify persons. Finally, persons are stored in the directory via the configured PersonDirectoryBackend.

Overrides:
createPersonDirectory in class PersonDirectoryCreator
Throws:
Exception

occurenceCounter

protected ArrayList<String> occurenceCounter(DisambiguationInterpreter di,
                                             List<String> contributionIds,
                                             int size)

regularOccurenceCounter

protected ArrayList<String> regularOccurenceCounter(DisambiguationInterpreter di,
                                                    List<String> contributionIds,
                                                    int size)
                                             throws org.openrdf.repository.RepositoryException,
                                                    org.openrdf.query.QueryEvaluationException,
                                                    org.openrdf.query.MalformedQueryException
Throws:
org.openrdf.repository.RepositoryException
org.openrdf.query.QueryEvaluationException
org.openrdf.query.MalformedQueryException

cachedOccurenceCounter

protected ArrayList<String> cachedOccurenceCounter(DisambiguationInterpreter di,
                                                   List<String> contributionIds,
                                                   int size)
                                            throws org.openrdf.repository.RepositoryException,
                                                   org.openrdf.query.MalformedQueryException,
                                                   org.openrdf.query.QueryEvaluationException
Throws:
org.openrdf.repository.RepositoryException
org.openrdf.query.MalformedQueryException
org.openrdf.query.QueryEvaluationException

calculateOccurences

protected ArrayList<String> calculateOccurences(DisambiguationInterpreter di,
                                                List<String> contributionIds,
                                                int size)
                                         throws org.openrdf.repository.RepositoryException,
                                                org.openrdf.query.MalformedQueryException,
                                                org.openrdf.query.QueryEvaluationException
Throws:
org.openrdf.repository.RepositoryException
org.openrdf.query.MalformedQueryException
org.openrdf.query.QueryEvaluationException

persistResults

protected void persistResults(Map<Integer,List<String>> clusters,
                              Object[] objects)
                       throws Exception
Overrides:
persistResults in class PersonDirectoryCreator
Throws:
Exception

generatePersonId

public static String generatePersonId(List<String> contributionIds)
Generates a person identifier based on a given list of contributions.

Parameters:
contributionIds - List of contributions by a given person.
Returns:
Person identifier.

generateAffinityId

public static String generateAffinityId(List<String> strings)


Copyright © 2002-2012 ICM, Warsaw University. All Rights Reserved.