Package org.mycore.mets.solr
Class MCRSolrAltoExtractor
java.lang.Object
org.mycore.mets.solr.MCRSolrAltoExtractor
- All Implemented Interfaces:
MCRSolrFileIndexAccumulator
Extract content and word coordinates of ALTO XML and adds it to the alto_words and alto_content field.
- Author:
- Matthias Eichner
-
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionvoid
accumulate
(org.apache.solr.common.SolrInputDocument document, Path filePath, BasicFileAttributes attributes) Adds additional information to a File.
-
Constructor Details
-
MCRSolrAltoExtractor
public MCRSolrAltoExtractor()
-
-
Method Details
-
accumulate
public void accumulate(org.apache.solr.common.SolrInputDocument document, Path filePath, BasicFileAttributes attributes) throws IOException Description copied from interface:MCRSolrFileIndexAccumulator
Adds additional information to a File.- Specified by:
accumulate
in interfaceMCRSolrFileIndexAccumulator
- Parameters:
document
- which holds the informationfilePath
- to the file in a derivateattributes
- of the file in a derivate- Throws:
IOException
-