Interface DocumentSelector


  • public interface DocumentSelector
    Interface for different document selection strategies for purposes like embedded document extraction by a ContainerExtractor instance. An implementation of this interface defines some specific selection criteria to be applied against the document metadata passed to the select(Metadata) method.
    Since:
    Apache Tika 0.8
    • Method Detail

      • select

        boolean select​(Metadata metadata)
        Checks if a document with the given metadata matches the specified selection criteria.
        Parameters:
        metadata - document metadata
        Returns:
        true if the document matches the selection criteria, false otherwise