Class RegexUtils


  • public class RegexUtils
    extends java.lang.Object
    Inspired from Nutch code class OutlinkExtractor. Apply regex to extract content
    • Constructor Summary

      Constructors 
      Constructor Description
      RegexUtils()  
    • Method Summary

      All Methods Static Methods Concrete Methods 
      Modifier and Type Method Description
      static java.util.List<java.lang.String> extractLinks​(java.lang.String content)
      Extract urls from plain text.
      • Methods inherited from class java.lang.Object

        equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Constructor Detail

      • RegexUtils

        public RegexUtils()
    • Method Detail

      • extractLinks

        public static java.util.List<java.lang.String> extractLinks​(java.lang.String content)
        Extract urls from plain text.
        Parameters:
        content - The plain text content to examine
        Returns:
        List of urls within found in the plain text