… 152
Recall-Oriented Learning of Named Entities in Arabic Wikipedia
Behrang Mohit, Nathan Schneider, Rishav Bhowmick, Kemal Oflazer and Noah A. Smith . . . 162
Tree Representations in Probabilistic Models for Extended Named Entities Detection
Marco Dina…
…Solr or Elasticsearch. Grub was an open-source distributed search crawler that Wikia Search used to crawl the web. Heritrix is the Internet Archive's archival-quality crawler, designed for archiving periodic snapshots of a large portion of the Web. It was written in Java. ht:/…