Entity Extraction
Published by admin at May 17, 2016
Categories : Projects
Tags : Alchemy API, Crawling, Entity Extraction, Parsing, Web Scraping

We developed the crawler and the parsers for this project. The system crawls the web and stores the content in a multi-terabyte storage system. The parsers then mine the stored HTML for the relevant pieces of information, aggregate them, and generate a node for publishing.
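As a rough illustration of the parse-and-aggregate step, here is a minimal sketch in Python using only the standard library. The names `TextExtractor`, `extract_text`, and `build_node`, as well as the shape of the node dictionary, are assumptions made for this example and are not the project's actual code. It takes HTML that has already been crawled and stored, and does not perform any crawling itself.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text, skipping the bodies of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # > 0 while inside <script> or <style>
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style blocks.
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def extract_text(html):
    """Return the visible text of an HTML document as one string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def build_node(url, html):
    """Aggregate extracted pieces into a hypothetical publishable node."""
    return {"url": url, "text": extract_text(html)}
```

In a real pipeline the node would presumably carry the extracted entities as well as the raw text; the dictionary here only shows where that aggregation would happen.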

A few examples of the extraction tasks we have delivered in this domain:

  • Address Extraction
  • Country Guessing for a URL
  • Company Name Extraction
  • Phone/Email Extraction
  • Image Extraction
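
Two of the tasks listed above, phone/email extraction and country guessing for a URL, can be sketched with simple heuristics. The regular expressions, the small `CCTLD_TO_COUNTRY` table, and the function names below are all assumptions for illustration; they are deliberately loose and are not the rules used in the project. Country guessing here relies only on the country-code top-level domain, so generic domains such as `.com` return no guess.

```python
import re
from urllib.parse import urlparse

# Loose patterns for illustration; production rules would be stricter.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{6,}\d")

# Hypothetical, intentionally tiny mapping of country-code TLDs.
CCTLD_TO_COUNTRY = {
    "in": "India",
    "uk": "United Kingdom",
    "de": "Germany",
    "fr": "France",
}


def extract_emails(text):
    """Return the unique email-like strings found in text, sorted."""
    return sorted(set(EMAIL_RE.findall(text)))


def extract_phones(text):
    """Return phone-like strings in the order they appear in text."""
    return [match.group().strip() for match in PHONE_RE.finditer(text)]


def guess_country(url):
    """Guess a country from the URL's top-level domain, or return None."""
    host = urlparse(url).hostname or ""
    tld = host.rsplit(".", 1)[-1].lower()
    return CCTLD_TO_COUNTRY.get(tld)
```

These functions would typically run on the visible text of a page rather than on raw HTML, so that markup and script contents do not produce false matches.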

© 2019 DataBrains. All rights reserved.