Download WebHarvy Software

WebHarvy scrapes data with point and clicks interface without any coding knowledge. WebHarvy uses an inbuilt browser to load websites for scraping structured data with few mouse clicks.

WebHarvy automatically crawls and extracts data like product listings or search results from multiple web pages. WebHarvy Web Scraper automatically scrapes structured data from web pages when the user points out the ‘link to load the next page’.

Image result for webharvy

WebHarvy Web Scraper scrape data from a list of links of similar pages/listings within a website. This single configuration process allows the user to scrape categories and subcategories within websites.

WebHarvy can easily scrape/extract Image data or image URLs. WebHarvy automatically extracts multiple images from product details pages of e-commerce sites.

WebHarvy automatically classifies data patterns occurring in web pages. If the user needs to scrape/extract a list of items (name, address, email, price, etc.) from a web page, WebHarvy scrapes required structured data without any additional configuration. WebHarvy scrapes data by submitting input keywords to search forms. Any type of input keywords or text fields can be submitted to perform a search. Submitted input keywords data can be extracted for all combinations.

Image result for webharvy

WebHarvy lets users apply Regular Expressions (RegEx) on Text or HTML source of web pages to scrape the matching portion of required data. This powerful and unique technique of WebHarvy provides more flexibility for scraping structured data.

For clicking Links, selecting list/drop-down options, input text to a field, scrolling page and opening popups, WebHarvy is easily configured to perform such tasks.

WebHarvy can save the extracted structured data as an Excel, XML, CSV, JSON or TSV file and export the scraped data to the SQL database.

From being blocked by web servers, WebHarvy has the option to access target websites via proxy servers or VPN.

Image result for webharvy

WebHarvy is highly supported by JavaScript and users can run his own JavaScript code in the browser before scraping/extracting data. This process is used to interact with page elements, modify DOM or invoke JavaScript functions.

Leave a Comment

Your email address will not be published. Required fields are marked *

Translate »