WebHarvy scrapes data through a point-and-click interface and requires no coding knowledge. It uses a built-in browser to load websites, so structured data can be selected for scraping with a few mouse clicks.
WebHarvy automatically crawls multiple web pages and extracts data such as product listings or search results. Once the user points out the ‘link to load the next page’, WebHarvy automatically scrapes structured data from the following pages as well.
WebHarvy can scrape data from a list of links that lead to similar pages or listings within a website. With a single configuration, the user can scrape categories and subcategories within a website.
WebHarvy can extract images or their URLs. It automatically extracts multiple images from the product detail pages of e-commerce sites.
WebHarvy automatically detects patterns of data occurring in web pages. If the user needs to scrape a list of items (name, address, email, price, etc.) from a web page, WebHarvy extracts the required structured data without any additional configuration. WebHarvy can also scrape data by submitting input keywords to search forms. Input keywords can be submitted to one or more text fields to perform a search, and search-result data can be extracted for all combinations of the submitted keywords.
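To illustrate what "all combinations" of input keywords means, here is a minimal Python sketch. WebHarvy itself needs no code, and this is not how WebHarvy is implemented; the field names `what` and `where` and the keyword lists are hypothetical, chosen only for illustration.

```python
from itertools import product

# Hypothetical search form with two text fields and a keyword list for each.
keywords = {
    "what": ["plumber", "electrician"],
    "where": ["Boston", "Denver", "Austin"],
}

def keyword_combinations(fields):
    """Return one dict per search submission, covering every combination."""
    names = list(fields)
    return [dict(zip(names, values)) for values in product(*fields.values())]

searches = keyword_combinations(keywords)
for search in searches:
    print(search)
```

Two keywords for one field and three for the other yield six searches, one for each pairing.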
WebHarvy lets users apply regular expressions (RegEx) to the text or HTML source of web pages and scrape only the matching portion of the required data. This technique provides more flexibility when scraping structured data.
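As a rough idea of how a regular expression isolates the matching portion of a piece of text, consider the Python sketch below. The sample text, the price format, and the `extract_price` helper are assumptions made for illustration; in WebHarvy the expression would be entered in the application rather than written as code.

```python
import re

# Hypothetical snippet of page text; the price format is an assumption.
text = "Acme Kettle - Price: $24.99 (was $39.99) - In stock"

# Capture only the number that follows "Price: $".
PRICE_PATTERN = re.compile(r"Price:\s*\$(\d+(?:\.\d{2})?)")

def extract_price(page_text):
    """Return the captured price as a string, or None if nothing matches."""
    match = PRICE_PATTERN.search(page_text)
    return match.group(1) if match else None

price = extract_price(text)
print(price)
```

The capture group keeps just the current price and ignores the surrounding label and the older price.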
WebHarvy can easily be configured to perform tasks such as clicking links, selecting list or drop-down options, entering text into a field, scrolling the page, and opening popups.
WebHarvy can save the extracted structured data as an Excel, XML, CSV, JSON, or TSV file, or export it to an SQL database.
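For readers who want to picture what some of these output formats look like, the following Python sketch serializes the same records as CSV (or TSV), as JSON, and into an SQL table. It uses only the Python standard library, with SQLite standing in for an SQL database; the records, the field names, and the `products` table are hypothetical, and none of this reflects WebHarvy's internals.

```python
import csv
import io
import json
import sqlite3

# Hypothetical scraped rows; field names are assumptions for illustration.
rows = [
    {"name": "Acme Kettle", "price": "24.99"},
    {"name": "Acme Toaster", "price": "31.50"},
]

def to_csv(records, delimiter=","):
    # Serialize records as delimited text; pass a tab character for TSV.
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=list(records[0]),
        delimiter=delimiter,
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()

def to_json(records):
    # Serialize records as a JSON array of objects.
    return json.dumps(records, indent=2)

def to_sqlite(records, connection):
    # Insert records into a simple two-column table.
    connection.execute(
        "CREATE TABLE IF NOT EXISTS products (name TEXT, price TEXT)"
    )
    connection.executemany(
        "INSERT INTO products (name, price) VALUES (:name, :price)", records
    )
    connection.commit()

csv_text = to_csv(rows)
json_text = to_json(rows)
db = sqlite3.connect(":memory:")  # in-memory database, nothing is written to disk
to_sqlite(rows, db)
print(csv_text)
```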
To avoid being blocked by web servers, WebHarvy has the option to access target websites via proxy servers or a VPN.