Dataflow kit

Turn websites data into structured data. Web scraper.

Under 10 Employees
Multiple Founders
Founders Code
Content
Open Source
Productivity
Programming
SaaS

The primary goal of DFK is to scrape web pages with simple point, click and extract interface. Our engine is stable enough to parse millions of records from several millions of fetched pages.

Switching to Headless Chrome for content fetching.

We used Splash from Scrapinghub as a Java Script Rendering service at. We’ve switched to Headless Chrome recently as is a really game changer in the scraping field.

Refactoring services

We were forced to refactor our services to meet requirement of processing of big volumes of data. Dataflow kit engine is stable enough to process several Millions of pages from a specified website and generate result successfully.

Started marketing campaign

Started spreading information about Dataflow Kit among Github Open source community members https://github.com/slotix/dataflowkit

The primary goal of DFK is to scrape web pages with simple point, click and extract interface. Our engine is stable enough to parse millions of records from several millions of fetched pages.