Managing and Collecting Your Data
Last updated
To monitor your scraping progress and access results, go to your Task Library. This is where you’ll see the current status of each task you’ve submitted.
Pending: Your task is waiting in the queue.
Running: Miners are actively working on your request.
To build a dataset for a single topic, click “Build Dataset” next to it.
To fulfill your entire task, click “Build All.”
After you click “Build All,” Gravity begins collecting and preparing data for the selected labels across X and Reddit. The task moves through three processing stages: Validating Data Sources, Collecting Available Data Sources, and Collecting Crawler Information.
Validating Data Sources: Gravity checks each selected topic (like r/tech or r/googlepixel) to ensure the source is reachable, active, and properly formatted for extraction.
Collecting Available Data Sources: This stage begins retrieving all accessible data from the validated sources.
Collecting Crawler Information: Gravity’s crawlers structure the data further.
Building a dataset takes several minutes. Once your dataset is ready, you’ll get an email notification. Return to your Task Library, where you’ll also see a “View Dataset” button in your task row. Click it to preview the data.
To download, click “Download.” Files are exported in CSV and Parquet formats.
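Once a file is downloaded, a quick local check can confirm that the export looks as expected. The sketch below is a minimal example, not part of Gravity itself: it uses only Python's standard library to report the column names and row count of a CSV export. The file name gravity_dataset.csv and the function name summarize_csv_export are hypothetical, and no particular column layout is assumed.

```python
import csv
from pathlib import Path


def summarize_csv_export(path):
    """Return (column_names, row_count) for a downloaded CSV export."""
    with Path(path).open(newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        # Count data rows; the header row is consumed by DictReader.
        row_count = sum(1 for _ in reader)
        # fieldnames comes from the header row (None for an empty file).
        columns = list(reader.fieldnames or [])
    return columns, row_count


if __name__ == "__main__":
    # Hypothetical file name; use the path of the file you downloaded.
    export_path = Path("gravity_dataset.csv")
    if export_path.exists():
        columns, row_count = summarize_csv_export(export_path)
        print(f"{row_count} rows, columns: {columns}")
    else:
        print(f"{export_path} not found; download the dataset first.")
```

The Parquet export can be inspected in a similar way with a third-party library, for example pandas.read_parquet, which needs a Parquet engine such as pyarrow installed.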
Tasks run for 7 days by default. To stop a task early, click “Cancel.”
If you cancel:
You can still preview or download any completed datasets.
You won’t be able to build new datasets from topics that are still in Pending status.