Description
π§ Who Is This Agent For?
This intelligent automation is built for:
βοΈ Virtual Assistants & Researchers
To extract product information like book title and price without manual data entry
βοΈ Developers & Automation Specialists
As a plug-and-play template for web scraping and data pipeline automation
βοΈ Data Analysts
To get ready-to-use, clean, sorted data from product catalogs for analysis
βοΈ E-commerce Businesses
To monitor pricing trends, competitor listings, or create internal product datasets
β What Problem Does It Solve?
Extracting structured data from websites manually is slow, error-prone, and repetitive. This workflow addresses:
β³ Manual Data Entry
Replaces copy-paste tasks with automatic data extraction
π§© Inconsistent or Messy Data
Ensures structured, uniform output every time
π Time Wastage on Recurring Tasks
Handles ongoing scraping needs with zero manual effort
π No Automation
Adds end-to-end delivery β from scraping to CSV email in one flow
π§Ύ Limited Data Delivery
Delivers the final CSV via email, ready for direct use or further processing
β The Solution
This smart workflow scrapes catalog-style websites (e.g., book stores) and:
π§ Scrapes page content via Dumpling AI
π Extracts product details using HTML & CSS selectors
π’ Sorts data based on fields like price or title
π Converts output into a structured CSV file
π© Delivers the file via Gmail automatically
βοΈ How It Works β The Flow
π₯ 1. Trigger: Watch Google Sheet
The workflow starts when a new URL is added to a monitored Google Sheet
π 2. Scrape Webpage HTML
A Dumpling AI node fetches cleaned HTML from the target webpage
π 3. Extract Book Entries
HTML node extracts book blocks using .row > li as the main selector
π 4. Split Individual Books
Array of HTML blocks is split so each book can be processed independently
π·οΈ 5. Extract Book Title & Price
CSS selectors (h3 > a and .price_color) extract title and price from each book entry
π 6. Sort by Price
Books are sorted in descending order using the price field
π§Ύ 7. Convert to CSV
The sorted data is transformed into a downloadable CSV file
π§ 8. Send Email with CSV Attachment
Email is sent via Gmail node, attaching the CSV and adding a custom message
π οΈ Customization Options
π§© Extract More Fields
Add selectors to pull author, availability, product links, ISBN, or other info
π€ Change Delivery Method
Swap Gmail for Google Drive, Slack, Dropbox, Notion, or database uploads
π Trigger Flexibility
Trigger by webhook, Airtable, form input, schedule, or manual launch
βοΈ Adjust Sorting Logic
Sort by price, title, author, availability, or any numeric/alphabetic field
π Scrape Different Sites
Change the URL and CSS selectors to support other catalogs or product sites
β οΈ Add Error Handling
Insert conditional logic or alerts if scraping fails or output is empty
π§Ό Enrich or Clean Data
Add transformation nodes to standardize, format, or enrich the extracted data