Description
๐ง Who Is This Agent For?
This intelligent automation is built for:
โ๏ธ Virtual Assistants & Researchers
To extract product information like book title and price without manual data entry
โ๏ธ Developers & Automation Specialists
As a plug-and-play template for web scraping and data pipeline automation
โ๏ธ Data Analysts
To get ready-to-use, clean, sorted data from product catalogs for analysis
โ๏ธ E-commerce Businesses
To monitor pricing trends, competitor listings, or create internal product datasets
โ What Problem Does It Solve?
Extracting structured data from websites manually is slow, error-prone, and repetitive. This workflow addresses:
โณ Manual Data Entry
Replaces copy-paste tasks with automatic data extraction
๐งฉ Inconsistent or Messy Data
Ensures structured, uniform output every time
๐ Time Wastage on Recurring Tasks
Handles ongoing scraping needs with zero manual effort
๐ No Automation
Adds end-to-end delivery โ from scraping to CSV email in one flow
๐งพ Limited Data Delivery
Delivers the final CSV via email, ready for direct use or further processing
โ The Solution
This smart workflow scrapes catalog-style websites (e.g., book stores) and:
๐ง Scrapes page content via Dumpling AI
๐ Extracts product details using HTML & CSS selectors
๐ข Sorts data based on fields like price or title
๐ Converts output into a structured CSV file
๐ฉ Delivers the file via Gmail automatically
โ๏ธ How It Works โ The Flow
๐ฅ 1. Trigger: Watch Google Sheet
The workflow starts when a new URL is added to a monitored Google Sheet
๐ 2. Scrape Webpage HTML
A Dumpling AI node fetches cleaned HTML from the target webpage
๐ 3. Extract Book Entries
HTML node extracts book blocks using .row > li as the main selector
๐ 4. Split Individual Books
Array of HTML blocks is split so each book can be processed independently
๐ท๏ธ 5. Extract Book Title & Price
CSS selectors (h3 > a and .price_color) extract title and price from each book entry
๐ 6. Sort by Price
Books are sorted in descending order using the price field
๐งพ 7. Convert to CSV
The sorted data is transformed into a downloadable CSV file
๐ง 8. Send Email with CSV Attachment
Email is sent via Gmail node, attaching the CSV and adding a custom message
๐ ๏ธ Customization Options
๐งฉ Extract More Fields
Add selectors to pull author, availability, product links, ISBN, or other info
๐ค Change Delivery Method
Swap Gmail for Google Drive, Slack, Dropbox, Notion, or database uploads
๐ Trigger Flexibility
Trigger by webhook, Airtable, form input, schedule, or manual launch
โ๏ธ Adjust Sorting Logic
Sort by price, title, author, availability, or any numeric/alphabetic field
๐ Scrape Different Sites
Change the URL and CSS selectors to support other catalogs or product sites
โ ๏ธ Add Error Handling
Insert conditional logic or alerts if scraping fails or output is empty
๐งผ Enrich or Clean Data
Add transformation nodes to standardize, format, or enrich the extracted data













