wiki/knowledge/ai-tools/clay-email-scraping.md · 587 words · 2026-04-05
Clay-Based Email Scraping for List Building
Overview
Clay is a data enrichment and prospecting tool that can find email addresses for people matching specific criteria. It uses a waterfall enrichment approach — querying 8–20 different data sources in sequence and returning verified emails — making it more effective than single-source lookups. Within the Asymmetric stack, Clay work is handled by Jacob.
This technique is most useful when a client's existing contact list is too small to hit their campaign goals and organic list growth is too slow.
How It Works
- Define your criteria — describe the type of person you're looking for (job title, industry, geography, company type, etc.)
- Clay populates a list — it searches across multiple databases and enrichment providers using a waterfall column approach
- You pay per result — costs are consumption-based but are covered under the Asymmetric Clay subscription
- Export and load — verified emails are exported and loaded into the sending platform (e.g., MailerLite, HubSpot)
When to Use It
- Client has a small list relative to their attendance or lead goals (see [1])
- Target audience is not well-represented on LinkedIn (e.g., teachers, school staff, tradespeople)
- A specific niche or geography needs to be targeted quickly
- Supplementing a list pulled from public records or government databases
Limitations and Challenges
- Low-profile audiences are hard to find. People who don't maintain public professional profiles (teachers, janitors, school safety officers) are underrepresented in Clay's data sources. LinkedIn-dependent enrichment will underperform for these segments.
- Criteria ambiguity reduces quality. If the target role varies by organization (e.g., "health and safety person" could be a teacher, administrator, or custodian depending on the school), match rates drop.
- Not a replacement for organic list building. Clay is best used as a one-time boost or targeted supplement, not a substitute for ongoing lead magnet and content-driven list growth.
Alternative and Complementary Sources
When Clay alone isn't sufficient, consider combining it with:
- Event attendee lists — scrape publicly posted attendee or speaker lists from relevant industry events; people who attend public safety or compliance events are likely targets
- Government / public records databases — e.g., Wisconsin DPI (Department of Public Instruction) has school contact data, though it may be behind per-record lookup interfaces that resist bulk scraping
- Lead magnets — publish a useful infographic or white paper gated behind an email capture form; drives organic, permission-based list growth over time (see [2])
- Internal HubSpot lists — check existing Asymmetric client lists for contacts in the relevant geography or industry segment before going external
Workflow
Define target persona
↓
Check internal HubSpot lists first (free, fast)
↓
Engage Jacob → set up Clay criteria
↓
Clay waterfall enrichment runs
↓
Review and clean results
↓
Load into MailerLite / HubSpot sending list
↓
Send campaign; monitor deliverability
Practical Example
Client: [3] (H.S.)
Need: Grow a 1,900-contact list to reach the ~10,000 contacts mathematically required to fill four asbestos training classes at 100 total attendees (see [4])
Approach: Sebastian to schedule a call with Jacob to scope a Clay scrape targeting school employees in Dane County / Madison area, supplemented by a check of Asymmetric's internal HubSpot lists for relevant school contacts
Key constraint: The target audience (school health & safety designees) has no consistent job title and low public profile, making this a harder-than-average Clay use case.