How it works
CatchAll processes queries through a multi-stage pipeline that analyzes over 50,000 web pages per job:Analyze
Generates targeted search queries for NewsCatcher’s proprietary news index
and creates validation rules and extraction patterns based on your input.
Cluster
Groups related articles into distinct real-world events using the Leiden
algorithm for community detection.
Validate
Applies generated validators to filter clusters, ensuring only relevant
events that match your criteria proceed to extraction.
Processing typically takes 10-15 minutes per job. Poll the status endpoint
every 30-60 seconds to track progress through each stage.
Key characteristics
Event-centric index
Event-centric index
CatchAll searches NewsCatcher’s continuously updated index of 2+ billion web pages,
optimized for finding real-world events (acquisitions, approvals, incidents)
within a recent timeframe. The system excels at comprehensive event discovery,
not static content retrieval.
Dynamic schemas
Dynamic schemas
Each job generates a unique response schema. Field names and structure in
the
enrichment object vary between jobs—even with identical inputs.Guaranteed fields in every record:record_idrecord_titleenrichmentobjectcitationsarray
- All fields inside
enrichment(names, types, structure)
Customizable extraction
Customizable extraction
Control what data gets extracted by providing custom validators and enrichments,
or let the system generate them automatically based on your query.Custom validators filter which events are relevant:Custom enrichments define what data to extract:The
company enrichment type extracts structured data including name,
alternative names, website candidates, people, and address.Date range controls
Date range controls
Specify custom date ranges for article search, or let the system determine
the optimal time window based on your query (default: 5 days).Date ranges are validated against your plan’s allowed lookback period.
If your requested dates exceed plan limits, the API returns a 400 error
with specific guidance.
Non-deterministic processing
Non-deterministic processing
Identical queries can produce different results:
- LLMs may generate different keywords, validators, and extractors
- Different content sources may be retrieved
- Field names and structure vary between runs
- Record counts differ
Asynchronous operation
Asynchronous operation
Each query creates a job that processes asynchronously. Use the returned
job_id to poll the job status and retrieve results when completed.
Processing typically takes 10-15 minutes.Track detailed progress through the steps
array in the status endpoint response.Batch processing
Batch processing
Results become available progressively during the
enriching stage as
validation completes in batches. Check for status: "enriching" to retrieve
partial results before job completion.The progress_validated field tracks how many candidate clusters have been
processed. This allows you to access early results while the job continues
processing remaining batches.Job continuation
Job continuation
Start with fewer records using the
limit parameter for quick testing, then use
POST /catchAll/continue to process more
records without re-submitting the query.Continue requests preserve all analysis, validation, and extraction logic
from the original job.Endpoints
Base URL:https://catchall.newscatcherapi.com
- Jobs
- Monitors
- Meta
| Endpoint | Method | Description |
|---|---|---|
/catchAll/initialize | POST | Get validator, enrichment, and date suggestions |
/catchAll/submit | POST | Create a new job |
/catchAll/continue | POST | Continue job with higher limit |
/catchAll/jobs/user | GET | List all jobs for your API key |
/catchAll/status/{job_id} | GET | Check job processing status |
/catchAll/pull/{job_id} | GET | Retrieve job results |
Use cases
- Market intelligence: Company earnings, M&A activity, product launches
- Regulatory monitoring: Policy changes, government actions, compliance updates
- Business development: Partnerships, funding rounds, market entries
- Competitive analysis: Competitor activities and announcements
- Research automation: Structured data extraction for analysis
- News aggregation: Topic-specific news with structured output
What’s next
Quickstart
Make your first request and get results in minutes
Monitors
Automate recurring queries with scheduled execution
API Reference
Detailed endpoint documentation and parameters
Dynamic Schemas
Handle variable response structures in your integration
For technical support, contact us at support@newscatcherapi.com.

