Automated monitoring system for discovering loan-related radio advertisers, keyword opportunities, and direct-response ad signals from U.S. talk/news radio stations.
This system automatically monitors selected U.S. radio stations and captures advertising signals related to loans and financial services. Instead of manually listening to hours of radio, the pipeline records chunks, transcribes audio, detects keyword matches, identifies likely advertisers, and sends detections into a review queue.
The result is a repeatable intelligence system that helps us discover:
Convert offline radio advertising signals into online search advertising opportunities.
Loan advertisers often test offers on radio before, during, or alongside search campaigns. These ads contain brand names, phone numbers, URLs, offer language, disclaimers, and intent-heavy phrases that are valuable for keyword research.
Manual research is slow because:
Operator picks U.S. stations by market, format, and likelihood of direct-response financial ads.
System verifies each station stream is reachable before a harvest starts.
Captures short audio chunks from live streams on a rolling basis.
Audio chunks are converted to text so ads become searchable.
Looks for loan and financial-service phrases across transcripts.
Each detection saved with station, market, timestamp, transcript, match, and confidence.
Detections, queue, and station health visible to the operator.
Human review validates advertiser and keyword candidates.
Approved keywords flow to paid search and affiliate research.
| Radio Harvest Dashboard | Completed | Internal route at /radio-harvest |
| Station Probe | Completed | 9/9 stations reachable in latest test |
| Overnight Harvest Profile | Completed | Can start overnight keyword harvest |
| Stop Harvest Control | Completed | Operator can stop active harvest |
| Detection View | Completed | Detections visible from dashboard |
| Queue View | Completed | Queue status visible |
| Station View | Completed | Station availability visible |
| Automated Tests | Completed | Harvest dashboard covered by tests |
| Loan Keyword Detection | Active | Detects loan & financial-service phrases |
| Advertiser Review Workflow | In Progress | Detections need human review before campaign use |
| Export / Manager Reporting | Needed | This page supports manager visibility |
| Station | Market | Format | Status |
|---|---|---|---|
| KLIF 570 AM | Dallas, TX | News/Talk | Reachable |
| WBAP 820 AM | Dallas/Fort Worth, TX | News/Talk | Reachable |
| KTRH 740 AM | Houston, TX | News/Talk | Reachable |
| WOAI 1200 AM | San Antonio, TX | News/Talk | Reachable |
| WHBO 1040 | Tampa, FL | News/Talk | Reachable |
| WSB 750 AM | Atlanta, GA | News/Talk | Reachable |
| WTAM 1100 AM | Cleveland, OH | News/Talk | Reachable |
| WIBC 93.1 FM | Indianapolis, IN | News/Talk | Reachable |
| WWTN 99.7 FM | Nashville, TN | News/Talk | Reachable |
These stations are high-priority because talk/news radio tends to carry direct-response financial ads — loan, debt, tax, insurance, and cash-related offers.
Checks whether configured station streams are reachable.
$ harvest_control.py probe --limit 20Starts the overnight keyword monitoring profile.
$ harvest_control.py start --profile overnight_keyword_harvestStops the currently running harvest process.
$ harvest_control.py stopShows whether the system is running, idle, stopped, or errored.
Captured keyword detections with station, timestamp, confidence, transcript snippet.
Pending, processed, dropped, or failed audio chunks.
Configured stations and stream reachability.
Illustrative examples — not yet connected to live backend data.
The system found loan-related ad detections connected to this advertiser across multiple target stations. Repeated cross-market coverage is exactly the signal we want — it suggests active media buying and direct-response testing.
| Station | Market | Confidence |
|---|---|---|
| KLIF 570 AM | Dallas, TX | 0.92 |
| WSB 750 AM | Atlanta, GA | 0.92 |
| KTRH 740 AM | Houston, TX | 0.95 |
| WOAI 1200 AM | San Antonio, TX | 0.95 |
Instead of guessing which loan keywords or advertisers to research, we use live radio ads as a signal source.
| Data Type | Purpose |
|---|---|
| Station name | Know where the ad appeared |
| Market / state | Understand geographic opportunity |
| Timestamp | Identify airing time patterns |
| Audio chunk | Preserve source evidence |
| Transcript | Searchable ad text |
| Keyword matches | Detect relevant vertical |
| Confidence score | Prioritize review |
| Advertiser candidate | Identify possible brand / domain |
| Review status | Separate raw detections from approved intelligence |
| Queue status | Monitor pipeline health |
The system separates raw machine detections from reviewed business intelligence.
The system does not automatically assume every detection is usable. A human reviewer verifies important detections before they inform campaign planning.
This system is for research and intelligence. It does not automatically create ads, claim affiliation with advertisers, or approve trademark keyword usage.
The system discovers opportunities. It does not replace compliance review.
KPI values populate once the dashboard is connected to live backend endpoints.
The immediate goal is to turn raw radio detections into reviewed keyword opportunities that can support paid search testing.