Google News Scraper
by lhotanova
Need to track news stories as they break? This Google News scraper pulls the top featured articles directly from Google's news aggregator, giving you ...
Opens on Apify.com
About Google News Scraper
Need to track news stories as they break? This Google News scraper pulls the top featured articles directly from Google's news aggregator, giving you clean, structured data without the manual hassle. It captures exactly what you'd want: article titles, direct links, source publications, timestamps, and thumbnail images. I use it to monitor coverage on specific topics, track competitors, or gather data for media analysis projects. The structured output makes it easy to import into spreadsheets, databases, or dashboards. It's straightforward to set upβjust specify your search query or topic of interest. The actor handles the rest, returning a reliable dataset that's perfect for researchers, marketers, or anyone who needs a steady stream of organized news data. If you're tired of copying and pasting from Google News, this automates the tedious part and lets you focus on the insights.
What does this actor do?
Google News Scraper is a web scraping and automation tool available on the Apify platform. It's designed to help you extract data and automate tasks efficiently in the cloud.
Key Features
- Cloud-based execution - no local setup required
- Scalable infrastructure for large-scale operations
- API access for integration with your applications
- Built-in proxy rotation and anti-blocking measures
- Scheduled runs and webhooks for automation
How to Use
- Click "Try This Actor" to open it on Apify
- Create a free Apify account if you don't have one
- Configure the input parameters as needed
- Run the actor and download your results
Documentation
Google News Scraper ## What data does Google News Scraper collect? Google News Scraper allows you to extract news metadata such as title, link, source, publication datetime and image. ## Query-based search The Actor provides a simple interface where you can define your search query, language, region, and date range. Unlike manually browsing Google News, which limits you to approximately 100 results per search, Google News Scraper allows you to retrieve significantly more results without restrictions. To bypass the usual result limits, the scraper can automatically filter news articles by date. If you set maxItems to a value greater than 100, the scraper will fetch news articles day by day until it reaches the specified limit. For more precise control over the date range, you can use the dateFrom and dateTo fields in the YYYY-MM-DD format. Alternatively, you can specify an open-ended date range using the openEndedDateRange field with values such as: - 1h β retrieves news from the past hour - 7d β retrieves news from the past week - 1y β retrieves news from the past year You can combine numbers with the h, d, or y shortcuts to customize your date range as needed.
### Advanced search filters You can also use advanced search operators in your queries, such as intitle, inurl, site, exclude operator -, exact match with double-quotes "", AND, OR and more. Example queries with advanced operators: | Query | Explained | | ------------------------------------------------------ | ------------------------------------------------------------ | | intitle:"AI" AND site:bbc.com | Finds articles with "AI" in the title from BBC. | | site:reuters.com "stock market" -crypto | Finds stock market articles on Reuters, excluding crypto-related ones. | | "Samsung Galaxy S25" AND (review OR comparison) | Searches for reviews or comparisons of Samsung Galaxy S25. | | site:nytimes.com intitle:"election" after:2025-01-01 | Retrieves recent NY Times articles with "election" keyword in the title. | | inurl:blog OR inurl:news "climate change" | Searches for climate change mentions in blog or news URLs. | You may already know some of the operators from Google search or Apify's Google Search Scraper. For more information, see Google Guide on Search Operators. ## Topic-based search Google News Scraper also supports topic-based search. You can search for news from one of the topics predefined in topics input field: - WORLD π - NATION π© - BUSINESS πͺ - TECHNOLOGY π» - ENTERTAINMENT πΈ - SPORTS π - SCIENCE π§ͺ - HEALTH π§ββοΈ
### Hashed Topics The scraper also supports hashed topics, allowing you to extract news based on specific topic IDs from Google News. Simply copy the topic ID from the Google News URL and paste it into the topicsHashed input field. For example, if you want to find news about your home country but published by foreign sources or in different languages, you can: 1. Click on your country in Google News. 2. Copy the topic ID from the URL. 3. Paste it into the topicsHashed array. 4. Set the language field to your preferred language and region (e.g., DE:de). This way, you can for example retrieve the latest news from the US but published by German sources like zdf.de or spiegel.de. Similarly, you can search for news about a non-English-speaking country and get results from international sources such as reuters.com, bbc.co.uk, independent.co.uk, or variety.com.
### Topic Sections The scraper also allows you to refine your search by targeting specific sections within broader topics. For example, you can extract news from the NHL section under Sports or Artificial Intelligence within the Technology category. Imagine you need to scrape AI-related news from the section Technology > Artificial Intelligence, which is available at the following URL: https://news.google.com/topics/CAAqJggKIiBDQkFTRWdvSUwyMHZNRGRqTVhZU0FtVnVHZ0pWVXlnQVAB/sections/CAQiQ0NCQVNMQW9JTDIwdk1EZGpNWFlTQW1WdUdnSlZVeUlOQ0FRYUNRb0hMMjB2TUcxcmVpb0pFZ2N2YlM4d2JXdDZLQUEqKggAKiYICiIgQ0JBU0Vnb0lMMjB2TURkak1YWVNBbVZ1R2dKVlV5Z0FQAVAB?hl=en-US&gl=US&ceid=US%3Aen Simply copy the {TOPIC_ID}/section/{SECTION_ID} part of the URL and paste it into the topicsHashed array to target that specific section: json { "topicsHashed": [ "CAAqJggKIiBDQkFTRWdvSUwyMHZNRGRqTVhZU0FtVnVHZ0pWVXlnQVAB/sections/CAQiQ0NCQVNMQW9JTDIwdk1EZGpNWFlTQW1WdUdnSlZVeUlOQ0FRYUNRb0hMMjB2TUcxcmVpb0pFZ2N2YlM4d2JXdDZLQUEqKggAKiYICiIgQ0JBU0Vnb0lMMjB2TURkak1YWVNBbVZ1R2dKVlV5Z0FQAVAB" ] }
## Example output To obtain a detailed output with decoded article links and image URLs, set the fetchArticleDetails input field to true. Extracted news are then stored in a following format: json [ { "title": "A win at last: Big blow to AI world in training data copyright scrap", "link": "https://www.theregister.com/2025/02/12/thomson_reuters_wins_ai_copyright/", "guid": "CBMifkFVX3lxTFByV0JDX09JaDR4bmNfX3AzWVFRbjMwdWNKcmtWQ255MEVGa2lCTHBpTE5JbFFWMG5nVFE1T2M0alVfY0N2N3ZFdFJDV3FrUjhVeDZ2dkQxQUNwb1FPcm9kMV9KYXk0ZUNnN1BDNDlZUXZtY3ktemgwWkpNU3Nxd9IBgwFBVV95cUxPc2tobXlKUmcwR0wyR0podklDS2MzMDhpQTd6N2NVbnRqYndPSVVmckV1Tk82TkNDck1JbmFuUU0yUjByTDE4VFRGVC1jTUc2M0VPOE1wX1RUZWpfOG9DamVmX21lWERuU0JGcHJyVDZQdEFDTHNNZjFQV1BQNkN0SjhaSQ", "source": "The Register", "sourceUrl": "https://www.theregister.com", "publishedAt": "2025-02-12T01:45:00.000Z", "loadedUrl": "https://www.theregister.com/2025/02/12/thomson_reuters_wins_ai_copyright/", "rssLink": "https://news.google.com/rss/articles/CBMifkFVX3lxTFByV0JDX09JaDR4bmNfX3AzWVFRbjMwdWNKcmtWQ255MEVGa2lCTHBpTE5JbFFWMG5nVFE1T2M0alVfY0N2N3ZFdFJDV3FrUjhVeDZ2dkQxQUNwb1FPcm9kMV9KYXk0ZUNnN1BDNDlZUXZtY3ktemgwWkpNU3Nxd9IBgwFBVV95cUxPc2tobXlKUmcwR0wyR0podklDS2MzMDhpQTd6N2NVbnRqYndPSVVmckV1Tk82TkNDck1JbmFuUU0yUjByTDE4VFRGVC1jTUc2M0VPOE1wX1RUZWpfOG9DamVmX21lWERuU0JGcHJyVDZQdEFDTHNNZjFQV1BQNkN0SjhaSQ?oc=5", "image": "https://regmedia.co.uk/2021/08/02/shutterstock_robot_justice.jpg" }, { "title": "Web Scraping Optimization: Tips for Faster, Smarter Scrapers", "link": "https://hackernoon.com/web-scraping-optimization-tips-for-faster-smarter-scrapers", "guid": "CBMiiAFBVV95cUxQUXh5WVZ2RkNpNG9ndjF6V3hMRHBRTGRSVnNkelpwZDY2TWJzejBSMGZrRC1rSm5DZ1BxanpoeFFGdDRjWGpZR0tOUG9FY0kyeWFXOE9MSzBobTg1ajRiZzVhSWhtbm5nSVNJVWExSDBSaEFjUUJkT1JRRDJHSDBrMU9jU2ZZN3RN", "source": "hackernoon.com", "sourceUrl": "https://hackernoon.com", "publishedAt": "2024-11-15T08:00:00.000Z", "loadedUrl": "https://hackernoon.com/web-scraping-optimization-tips-for-faster-smarter-scrapers", "rssLink": "https://news.google.com/rss/articles/CBMiiAFBVV95cUxQUXh5WVZ2RkNpNG9ndjF6V3hMRHBRTGRSVnNkelpwZDY2TWJzejBSMGZrRC1rSm5DZ1BxanpoeFFGdDRjWGpZR0tOUG9FY0kyeWFXOE9MSzBobTg1ajRiZzVhSWhtbm5nSVNJVWExSDBSaEFjUUJkT1JRRDJHSDBrMU9jU2ZZN3RN?oc=5", "image": "https://hackernoon.imgix.net/images/0FC9YtxD4fbD3T7mPipOt4HSxY42-7y034nb.png" } ] > NOTE: Image URLs are retrieved directly from target article websites and may differ from those displayed on the Google News website. The scraper relies on the Google News RSS API, which does not include image links. However, using the RSS API instead of web browsing allows the scraper to efficiently gather all data through simple HTTP requests, making the process faster and more cost-effective. If you set fetchArticleDetails to false, the scraper will run significantly faster and at a lower cost, but the output will only include URLs in the RSS feed format, such as: https://news.google.com/rss/articles/CBMiiAFBVV95cUxQUXh5WVZ2RkNpNG9ndjF6V3hMRHBRTGRSVnNkelpwZDY2TWJzejBSMGZrRC1rSm5DZ1BxanpoeFFGdDRjWGpZR0tOUG9FY0kyeWFXOE9MSzBobTg1ajRiZzVhSWhtbm5nSVNJVWExSDBSaEFjUUJkT1JRRDJHSDBrMU9jU2ZZN3RN?oc=5 The output will be simplified to include RSS links only and exclude preview images: json [ { "title": "How I used Scrapy and Scrapy Playwright to take screenshot", "link": "https://news.google.com/rss/articles/CBMigAFBVV95cUxQRDJDNW1ONklDTlhJN3lsWlIwdTEzV050d284bnZ4YWo2TnQ3MU1XcUZkdHNZTTMxSTZ4NjkwRnh0UFloa0ZsZ0d5Qmx0NjRFd25qMXdBTmN6TVBmcHdKdXhGWnRCSDVMdjVJZlJkTlRTMjZSZXZwdGNYM28xbE9uTA?oc=5", "guid": "CBMigAFBVV95cUxQRDJDNW1ONklDTlhJN3lsWlIwdTEzV050d284bnZ4YWo2TnQ3MU1XcUZkdHNZTTMxSTZ4NjkwRnh0UFloa0ZsZ0d5Qmx0NjRFd25qMXdBTmN6TVBmcHdKdXhGWnRCSDVMdjVJZlJkTlRTMjZSZXZwdGNYM28xbE9uTA", "source": "substack.com", "sourceUrl": "https://substack.com", "publishedAt": "2024-06-09T07:00:00.000Z" }, { "title": "Beyond the Buzz: Making AI Work for Large-Scale Web Data Collection", "link": "https://news.google.com/rss/articles/CBMisgFBVV95cUxPVXVJeGNPTWI0bXVmOWdSc1B5MVpDRWVnX1R3QV8wdzRmb2VNU1hneU9wNEZrMGdOT2Fqc1Rremg1R0lmM1NBdjZJa3llOXc2bDVzN3RwNVppZnJ3Q1FTS1NsRUVxOTlwWl9QNlJYMUpPYUxKeFVRSWpCYXNaOVU2aFZ0RWNzd200cmRRQlJoUE8tcnU2bXNIQkZNOXUxRDBzTXAxSEMtMG5lLTRsdlVWZFRB?oc=5", "guid": "CBMisgFBVV95cUxPVXVJeGNPTWI0bXVmOWdSc1B5MVpDRWVnX1R3QV8wdzRmb2VNU1hneU9wNEZrMGdOT2Fqc1Rremg1R0lmM1NBdjZJa3llOXc2bDVzN3RwNVppZnJ3Q1FTS1NsRUVxOTlwWl9QNlJYMUpPYUxKeFVRSWpCYXNaOVU2aFZ0RWNzd200cmRRQlJoUE8tcnU2bXNIQkZNOXUxRDBzTXAxSEMtMG5lLTRsdlVWZFRB", "source": "GeekWire", "sourceUrl": "https://www.geekwire.com", "publishedAt": "2024-11-26T08:00:00.000Z" } ]
Categories
Common Use Cases
Market Research
Gather competitive intelligence and market data
Lead Generation
Extract contact information for sales outreach
Price Monitoring
Track competitor pricing and product changes
Content Aggregation
Collect and organize content from multiple sources
Ready to Get Started?
Try Google News Scraper now on Apify. Free tier available with no credit card required.
Start Free TrialActor Information
- Developer
- lhotanova
- Pricing
- Paid
- Total Runs
- 2,753,376
- Active Users
- 2,435
Related Actors
Smart Article Extractor
by lukaskrivka
Google Search
by devisty
Twitter Tweets Scraper
by gentle_cloud
Twitter Profile
by danek
Apify provides a cloud platform for web scraping, data extraction, and automation. Build and run web scrapers in the cloud.
Learn more about ApifyNeed Professional Help?
Couldn't solve your problem? Hire a verified specialist on Fiverr to get it done quickly and professionally.
Trusted by millions | Money-back guarantee | 24/7 Support