How Google Crawls Landing Pages for Shopping Ads
When you submit a product feed to Google Merchant Center, you are providing a snapshot of your catalog. However, Google does not take this data at face value. To ensure the integrity of the Shopping ecosystem, Google employs a sophisticated crawling infrastructure that visits your landing pages to verify that the "promise" made in your feed matches the "reality" on your website.
Understanding the mechanics of these crawlers is essential for technical SEOs and performance marketers who need to debug price mismatches and account-level disapprovals.
Identifying the Shopping Crawlers
Google uses several different user-agents to interact with your site, each with a specific purpose. Confusing these with standard web search crawlers is a common error in server log analysis.
1. Googlebot
This is the primary crawler used for both web search and Shopping. In the context of Shopping ads, it handles data verification: confirming that the price and availability in your feed match what the landing page (and its structured data) declares.
2. AdsBot-Google (and AdsBot-Google-Mobile)
These crawlers are specifically tasked with checking landing page quality and policy compliance for Google Ads. Unlike the general Googlebot, AdsBot-Google does not index pages for search; it purely evaluates the ad destination.
3. Googlebot-Image
This crawler retrieves the images referenced in your `image_link` and `additional_image_link` attributes, both for display in Shopping ads and to check that they meet quality standards. If it is blocked, your products can be disapproved because Google cannot verify or cache the product image.
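For reference, these bots typically identify themselves in server logs with user-agent strings along these lines (exact version strings vary over time, so check Google's crawler documentation for the current list):

```
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)
Googlebot-Image/1.0
AdsBot-Google (+http://www.google.com/adsbot.html)
Mozilla/5.0 (Linux; Android ...) ... (compatible; AdsBot-Google-Mobile; +http://www.google.com/mobile/adsbot.html)
```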
The Two Types of Shopping Crawls
Google does not crawl your site at a fixed interval. Instead, crawls are triggered by two distinct mechanisms:
Validation Crawls (Post-Upload)
Whenever you upload a new feed or update an existing one, Google triggers a validation crawl for a subset of your products. This is a high-priority check to ensure that the new data is technically sound and matches the landing page.
Refreshing Crawls (Background)
Even if your feed remains static, Google will periodically recrawl your landing pages. This is the mechanism behind Automatic Item Updates. If the crawler detects a change on the page that isn't in the feed, it may "patch" the data or flag an error.
Technical Requirements for Successful Crawling
For Google to verify your data, your infrastructure must be accessible and readable by their bots.
Robots.txt and Crawl Budget
It is surprisingly common for developers to accidentally block AdsBot-Google or Googlebot in the robots.txt file, especially on staging environments that are mistakenly pushed to production.
- Anti-pattern: Blocking all bots except `Googlebot`.
- Best Practice: Explicitly allow `AdsBot-Google` so that ad-specific validation isn't hindered (see the example below).
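One quirk worth knowing: per Google's documentation, AdsBot crawlers ignore the global `User-agent: *` group in robots.txt, so a blanket Disallow won't block them, but a group that names `AdsBot-Google` directly will. A minimal robots.txt that keeps all Shopping-relevant crawlers unblocked might look like this (the disallowed paths are placeholders for your own private sections):

```
User-agent: Googlebot
Allow: /

User-agent: Googlebot-Image
Allow: /

# AdsBot ignores the * group, so address it by name
User-agent: AdsBot-Google
Allow: /

User-agent: AdsBot-Google-Mobile
Allow: /

# Everyone else: keep checkout and account pages out
User-agent: *
Disallow: /checkout/
Disallow: /account/
```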
JavaScript Rendering (SSR vs. CSR)
Google's crawlers can render JavaScript, but it is "expensive" and can lead to timeouts. If your product prices are injected purely client-side (Client-Side Rendering) after the initial HTML load, there is a risk that the crawler will see a placeholder or a default price instead of the actual price.
- The Solution: Use Server-Side Rendering (SSR) or Static Site Generation (SSG) for critical product attributes like price and availability, so they are present in the initial HTML response (see the sketch below).
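As an illustration (markup simplified, file names hypothetical), compare what the crawler receives in the first HTML response under each setup:

```html
<!-- CSR anti-pattern: the initial HTML carries a placeholder;
     the real price only arrives later via JavaScript -->
<span id="price" class="price">Loading…</span>
<script src="/js/price-widget.js"></script>

<!-- SSR/SSG: the price is already in the HTML the crawler fetches -->
<span class="price">24.99 EUR</span>
```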
Handling URL Parameters
Google Shopping often appends parameters to your URLs (like `?srsltid=...` or `?gclid=...`). If your server treats these as unique pages and fails to serve the correct product content, or if your canonical tags are misconfigured, the crawler may fail to associate the page with the product in your feed.
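For example, a request for a parameterized URL should return the normal product page along with a canonical that strips the tracking parameters (URL hypothetical):

```html
<!-- Request: https://www.example.com/p/blue-widget?srsltid=... -->
<link rel="canonical" href="https://www.example.com/p/blue-widget" />
```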
The Role of Structured Data (Schema.org)
Google's crawlers do not "read" your page like a human. They rely on the semantic layer provided by Schema.org structured data.
The crawler specifically looks for the `Product` schema and the nested `Offer` object. If the `price` property in your schema is formatted incorrectly (e.g., the currency symbol embedded in the price string instead of being declared in a separate `priceCurrency` property), the crawler will fail to extract the data, leading to a "Mismatched value" error even if the price is clearly visible on the page.
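A minimal, correctly formatted JSON-LD example (all values hypothetical); note that the numeric price and the currency live in separate properties:

```json
{
  "@context": "https://schema.org",
  "@type": "Product",
  "name": "Blue Widget",
  "sku": "BW-001",
  "image": "https://www.example.com/images/bw-001.jpg",
  "offers": {
    "@type": "Offer",
    "price": "24.99",
    "priceCurrency": "EUR",
    "availability": "https://schema.org/InStock",
    "url": "https://www.example.com/p/blue-widget"
  }
}
```

Writing the price as "€24.99" inside the `price` property is exactly the kind of formatting that triggers the mismatch error described above.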
Observability: Monitoring Crawl Activity
You can monitor how Google's Shopping bots interact with your site through:
- GMC Diagnostics: The "Automatic Item Updates" report shows when Google has patched data based on a crawl.
- Search Console URL Inspection: This allows you to see the "Crawled Page" as Google sees it, which is invaluable for debugging rendering issues.
- Server Logs: Filtering by the user-agents listed above can reveal whether your server is returning 429 (Too Many Requests) or 5xx errors specifically to Google's bots; a log-filtering sketch follows this list.
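As a starting point, a small script along these lines can surface status codes served specifically to these bots (the `access.log` path and the combined log format are assumptions; adjust the regex to your server's format):

```python
import re
from collections import Counter

# Tokens that identify the Shopping-relevant crawlers in user-agent strings.
# Ordered most-specific first so "Googlebot-Image" is not counted as "Googlebot".
BOT_TOKENS = ("Googlebot-Image", "AdsBot-Google-Mobile", "AdsBot-Google", "Googlebot")

# Assumes a combined log format: ... "GET /path HTTP/1.1" 200 ... "referer" "user-agent"
LINE_RE = re.compile(r'"[A-Z]+ \S+ [^"]*" (?P<status>\d{3}) .*"(?P<ua>[^"]*)"$')

def bot_status_counts(log_path: str) -> Counter:
    """Count HTTP status codes per crawler, so 429/5xx spikes stand out."""
    counts: Counter = Counter()
    with open(log_path, encoding="utf-8", errors="replace") as f:
        for line in f:
            m = LINE_RE.search(line)
            if not m:
                continue
            for token in BOT_TOKENS:
                if token in m.group("ua"):
                    counts[(token, m.group("status"))] += 1
                    break  # first (most specific) match wins
    return counts

if __name__ == "__main__":
    for (bot, status), n in sorted(bot_status_counts("access.log").items()):
        print(f"{bot:22} {status} -> {n}")
```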
A dedicated feed layer (e.g. 42feeds) helps you align your feed data with your crawling reality, ensuring that the signals you send to Google are consistent with what their bots will find on your site.