Your Reviews Are Converting Humans. They're Not Convincing the Agent.

Share

Something I've been sitting with for a while is the gap between how brands think about reviews and what reviews are actually doing in an agentic commerce environment.

Most ecommerce teams I work with have a reasonably sophisticated onsite review strategy. Star ratings above the fold. Review carousels on product pages. Post-purchase flows designed to generate volume. UGC pulled through to product cards. The whole stack built around a human visitor who lands on the site already considering the brand and needs reassurance before they commit.

That stack still matters. It still converts. But it's solving a different problem from the one I want to talk about here.

Onsite reviews convert humans who are already considering you. Third party reviews convince AI shopping agents to put you in the consideration set before any human has seen your site at all. These are not the same problem. And most brands are only building infrastructure for one of them.

This pattern is older than Google

Before I get into the mechanics, I want to put this in context because I think the historical parallel is genuinely useful.

When Yahoo launched its directory in 1994, the question it was trying to answer was exactly the same one AI shopping agents are trying to answer today. How do you decide which sources to trust when you can't verify everything directly? Yahoo's answer was editorial. Human curators evaluated sites and decided which ones were credible enough to list. Trust was centralised and deliberate. It worked until the web scaled beyond any editorial team's ability to keep up.

Google's answer was different. Instead of centralised editorial judgement, PageRank distributed the trust problem across the web itself. A backlink is one site staking its own credibility on another. TrustRank refined this further by seeding from a small set of known trusted domains and measuring how many steps removed any given site was from that trusted set. The insight was elegant: manipulation at scale is structurally hard when trust has to be conferred by independent third parties who have their own reputation to protect. You can't build fake authority without real sites choosing to point at you. And real sites with real authority don't do that carelessly.

The brands that understood PageRank early built genuine link equity and compounded that advantage for years. The brands that tried to game it with link schemes got penalised and spent considerable time recovering. The brands that waited to see how it settled found that the window to establish authority had already closed and catching up was slow and expensive.

AI shopping agents are running the same architecture. Not as backlinks but as the same underlying logic. The agent doesn't trust what you publish on your own domain. It looks for independent entities that have chosen to stake their own credibility on your brand by referencing it. The mechanism is different. The epistemology is identical.

Why self-reported signals carry less weight

This is the thing most brands haven't fully absorbed yet.

An AI shopping agent approaches a brand's own website the way a sceptical researcher approaches a company's own press releases. Useful background. Not sufficient evidence.

Anything published on a domain the brand controls sits in a category agents treat as self-reported. That includes onsite reviews, even genuine ones. The agent can read them. If they're properly marked up with Review schema, the agent can extract ratings, volumes, and sentiment. But it defines those signals differently from signals that originate on platforms the brand doesn't control, can't edit, and can't delete.

This isn't a flaw. It's a deliberate design position. Self-reported signals are the easiest to fabricate. An agent that weighted them equally to independent signals would be trivially easy to manipulate, and the platforms building these agents know it.

So the agent looks past what you say about yourself toward what independent sources say about you. And the question for most fashion brands right now is: what do independent sources actually say?

The three signals agents treat as genuinely independent

I want to be clear that I'm still working out exactly how different agents weight these relative to each other. The resolution logic isn't fully visible from the outside and it varies by platform. But the direction is consistent enough that I'm confident in the framework even if the precise weighting is still evolving.

Third party review platforms with established domain authority.

Trustpilot sits at a domain authority of 95 and carries somewhere north of 350 million reviews. Google Reviews is native to the search infrastructure AI shopping agents already rely on. Both platforms are structurally difficult to manipulate at meaningful scale. Both have their own commercial and reputational incentives to maintain signal integrity. Agents know this, and they treat reviews on these platforms differently from reviews on a brand's own site.

When a brand has a substantial, consistent, actively managed presence on Trustpilot or Google Reviews, it tells the agent something specific. Real customers have transacted with this brand. Enough of them cared enough to say something about it in a place the brand doesn't control. The volume, recency, and sentiment of those signals all contribute to how confidently the agent can characterise the brand's customer experience when building a recommendation.

When those profiles are thin, inactive, or inconsistently managed, the agent is working with much less to go on. It may still surface the brand. But the confidence level attached to that recommendation is lower, and lower confidence recommendations get displaced when a competitor has built stronger independent signals.

Press and editorial coverage.

References to a brand in independent editorial contexts such as trade press, fashion titles, mainstream media, credible blogs with established readership, act as corroborating signals in the same way backlinks did for PageRank. A third party with its own credibility at stake has chosen to associate itself with the brand. That choice carries weight precisely because it isn't free. Editorial coverage requires either something genuinely interesting happening at the brand or a PR operation that has earned attention. Agents can increasingly identify and discount paid placements and sponsored content, which is worth keeping in mind when thinking about how press activity actually builds entity authority.

Consistent entity signals across independent directories and platforms.

This is the one most brands haven't thought about at all. The degree to which what your Organisation schema claims about your brand matches what Google Business Profile, Companies House, Trustpilot, social profiles, and independent directories independently say about you. When these align using the same legal name, same address, same website, same founding story, the agent has a corroborated picture of a real, stable, verifiable entity. When they conflict, or when half of them are absent, the picture has gaps.

Where Organisation schema fits into this

Organisation schema on its own is a self-reported signal. The brand writes it, publishes it on its own domain, and the agent reads it. Useful. Not sufficient on its own.

What Organisation schema actually does in this context is provide the identity anchor that allows the agent to match the brand's self-reported identity against independent corroboration. The sameAs property is the mechanism. It tells the agent: the entity described on this domain is the same entity as this Trustpilot profile, this Google Business listing, this Instagram account, this Wikipedia entry if one exists.

When those links hold and the agent follows them to find consistent, active, brand-owned profiles with genuine signals of real commercial activity, the Organisation schema elevates from a self-reported claim to something closer to a verified identity. The agent has used the schema as a starting point and then confirmed it through independent sources it trusts.

When sameAs links are broken, point to rebranded social handles that now 404, or lead to thin and inactive pages, the identity anchor fails. And an unverified identity claim is weaker in some ways than no claim at all, because it raises the question of why the verification attempt failed. Shopping agents hate uncertainty.

I've audited brands where the sameAs array in their Organisation schema pointed to an Instagram handle that was changed during a rebrand about eighteen months ago. Nobody on the current team knew the schema existed, let alone that it was pointing somewhere that no longer resolved to the brand. That kind of gap doesn't show up in GA4. It doesn't trigger a Search Console error. It just sits there quietly undermining the identity verification the agent is trying to run.

Why fashion brands have a specific version of this problem

Review volume in fashion tends to be lower relative to transaction volume than in most other ecommerce categories. Customers buying from premium or streetwear brands are often less likely to leave a review than customers buying commodity products. The review signal is thin even when sentiment is genuinely positive. I'd say this is one of the more under appreciated structural disadvantages fashion brands carry into agentic commerce. They have fewer natural review signals to work with and they've historically underinvested in generating them through third party platforms because the onsite conversion argument was sufficient.

The platform consistency problem is also worse in fashion than most teams realise. Some brands have a reasonable Trustpilot presence and almost nothing on Google Reviews. Others have scattered reviews across four or five platforms with no consistent, actively managed profile on any of them. The agent can't build a coherent picture from fragmented signals. What it needs is depth on a small number of high-authority platforms, not shallow presence everywhere.

And brand identity across third party platforms tends to be poorly governed in this category. A rebrand changes the trading name but the Trustpilot profile still uses the old one. The Google Business Profile has an address that's two fulfilment partners out of date. The Companies House registration uses the full legal entity name and the Trustpilot profile uses the shortened trading name and they don't obviously match. Every one of these is a corroboration gap. None of them feel urgent when the site is converting well and human traffic is healthy.

The coordination problem underneath all of this is the same one I see across most structured data work. It lives in the gap between technical SEO, brand, and operations. None of those teams feels fully responsible for it. So it doesn't get done, and nobody knows it's not done because the consequences aren't visible in any dashboard anyone is looking at.

What to check before you assume you're in good shape

This isn't a comprehensive audit. It's the minimum I'd want to run before telling a brand their third party trust signals are working.

Pull up the brand's Trustpilot profile. Is it claimed? Is it actively managed, meaning someone is responding to reviews? What's the review velocity over the past three months? If the last review is from eight months ago and there are fewer than fifty total, the signal is thin regardless of the average rating.

Check the Google Business Profile. Is it verified? Does the business name match the legal entity name in the Organisation schema? Does the address match? Does the website URL match? These three things should be identical. They often aren't.

Go through every URL in the sameAs array of the Organisation schema. Follow each one. Does it resolve? Does it lead to an active, brand-owned profile? Does the profile look like a real, operating brand? Do this manually. Validators won't catch a link that resolves but points to an abandoned account.

Then query the brand name directly in ChatGPT, Perplexity, and Google AI Mode. Ask what the brand's customer experience is like. Ask whether it's a trustworthy place to buy. Look at what the agent says and where it's drawing that characterisation from. If the response is thin, hedged, or pulls from the brand's own site rather than independent sources, the independent signal isn't strong enough yet.

This is infrastructure, not marketing

The reason I've spent time on the historical parallel is that I think it changes how teams should be thinking about this.

When Google TrustRank established that trusted seed domains conferred authority through links, the brands and publishers who understood that early built something durable. Not because they were clever about gaming a system but because they understood that genuine third party endorsement compounds over time in a way that self-reported signals don't. Each review on Trustpilot is a small increment of independent corroboration. Each editorial mention is another third party staking its credibility alongside yours. The agent's confidence in recommending your brand builds the same way PageRank built domain authority: slowly, through accumulated independent signals, in a way that's very hard to replicate quickly once you've fallen behind.

Trustpilot and Google Reviews sit in the agentic trust layer for the same structural reason that high-authority domains mattered in the backlink era. Their authority comes from being independent, from being hard to manipulate at scale, and from carrying reputational risk if they're gamed. That's not an accident and it's not going to change.

The brands treating their third party review presence as infrastructure and governing it with the same rigour they apply to product data, making sure sameAs links hold, making sure platform profiles are consistent and actively managed are building something that compounds. The brands treating it as a CRM metric that feeds an onsite carousel are solving for a conversion problem that is increasingly less important than the consideration problem forming upstream.

History is fairly clear on how this tends to go. The window to build genuine authority is early. The penalty for trying to shortcut it is real. And the brands waiting to see how the agent landscape settles are making the same bet the brands waiting for PageRank to stabilise made in 2003.

Most of them regretted it.