Found in AI: AI Search Visibility, SEO, & GEO

Should You Block AI Bots From Your Website? | AJ Ghergich, Botify [Part 3 of 3]

• Cassie Clark • Episode 65

Use Left/Right to seek, Home/End to jump to start or end. Hold shift to jump forward or backward.

0:00 | 18:18

Send us Fan Mail

📬 You like this podcast? You’ll love the newsletter.
Join the weekly 3-2-1 on AI search + marketing: subscribe

Everyone's asking whether to block AI bots from their site. AJ Ghergich thinks that's the wrong question.

When AJ and I recorded this conversation, most companies were still treating AI bot governance as an IT problem — something to handle quietly at the server level. But the bots hitting your site aren't just an infrastructure cost. They're shaping what AI tells the world about your brand. And right now, most companies — including some of the biggest in the world — can't say what data of theirs is being used, whether it's accurate, or whether they have any plan at all.

AJ leads the consulting and AI team at Botify and has these conversations with global brands every week. His take is clear: AI bot governance isn't a "block or allow" switch. It's a set of decisions that requires marketing, IT, and legal to be at the same table. Because the moment your product data, your positioning, or your facts show up wrong in an AI answer, it stops being a technical problem and becomes a brand one.

In this episode:

  • Why AI bot governance falls through the cracks — and the case for why it's a marketing, IT, and legal problem, not any one team's job
  • The three questions every brand should be able to answer about their AI data — and why even some of the world's biggest brands can't
  • Training data vs. live retrieval — the distinction most brands miss, and why retailers need to win at the moment of retrieval, not just "get into ChatGPT"
  • The llms.txt question, settled: why it's an 11th-priority shiny object for 95% of brands — and what Google's new Lighthouse check does and doesn't actually change
  • What a good governance plan looks like versus a bad, binary one — including AJ's idea of "user testing, but for AI agents"
  • How to fix it when AI is telling the wrong story about you — like the cosmetics brand that got falsely labeled as not animal-friendly, and how to hunt down the source
  • Why your old, deprecated brand data — three-CMOs-ago mission statements, retired colors, and positioning — might be quietly confusing AI right now
  • Who actually needs to be at the table — and why "that's IT's problem" is the answer that gets brands in trouble

Let’s connect:

LinkedIn → Cassie Clark | Fractional Content Strategist
Website → https://cassieclarkmarketing.com

Download Freshness, Structure, Authority: The Framework for AI Search Visibility:

Amazon

P.S. Is your brand losing its "Answer Authority"?

Most series A/B and enterprise brands are being "nudged" out of AI search results because of entity gaps and "stale" content. I am opening a limited number of specialized audit slots to help you reclaim your Share of Voice using the FSA Framework (Freshness, Structure, Authority).

Request your 7-Day AI Search Visibility Audit: https://cassieclarkmarketing.com/ai-search-visibility-audit/

If you're listening to this and thinking I need someone to lead this for me, that's what I do.

I'm an AI search optimization expert and a fractional content strategist for startups and enterprise brands. If that sounds like the kind of help you're looking for, email me at cassie@cassieclarkmarketing.com.

Hey, welcome back to Found and AI. I'm Cassie Clark, a fractional content strategist, an AI search optimization expert, and the host of the show where we talk about AI search, GEO, AEO, and what all of it actually means so we don't get lost in this new wave of user search behavior. My cat is at the my feet under my desk and he's making a lot of noise. He's orange and he's sneezing. He's not the only one sneezing, I'm also sick. But today we're getting into something that does not get nearly enough airtime. AI bot governance. I don't mean that in the dry compliance y kind of way, but I mean it in the real messy question that every company is wrestling with right now. Which bots can access your site, what they're allowed to crawl, and whether the content they grab is used to train a model or to answer a customer in live retrieval. Spoiler, those are two very different problems with very different answers. This is the third part of the series with AJ Gergic. He leads consulting in the AI team at Botify. He's been having these conversations with some of the biggest brands in the world every day about what to do with these AI bots. We get into why governance falls through the cracks between marketing, IT, and legal. The three questions that most CMOs still can't answer what a good plan looks like versus a bad one. And I take us on a little side road about the LOMS.txt file and kind of thank goodness that I did at the time because Google is changing guidance on that by the minute, it feels like. Yeah, I I think you know governance means different things to different people, obviously. I think inside of a company right now, um the main question people are saying is AI governance marketing, IT, or like a legal problem? And then the to be the obvious answer, I guess, is it's all three. Um, and that's why it falls through the cracks. Um, so it's it's really a governance is a it's a set of decisions about what which bots can access your site, what they can crawl, um, whether the content that they get used is for training or for live retrieval for a customer. Um, you know, and everybody cares about it, but for different reasons. So like marketing cares because it affects visibility. IT cares because they have that issue where they're getting inundated, crushed with bot traffic that they have to serve, and that's a cost and an infrastructure problem versus humans, right? So it's like this huge load on the IT side. Um, and then legal cares because maybe your product data is showing up, you an AI you didn't authorize, maybe it's hallucinating facts, and now are you liable for that? It it said it was good for this, but it's not, right? It that could be a problem. Like, who's whose fault is that gonna be? Is it gonna be Open AI's fault or yours? Um, so everyone cares about it, but I think it's really um a combination right now of marketing IT and legal um just wrestling with those questions. And I I think that kind of leads to the obvious question of like, where do you start then? Right. I you know, yeah. Well, where do we start? Where do we start with that? So uh, you know, again at Botify, I get to I mentioned I get talked to a lot of a lot of folks, a lot of big companies, global companies, um, and and growth companies. Um, so this is the type of conversation I have all the time. Like, how do we start? What should we do? You know, that kind of thing. Um, I think it's you start with some simple questions. Um, so as a brand, do you know what data of yours is being used to train AI models? And if you don't have like if you have like an extremely loose answer to that, and you're you're not just a you know one woman shop, you know, like like you you should be able to answer that question as an organization. Um and then comes the the next question is that data accurate? Is it updated? Um and then do you have a plan in place for what you're gonna allow AI to retrieve for training and in real time? And I, like I said, talk to a lot of brands. Most cannot answer that question. Some of the biggest brands in the world, at least I would say a year ago, it's getting better now, could not answer those three questions. So I would ask a CMO, I'd ask her those three questions, couldn't answer it. So I don't want anybody who is like, I can't answer those questions to panic. Like you're not, it's not uncommon. But I do think it's you need to start thinking about that. Um, you need to start thinking about um what bots are hitting your site, how often, um, what information they're getting, all the way down to like, let's say you're not a technical person. Let's say you you're like, AJ, I'm a I I I work on branding. You know, I'm a brand manager. Okay. Well, what if OpenAI has three CMOs ago mission statements about your brand and its training data? How could that create confusion about what you say your mission is today, right? What about your fonts? Have the website, your colors, everything. Like, think about all the confusion that's out there that it may be having from old deprecated data about your brand and your positioning versus today's positioning. Like, so this is an everyone conversation. You can't just go, well, that's IT, let them deal with the bots. It's like, no, that's it's a brand conversation, right? Like, so it everyone should get involved. Um, and then the question I think becomes um, you know, who's running the show? Right, right. I mean, I would have been saying just for like AI search optimization in general, that this is not just a marketing thing. It is, but it touches every layer of the organization. So even with this conversation, it's still every layer of the organization. So let me ask you a question because I just was talking to a brand earlier in the week. She was asking me about the files that you might may or may not need on your website, talking about the robot.txt file and the LM LS file. Do we need those? So this is the new like SEO bar fight question. So it used to be um root domain versus subdomain, which I won't dork out, but um if I'll just say this if your if your IT ever asks you if you want to put some content on a subdomain or your main domain, put it on your main. So it just it it's it's better for everyone. But um this used to that used to cause all kinds of issues with inside the SEO community. This is the new version of that. Um the short answer is for 95% of brands, you don't need the LLM's TXT. Um it's not getting picked up um uh in mass. Google's representatives will come out and say they they're not using it. Um now that said, you could we track, we bot Botify tracks log files, we look at it. They do get used in Hit. It's just here's the thing there's 10 other things you should be doing. That this is the the 11th, right? And so that's that's where I'm just like when you're doing this, you're not doing something else. And I'd much rather you be doing something else than that, and then you got to maintain it and all that. It feels good, it's a shiny object, and it's like, ooh, we're cool. We have uh LLMs TXT, it's not gonna do you any good, it's not gonna ring the register. Um, I wouldn't do it for 95% of brands. Okay, that one hit my head, and I'm like, I have to ask it right now. But let's go back to the AI bot governance plan. What does a good plan look like versus what does a bad one look like? And when do you just call it done? Yeah, so I I think a bad plan is very binary. Like we block AI bots, um, you know, or we don't, that kind of thing. Um you need to have enough data at your fingertips to know what are good bots and what are bad bots. Um, and then what bots um are training bots and what bots are live retrieval. So the for a retailer, a lot of folks think I need to get my data into Chat GPT. Like I just need to know about my brand and like my product. But you gotta realize so while we're talking, maybe it's happening or maybe it's gonna happen in an hour, like ChatGPT is gonna release their new model 5.5, right? And so when that happens, the moment that happens, it's gonna be six months out of date, maybe a year out of date. And so my point is it doesn't know if those sneakers are in size 11, uh, if you know pink is in stock, it doesn't know, right? Um, and so most retailers actually need to ri win at the moment of live retrieval. When an when a search uh when an AI searches on a consumer's behalf to see if size 11 is in stock in pink at this price, what is the shipping? And so a lot of folks don't think about it that way, and they're not looking at it that way. Um, I would do it almost like a governance plan should include like user testing, but for AI agents, right? So can the AI agent at live retrieval see if it's in stock and peak in size 11 or not? Or does it have have trouble in accessibility? So um it's not a binary choice. You have to have data for the plan to work, and then honestly, it really should have a hub. Like I believe that um AI can be a hub for all of your spokes. Um, and so you know, you see a lot of folks putting a chief AI officer. You could also um even think about your uh chief uh customer officer becoming uh having AI in in her purview, perhaps, um, so that you you're connecting all of these different spokes under this AI hub because you're right, it's not just marketing, it's it's everybody. It's absolutely everyone. So I mean you make an interesting point about the training data and live retrieval being different. So if you're noticing that your training data or what the training data says about your brand is not correct, how are you going about fixing that? Yeah, so that's a it's a it's a bit of a longer cycle, obviously, right? Because like it's it, it's like it's it's um it's already out of the bag, like right, like it's it's already in the in the uh in the training set. Uh the first thing I would be doing is figuring out where the heck it got the information. So um, you know, I'll leave their name out of it. Uh, but uh a large uh cosmetics company I'm on the phone with them. We're we're this is maybe a year ago, going through um their AI visibility and looking. And essentially the AI started falsely saying that their cosmetics were not like animal friendly. And like, imagine the shitstorm that might cause, right? Like it's like like like that, that is a no, are you kidding me? And it's like, where is it getting the information? Because sometimes it's pulling the information from Reddit or a forum. Um, so some and sometimes it's your own information. Obviously, in this case, it wasn't their information, it was somebody else falsely stating. So the first thing you want to do is hunt down where is it getting the information and the facts? And that's why you need everybody at the table. You need marketing, brand, SEO, you know, you need everybody at the table because why is it why does it have our story wrong? So is it because it can't read JavaScript and it's getting half our content wrong? Is it because our data is telling three different stories? Is it because it's reading some sweaty Redditors post that is not accurate and we need to talk about PR and going in and correcting the record? So all of those things would be good. And then proactively is if you can get into a mindset, especially if you're e-commerce and retail, about pushing, um, getting as much information in your data feeds to push. Um, so you'll see in the next, you know, coming weeks and months, uh, all the major players will be uh adding agentic feeds uh for commerce. That's a great way to correct the record and add your data because you're pushing versus waiting for it to crawl and learn. So you're pushing your information in. So it's a complicated answer and not a perfect answer, but you've got to find the the source of the rot at first before you can do anything. So you said everyone at the table. Is there anyone that we're missing? So we're talking about brand, marketing, legal, IT. Anyone else? I honestly I would start, I would start there. I think um I think every organization is gonna be a little different. I I I wouldn't say um that's a closed group. There's gonna be there's gonna definitely need to be more people on the loop, but that's a good that's where I see 80, 90% of people starting is with those people at the table. And then I would widen the tent from there. Um but yeah, that's that's a you can't go too wrong with if that's your starting block. Okay, that was the final part of my conversation with AJ, and there are a few things that I want you to walk away with. AI bot governance isn't a binary block the bots or don't kind of decision. It's a set of decisions that needs marketing, brand, IT, and legal at the same table because no single one of them owns it. Start with AJ's three questions when you bring everybody to the table. First, do you know what data of yours is training the AI models? Is that data accurate and current? And do you have a plan for what you'll allow AI to retrieve both for training and in real time? If you can't answer those yet, you're in a good company. Everyone is still trying to figure it out, but that's the work that we need to start with. Now, quick update on my lolemass.dxc detour. The story has moved a little since we recorded that. Uh we recorded this like a month or so ago, and then now there's new guidance out. On the show, AJ's take was that for 95% of brands, that file is not worth your time. And then Google two weeks ago has said, hey, you don't need this for visibility. That's Google's official line still, but they've also added an LLMS.txt check to Gross Lighthouse under a new agentic browsing set of audits that flag whether your site even has the file. So naturally, people saw that and went, wait, I thought we didn't need this because it was announced within the same week of Google saying you don't need it. Here's the nuance with all of this. Those audits are about how ready your site is for AI agents and browser tools, not just search rankings. Google's John Mueller framed it as the difference between discovery, getting found, and functionality, meaning helping an agent do its job once they're already it's already on your page. So it doesn't really contradict AJ's point for most brands. If you're a developer docs or heavy agent traffic site, it might be worth looking into putting this on there. For everyone else, it's the same answer. There are 10 more important things to do first. If this helped you, this episode helped you any hit subscribe. I would love you forever. I would love you even more if you had 30 seconds and you could just leave a review. Those reviews do help the show get in front of more marketers and founders who are trying to navigate this stuff. And if you're starting your content strategies this week and you're thinking, okay, where do I even start with all of this? Head over to CassieClark Marketing.com to get an AI search visibility audit. I'll show you exactly where you're showing up, where you're invisible, what to do about it, why your competitors are winning. I hate saying that, but why they're winning, and how you can beat them. Alright, that's it for this week. I will see you in the next episode. Until then, stay visible.