With all the talk about ChatGPT and other AI bots, did you know that there is an OpenAI ChatGPT bot and it respects the robots.txt protocol? So if you want, you can block OpenAI’s ChatGPT bot from crawling, indexing and using your content and data from your website. This will block the ChatGPT plugins specifically.
Mike King spotted this and posted about it on Twitter, you can see the official documentation over here. It reads:
ChatGPT-User is used by plugins in ChatGPT. This user-agent will only be used to take direct actions on behalf of ChatGPT users and is not used for crawling the web in any automatic fashion.
User agent token: ChatGPT-User
Full user-agent string: Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; ChatGPT-User/1.0; +https://openai.com/bot
To allow plugins to access your site you can explicitly add the ChatGPT-User to your site’s robots.txt:
Here is a screenshot of the document, in case it changes in the future:
Again, if you don’t want OpenAI to use your site’s data for its AI and ChatGPT, you can disallow it in your robots.txt file. Although, I am not sure how real-time this is and if once OpenAI consumed the content, will it then remove it after-the-fact?
Forum discussion at Twitter.
Google has launched a new advertising program called Performance Max for Marketplaces, making it easier…
Here is a recap of what happened in the search forums today, through the eyes…
Why do visitors who start on service pages convert into leads? Because they have commercial…
Have you ever wondered how some brands appear in front of just the right audience…
According to data from GS Statcounter, Google’s search engine market share has fallen to 86.99%,…
Google Ads, Shopping Ads, Admob, and other Google Ads products will soon disallow deepfake sexual…
This website uses cookies.
Leave a Comment