Protect Your Content From AI

As AI-generated content explodes across the internet, many artists, writers, and creators are asking the same question:


“How do I stop my work from being scraped and used to train AI models?”

It’s a valid concern. AI systems learn by analysing huge amounts of data—including publicly available art, blog posts, social media content, and more. If you’ve published work online, chances are it’s already been accessed by bots, unless you’ve taken steps to block it.

Here’s what you can do to protect your creative work from being used without your permission.

1. Use Robots.txt to Block AI Crawlers

AI companies train models using data scraped from the web. You can limit access to your website using the robots.txtfile—a simple bit of code that tells bots which parts of your site they’re not allowed to index.

To block specific AI crawlers, add lines like:

User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: CCBot
Disallow: /

This doesn’t guarantee protection, but it’s a clear signal that your content is off-limits—legally and ethically.

💡 Where to add it: Your website’s root directory (e.g. www.yoursite.com/robots.txt)
💡 Note: This only works on websites you own—not social media or third-party platforms.

2. Opt Out of AI Crawling (Where Available)

Some platforms are starting to offer opt-out options for AI training. Check if the sites you publish on have controls or toggles to block data access from AI models.

For example:

  • WordPress (some plugins allow AI-crawler blocking)

  • ArtStation offers an AI opt-out flag for images

  • Substack and some blog platforms now let you block GPTBot and similar agents

Stay informed as more platforms introduce these controls. If yours hasn’t yet, consider reaching out to ask for them.

3. Add Clear Copyright Notices

AI tools are more likely to avoid scraping sites with visible legal statements. Make your intentions clear:

  • Add a copyright statement to your site footer and image captions

  • Include “Do not use for AI training” in your content policy

  • Watermark your artwork with subtle copyright text or logos

💡 While this won’t stop bad actors, it strengthens your case in any potential disputes and deters casual misuse.

4. Be Mindful of Where You Post

Unfortunately, once your content is online—especially on large, public platforms—it’s harder to control who accesses it.

To stay safer:

  • Avoid posting high-res files to public galleries unless you trust the platform

  • Use private or password-protected portfolios for client work

  • Share cropped or lower-resolution previews where appropriate

5. Support and Follow Legal Developments

The laws around AI training and copyright are evolving quickly. Some creators and organisations are already challenging the use of their content in court. The more visible and vocal you are, the more pressure there is for fairer systems.

You can:

  • Support organisations like the Authors Guild, Creative Commons, or Fight for the Future

  • Follow cases involving AI and copyright (Getty Images, Sarah Andersen vs. Stability AI, etc.)

  • Keep your own work registered and documented for copyright protection

Final Thoughts

AI is here to stay—but that doesn’t mean your creative work should be used without permission. With a mix of technical measures, legal awareness, and strategic publishing, you can take control over how your content is used (or not used) in this fast-changing digital landscape.

Next
Next

Optimise Your Business For AI Marketing