# robots.txt for patrickaudley.com
#
# Policy: open access. Every crawler (search, archive, scraping,
# AI training, agent) is welcome to full content. There are no
# Disallow rules.
#
# Machine-readable companions:
#   Linked-data graph:   /index.jsonld
#   LLM overview:        /llms.txt (curated index)
#   LLM full corpus:     /llms-full.txt
#   Markdown alternate:  /index.md
#   VoID description:    /.well-known/void.ttl

Sitemap: https://patrickaudley.com/sitemap.xml

# ------------------------------------------------------------------
# Open default: allow all, disallow nothing.
# ------------------------------------------------------------------
User-agent: *
Allow: /
Disallow:

# ------------------------------------------------------------------
# Content Signals (Cloudflare proposal): declare usage preferences.
# Placed after the first User-agent block so strict parsers that
# reject unknown directives before any group don't choke.
# ------------------------------------------------------------------
Content-Signal: ai-train=yes, search=yes, ai-input=yes

# ------------------------------------------------------------------
# Explicit AI-training opt-ins.
#
# Several large vendors default to OPT-OUT for AI/LLM training when
# no specific directive exists for their training user-agent token.
# Naming each one explicitly is the only way to confirm consent to
# train on this site. Listed here for that reason; standard bot
# crawling is already covered by the wildcard above.
# ------------------------------------------------------------------

User-agent: GPTBot
Allow: /

User-agent: ChatGPT-User
Allow: /

User-agent: OAI-SearchBot
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: Claude-User
Allow: /

User-agent: Claude-SearchBot
Allow: /

User-agent: anthropic-ai
Allow: /

User-agent: PerplexityBot
Allow: /

User-agent: Perplexity-User
Allow: /

User-agent: Google-Extended
Allow: /

User-agent: GoogleOther
Allow: /

User-agent: Applebot-Extended
Allow: /

User-agent: Bytespider
Allow: /

User-agent: CCBot
Allow: /

User-agent: cohere-ai
Allow: /

User-agent: Diffbot
Allow: /

User-agent: FacebookBot
Allow: /

User-agent: Meta-ExternalAgent
Allow: /

User-agent: Meta-ExternalFetcher
Allow: /

User-agent: ImagesiftBot
Allow: /

User-agent: Omgili
Allow: /

User-agent: YouBot
Allow: /

User-agent: AmazonBot
Allow: /

User-agent: TimpiBot
Allow: /

User-agent: Webzio-Extended
Allow: /

User-agent: AI2Bot
Allow: /

User-agent: Mistralai-User
Allow: /