Avoid to index in search engines all the Phabricator's Transactions or other shit (deletions, changes, etc.)
Open, NormalPublic1 Points


I noticed that Phabricator is exposing lot of stuff to search engines like this:

I think that every single changed bit is not something we want to track. Also because often we strip-out email addresses or phone numbers and we do not want to be like Wikipedia and keep track of every single damn change. We are just here to do some co-working and we should not feed search engines with shit.

To do this I've edited the Apache virtualhost to expose a robots.txt:

User-agent: *
Disallow: /transactions/
Alias /robots.txt /home/www-data/

Related to:

Event Timeline

valerio.bozzolan closed this task as Resolved.Mon, Jan 4, 01:40
valerio.bozzolan triaged this task as Normal priority.
valerio.bozzolan created this task.
Restricted Application added a project: User-valerio.bozzolan. · View Herald TranscriptMon, Jan 4, 01:40
valerio.bozzolan reopened this task as Open.Mon, Jan 4, 02:01

Reopened because Phabricator already has a robots.txt so the Alias does not work:

User-Agent: *
Disallow: /diffusion/
Disallow: /source/
Crawl-delay: 1