Mastodon: Expose the "Opt-out of search engine indexing" setting in the API

Created on 25 Jan 2018 · 6 Comments · Source: tootsuite/mastodon

I would like the "Opt-out of search engine indexing" setting to be included in the API account entity and in the JSON user information.
Currently, if I want to build an external search engine, I have to fetch each user's page to find out whether they have opted out.
It would be much easier to use if I could get this along with toots or user information.


  • [x] I searched or browsed the repo’s other issues to ensure this is not a duplicate.
  • [ ] This bug happens on a tagged release and not on master (If you're a user, don't worry about this).
api suggestion

All 6 comments

well, such a feature actually exists

image

I actually don't know what you mean. If the feature is enabled, a meta tag, <meta content='noindex' name='robots'>, is placed in the HTML of profile pages and statuses. There's no need for an extra API.
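For reference, a minimal sketch of how an external client could read that meta tag from a profile page it has already downloaded. It uses only the Python standard library; the names `RobotsMetaParser` and `page_opts_out` are made up for this illustration and are not part of Mastodon.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values may be None for valueless attributes.
        attr_map = {name: (value or "") for name, value in attrs}
        if attr_map.get("name", "").lower() != "robots":
            return
        for part in attr_map.get("content", "").split(","):
            part = part.strip().lower()
            if part:
                self.directives.add(part)


def page_opts_out(html: str) -> bool:
    """Return True if the page carries a robots meta tag with 'noindex'."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return "noindex" in parser.directives


# Usage with the tag quoted in the comment above:
sample = "<html><head><meta content='noindex' name='robots'></head></html>"
print(page_opts_out(sample))
```

This only works on the HTML page, so a client that starts from the API still needs one extra request per account.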

Fetching the user page again after getting toots is wasteful communication.
If the account entity embedded in each toot contained this information, it could be applied immediately.

Perhaps it'd be an idea to use the X-Robots-Tag header? https://developers.google.com/search/reference/robots_meta_tag

Currently, only HTML has "no-robot" indication.
The account data obtained from the API has no "no-robot" attribute.

If I make an application that uses toots and accounts, I cannot tell from the acquired JSON whether that user should be shown to search engine crawlers.
Fetching the HTML again after getting the JSON, only to read the "no-robot" attribute, also puts extra load on the instance.
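To illustrate the request, here is a rough sketch of how a client could filter toots if the embedded account entity carried such a flag. The field name `noindex` is purely hypothetical here (the thread only calls it a "no-robot" attribute), and `indexable_statuses` is an invented helper; the sample dictionaries are hand-written, not real API output.

```python
def indexable_statuses(statuses, flag="noindex"):
    """Keep only statuses whose author has explicitly not opted out.

    `flag` is a hypothetical boolean field on the embedded account entity.
    A missing flag is treated as "unknown" and the status is dropped,
    which errs on the side of not indexing.
    """
    kept = []
    for status in statuses:
        account = status.get("account", {})
        if account.get(flag) is False:
            kept.append(status)
    return kept


# Hand-written sample data, shaped like statuses with an embedded account.
statuses = [
    {"id": "1", "account": {"acct": "alice", "noindex": False}},
    {"id": "2", "account": {"acct": "bob", "noindex": True}},
    {"id": "3", "account": {"acct": "carol"}},  # flag missing
]
print([s["id"] for s in indexable_statuses(statuses)])
```

With a flag like this in the JSON, no second request to the HTML page would be needed.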

Right, the X-Robots-Tag header on API responses would include either noindex or another robots option.
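A small sketch of how a client might interpret such a header, assuming the API did send it (it is only a proposal in this thread). The function name is invented for illustration. It treats `none` as equivalent to `noindex, nofollow`, as described in the Google documentation linked above, and it deliberately ignores per-crawler prefixes such as `googlebot:`.

```python
def robots_header_forbids_indexing(headers: dict) -> bool:
    """Return True if an X-Robots-Tag header contains 'noindex' or 'none'.

    `headers` is a plain mapping of response header names to values.
    Header names are compared case-insensitively.
    """
    for name, value in headers.items():
        if name.lower() != "x-robots-tag":
            continue
        directives = {part.strip().lower() for part in value.split(",")}
        if "noindex" in directives or "none" in directives:
            return True
    return False


# Usage with a hand-written header mapping:
print(robots_header_forbids_indexing({"X-Robots-Tag": "noindex"}))
```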

See also #7164. Potentially there needs to be a way for an API client to specify that it is a bot, without relying on user agent allow; perhaps X-Requested-With: Bot or X-Requested-With: Crawler, and the API would return different or no content based on that?
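As a sketch of what the client side of that suggestion could look like, the snippet below builds (but does not send) a request carrying the proposed header. The header values `Bot` and `Crawler` come from the comment above and are not something the API is known to honor; the host `mastodon.example` and the helper name are placeholders.

```python
from urllib.request import Request


def build_crawler_request(url: str, kind: str = "Crawler") -> Request:
    """Build a GET request that self-identifies as a bot or crawler.

    The X-Requested-With values are the ones proposed in this thread;
    whether a server reacts to them is an assumption.
    """
    return Request(
        url,
        headers={
            "X-Requested-With": kind,
            "Accept": "application/json",
        },
    )


# Placeholder instance host; the request is only constructed, not sent.
req = build_crawler_request("https://mastodon.example/api/v1/timelines/public")
print(req.header_items())
```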
