When opening a gatsby site in google translate, it shows a flash of translated content and then shows a 404 page.
E.g.
https://translate.google.com/translate?sl=auto&tl=fr&js=y&hl=en&u=https%3A%2F%2Freactjs.org
or
Open any gatsby page in google translate.
To show the page with the translated text.
A 404 page is shown.
Production
Huh. Google translate re-hosts the site (with translated content) in an iframe with a URL like https://translate.google.com/translate?hl=en&sl=en&tl=fr&u=about.sourcegraph.com
.
Gatsby doesn't know it's being hosted on a different domain and tries to load the content for the /translate
page, which doesn't exist, so it then shows the 404 page instead.
I've marked this as a bug but I'm not sure what the fix would be. Maybe Gatsby shouldn't load the 404 page if the initial SSR render _isn't_ the 404 page?
That sounds like it could be a reasonable fix. Check if a SSRed page is loaded and trust it over the client URL.
This also seems to be an issue with google's webcache, as the url ends up looking like: http://webcache.googleusercontent.com/search?q=cache:[GOOGLE_CACHE_KEY]:[YOUR_CONTENTS_ENDPOINT][GOOGLE_APPENDED_QUERY_VARIABLES]
Which then causes it to redirect to a 404 page and output to the console that A page wasn't found for "/search"
.
I would imagine this is going to happen for any service that acts as a sort of proxy to the content created through Gatsby.
Currently using Gatsby v1.9.273.
@m-allanson
Have you tried accessing your endpoint on google webcache? I tried it a few times with the example you posted (about.sourcegraph.com) and it seems to be working, but trying to access the same endpoint via google translate redirects to the 404 page. Whereas in my case, it redirects to 404 consistently on both.
EDIT: Now that I think about it, what I was seeing was probably the /search
endpoint for that site. So please disregard the comments above and assume the behavior is consistently broken for sites without those endpoints.
Dan Abramov came up with a work around for this — https://github.com/reactjs/reactjs.org/pull/1148
To be clear my workaround is for the crash when using the Translate extension. I haven’t looked into the URL issue but we’d need to solve it too. Ideas?
@gaearon oh hmm yeah — so fixing that would mean Gatsby needs to support alt URL patterns where some other software has taken control of the URL. Seems doable.
Hi All, from Dan Twitter. https://twitter.com/dan_abramov/status/1035575858843578369
Oh, Page Not Found 😧
@KyleAMathews @gaearon
Console say A page wasn't found for "/translate_c"
Perhaps Google Translate attempt to try access /translate_c
at inner JavaScript.
https://www.dropbox.com/s/bfy2t4kc31smc7d/google-react-gatsby.mp4?dl=0
findPage()
appears to be the cause.
How to handle it😰
const page = findPage(path)
if (!page) {
handleResourceLoadError(path, `A page wasn't found for "${path}"`)
@ryota-murakami thanks for looking into this! The logic for finding pages is in https://github.com/gatsbyjs/gatsby/blob/master/packages/gatsby/cache-dir/find-page.js
@KyleAMathews I'm afraid super slow response🙇♂️
I tried fix the Issue, however currently 404 page doesn't show(whiteout browser screen instead) because following change in https://github.com/johncmunson/gatsby/commit/224f6a883e62c134a600e8f707a507bfa166be14.
https://github.com/gatsbyjs/gatsby/blob/master/packages/gatsby/cache-dir/loader.js#L315-L317
As far as I reed commit message that change has been have a different purpose(preload 404) but that is affecting this Issue.
I suppose that fix approach might be better if build by gatsby website loaded from
What do you think @KyleAMathews ?
Still I don't understand gatsby entire mechanism, each package role though 🤔
Old issues will be closed after 30 days of inactivity. This issue has been quiet for 20 days and is being marked as stale. Reply here or add the label "not stale" to keep this issue open!
Hey again!
It’s been 30 days since anything happened on this issue, so our friendly neighborhood robot (that’s me!) is going to close it.
Please keep in mind that I’m only a robot, so if I’ve closed this issue in error, I’m HUMAN_EMOTION_SORRY
. Please feel free to reopen this issue or create a new one if you need anything else.
Thanks again for being part of the Gatsby community!
This isn't fixed, so I'm reopening it.
I'm hitting this issue with gatsby v1 - would it be okay for me to offer a $50 USD bounty for someone to solve it?
Just came across this issue. At first I thought it was a CORS issue, which I resolved by setting the following settings for gatsby-plugin-netlify:
{
resolve: `gatsby-plugin-netlify`,
options: {
headers: {
'/*': [
'Access-Control-Allow-Origin: https://translate.googleusercontent.com',
'Access-Control-Allow-Credentials: true',
'Content-Security-Policy: frame-ancestors https://translate.google.com',
'X-Frame-Options: ALLOW-FROM https://translate.google.com',
],
},
mergeSecurityHeaders: false,
},
},
Now I'm seeing a 404 (in the console, not on the front end).
GET https://southsoundymca.jayray.com/page-data/translate_c/page-data.json 404
This seems to reflect the issue above. Digging into it further it looks like all Gatsby sites are affected and as such, are not able to be translated by google translate. Anyone know if this is actively being worked on? 🤔
We have yet to attempt an install but definitely hope there is a path forward -- Google Translate is a pretty amazing option for clients who don't have the resources do full translations in multiple languages, and it seems perfectly in line with the philosophy behind Gatsby and the JAMstack (so much so that it didn't even occur to me that it would not work, and sadly, I'm finding this out after it is already in the scope-of-work for a current Gatsby project :)
Our team is fairly new to Gatsby and React so I don't understand the internals well enough to jump right in, but if anybody IS working on this and has anything to share or that we can contribute to please let us know, and we will do our best to help as we can.
@broeker, I haven't gotten any answer on this. Let's team up and figure it out? It sounds like we're in a similar situation.
@florinme FYI: Adding your recommend config on my site fixes the white page issue but I don't get a 404, the content flashes with the translation before reverting back to the original language. Maybe I'm on a newer version of Gatsby (I'm at the latest)?
https://5d2ab3e25c12e1d91c1707db--moonmeister-personal.netlify.com/
UPDATE: NVM, I'm seeing it in the console same as you.
@moonmeister @broeker had any luck with a workaround? I'm totally in a jam and don't know what to do...
Okay, so I've done some work on this and understand what's happening better. There are two issues causing problems for sites. The first is unrelated to Gatsby and is a security issue, the second is Gatsby re-hydrating our static pages.
UPDATE: The original issue reported was actually an errant 404 page and path issue(A page wasn't found for "/translate_c"
). I have not seen this on my site, gatsbyjs.org, or reactjs.org. Not sure if Google changed something or if this got fixed in Gatsby somewhere, but it doesn't seem to be part of the larger issue any longer.
Translating a site like reactjs.org (yes, they use gatsby) results in the error:
Load denied by X-Frame-Options: https://reactjs.org/?depth=1&hl=en&rurl=translate.google.com…259,15700262,15700265&usg=ALkJrhgfSw3S-344F8stcoY_Wcxu2-llPA does not permit framing.
This has nothing to do with Gatsby and everything to do with the security headers set on your web server for CORS and CSP. @florinme has correctly identified the correct settings above. This is probably a more common issue with Gatsby sites; because, newer hosting providers like Netlify, which are commonly used with static asset sites, wisely use CSP to block sites from being embedded in iframes.
I'll give a quick explanation of the 4 settings indicated above:
CORS blocks examplea.com from loading content from exampleb.org. To allow cross domain content loading, we must set Response Headers on the web server. So, for Google Translate to load our resources in an iframe we need our hosting provider to set these two headers:
Access-Control-Allow-Origin: https://translate.googleusercontent.com
Access-Control-Allow-Credentials: true
The CSP Response Header tells the browser what can be done with the page being loaded. Loading a site into an iframe is a potential security risk and thus some hosting providers block this by default. Modifying this header allows only Google Translate to load the site:
Content-Security-Policy: frame-ancestors https://translate.google.com
Before CSP was a Standard HTTP Response Header we had a convention to block loading content in iframes, it was X-Frame-Options. THis is ignored if a browser supports CSP, but is useful to ensure backwards compatibility of older browsers(mostly IE).
X-Frame-Options: ALLOW-FROM https://translate.google.com
Part of Gatsby's mystic is how we ship static content that gets re-hydrated into a fully client-side react app. Cool right? well it's causing an issue. Google Translate is loading a pages static html and translating that, about the time it's finished React swoops in and turns your translated content into a react app and overwrites it all with the content(in the original language) from the page's page-data.json
. Because Google Translate thinks it's job is done it moves on and doesn't re-translate the page.
Fixing problem 1 fixes Google Translate showing a blank white screen. Problem 2 causes the flash of translated content reverting back to the original language.
Unfortunately I have no idea how to fix this. If we somehow disabled the Gatsby's re-hydration of the react app we'd be be disabling site functionality (though pausing JS execution via the debugger does show this would work for translation). What really needs to happen is we need to let Google Translate know the page isn't actually finished loading, or that it has been "reloaded"
I don't know if this means dispatching the correct event or if it's not even possible.
This can be solved by disabling the client side router.
You can do that by using this plugin: https://www.gatsbyjs.org/packages/@wardpeet/gatsby-plugin-static-site/ (currently broken).
There is a fork that works with the current latest version of Gatsby that you can found here: https://www.npmjs.com/package/@xavivars/gatsby-plugin-static-site
I uploaded a working example just to show up the solution, the original page is in spanish, and this link displays it translated to english using Google Translate.
I think that this issue could lead to some frustration so I don't know if it is a good idea to have this documented somewhere @marcysutton, I'm thinking on something like a guide on how to disable the client side router and in which cases you could do that 🤔
Aside from this issue, in #4337 there is a discussion with other potential use cases to disable the router.
@urielhdz Thanks for the Info, yes this will fix the issue I identified. It'd be nice if there was a way to maintain CSR and make this work.
Thanks to both @moonmeister and @urielhdz for the work on this! What are the implications of removing the router, if we disable it for translating, navigation will stop, correct? Can we disable routing temporarily while the user is viewing a translated page?
I can confirm that navigation still works,
In regards of the downsides, I think that you lose the speed and performance benefits of using a client side router, so for example, every page transition requires a roundtrip to the server.
Good question @florinme I was asking myself the same.
In regards of the downsides, I think that you lose the speed and performance benefits of using a client side router, so for example, every page transition requires a roundtrip to the server.
@urielhdz Thanks for clarifying.
I would set a separate instance of the website with this setting in a separate domain, something like translate.domain.com.
gatsbyjs.org is still not translatable :(
https://translate.google.de/translate?sl=auto&tl=de&u=gatsbyjs.org%2Fdocs
For what it's worth, Chrome's integrated translator works great.
I had some free time this morning, and I looked into it. The issue is from production-app.js
and the router.
navigate
in production-app.js
When the page is opened on the translation service, production-app.js
made a redirection to the actual webpage. https://github.com/gatsbyjs/gatsby/blob/dced9f1a02eaa266e355599816f8ee14f64614f6/packages/gatsby/cache-dir/production-app.js#L111-L124
The translation service fetches the webpage from their host so that they can manipulate the content on other pages without the CORS problem. That means the URL structure is changed from our routing rules. navigate
is called from here. This is the reason for blinking.
It redirects to /translate_c
or something similar URL. The URL is not valid on our website, so a 404 error happens as a result.
The other problem happens when the components hydrate it. If you add the translation service into headers, it will hydrate the data again; however, it loads wrong data based on the wrong URL because of the reason above.
If the website doesn't have CORS headers, it will fail to hydrate the page anyway because the loader cannot load page-data.json
, actually any resources from the website.
If the website has CORS headers for the translation service, it will hydrate again. However, the router using window.location
and... you know, it will fail to load the proper page as we see above.
This is my solution for this but it is not clean and not ideal. I don't want to specify some translation services in my code so I just added some logic before hitting the navigate
in production-app.js
.
// gatsby-node.js
const fs = require("fs")
const path = require("path")
exports.onPreBootstrap = ({ store }) => {
const { program } = store.getState()
const filePath = path.join(program.directory, ".cache", "production-app.js")
const code = fs.readFileSync(filePath, {
encoding: `utf-8`,
})
const newCode = code.replace(
`const { pagePath, location: browserLoc } = window`,
`const { pagePath } = window
let { location: browserLoc } = window
if (window.parent.location !== browserLoc) {
browserLoc = {
pathname: pagePath
}
}
`
)
fs.writeFileSync(filePath, newCode, `utf-8`)
}
If there is a parent frame, add some stub into browserLoc
and avoid navigate
calling. If you don't have CORS headers for these services, it doesn't have a problem with hydration too because the code cannot fetch page-data.json
from your website.
Obviously, there are downsides to this approach because of missing hydration. Also, it will be a problem if you are using an iframe with the website.
The demo page is here:
https://translate.google.com/translate?sl=ko&tl=en&u=https%3A%2F%2Fxenodochial-swartz-f568e8.netlify.app%2F
I'd love to see some nice solution for this issue.
Most helpful comment
I had some free time this morning, and I looked into it. The issue is from
production-app.js
and the router.Problems
navigate
inproduction-app.js
When the page is opened on the translation service,
production-app.js
made a redirection to the actual webpage. https://github.com/gatsbyjs/gatsby/blob/dced9f1a02eaa266e355599816f8ee14f64614f6/packages/gatsby/cache-dir/production-app.js#L111-L124The translation service fetches the webpage from their host so that they can manipulate the content on other pages without the CORS problem. That means the URL structure is changed from our routing rules.
navigate
is called from here. This is the reason for blinking.It redirects to
/translate_c
or something similar URL. The URL is not valid on our website, so a 404 error happens as a result.Hydration
The other problem happens when the components hydrate it. If you add the translation service into headers, it will hydrate the data again; however, it loads wrong data based on the wrong URL because of the reason above.
If the website doesn't have CORS headers, it will fail to hydrate the page anyway because the loader cannot load
page-data.json
, actually any resources from the website.If the website has CORS headers for the translation service, it will hydrate again. However, the router using
window.location
and... you know, it will fail to load the proper page as we see above.Solution (but dirty)
This is my solution for this but it is not clean and not ideal. I don't want to specify some translation services in my code so I just added some logic before hitting the
navigate
inproduction-app.js
.If there is a parent frame, add some stub into
browserLoc
and avoidnavigate
calling. If you don't have CORS headers for these services, it doesn't have a problem with hydration too because the code cannot fetchpage-data.json
from your website.Obviously, there are downsides to this approach because of missing hydration. Also, it will be a problem if you are using an iframe with the website.
The demo page is here:
https://translate.google.com/translate?sl=ko&tl=en&u=https%3A%2F%2Fxenodochial-swartz-f568e8.netlify.app%2F
I'd love to see some nice solution for this issue.