There is only example on how to run for WikiText-103, but not Billion Word, for the adaptive input paper by Baevski and Auli, 2018.
There is a difference between the two, as Billion Word is of short sentences. Can anyone help out?
@alexeib
Would be interested in this too!
Also interested in the correct setting to reproduce the results.
Most helpful comment
Would be interested in this too!