Thanks to pytorch-pretrained-BERT, we can add a new embeddings class for Transformer-XL embeddings (idea mainly taken from #68).
Awesome!
I'm not sure if the PR should land into the upcoming 0.4.1 release 馃
Training is working :) Previous experiments I made resulted in a F-Score of 90 - 91 on CoNLL 2003 (English). I'm currently running a new experiment + I will share the results here soon!
Hey wow that was fast :)
Training is also working on my end - I'm doing some example training runs. I'll merge into 0.4.1 branch in a bit once first numbers come in!
Transformer-XL embeddings can be used in 0.4.1 now, so I'm closing here :)
Most helpful comment
Training is also working on my end - I'm doing some example training runs. I'll merge into 0.4.1 branch in a bit once first numbers come in!