Bert: What is BERT?

Created on 9 Apr 2019 · 2Comments · Source: google-research/bert

Hello,

I am hearing a lot that I should use BERT for applications instead of word embeddings, but I am not understanding what BERT is? Maybe becuase I do not know transformer.

Could anyone explain, what is BERT? How does it differ from word2vec, Glove and more importantly elmo.

How does bert differ from openai-gpt?

How can I adapt bert to a question-answering model or any classification task?

Any help is highly appreciated.

Thank you.

Source

ghost

Most helpful comment

Guess you want this awesome post: illustrated-bert

wqw547243068 on 9 Apr 2019

😄2 👍1

All 2 comments

Most of your questions are explained in the original paper:

https://arxiv.org/pdf/1810.04805.pdf

alexpnt on 9 Apr 2019

Guess you want this awesome post: illustrated-bert

wqw547243068 on 9 Apr 2019

😄2 👍1

Was this page helpful?

0 / 5 - 0 ratings

Related issues

It took a several days to create pretraining data and hasn't finished.

yyht · 3Comments

What are the requirements of the language in order to included in the BERT?

sharavsambuu · 3Comments

Are linear decay, L2 normalization and learned positional embs essential to the performance?

LorrinWWW · 3Comments

run run_classifier.py on chinese data, Failed to find any matching files for /path/chinese_L-12_H-768_A-12/bert_model.ckpt

qiugen · 4Comments

what is the max length of the context?

hmxv2 · 4Comments