Bert: Can anyone point me in the code for the segment and position embeddings?

Created on 22 Jan 2019 · 8Comments · Source: google-research/bert

I'm referring to the embeddings of the this picture.

Source

JoaoLages

Most helpful comment

@JoaoLages I wrote a blog post explaining the embeddings. I hope its detailed enough :)

hsm207 on 19 Feb 2019

❤3 🎉1

All 8 comments

It's at line 185 in modelling.py

hsm207 on 22 Jan 2019

👍1

Leaving this issue open as someone (or me) might have some doubts about the implementation (which is not explained in detail in the paper)

JoaoLages on 6 Feb 2019

yes me
On Feb 6, 2019 6:44 PM, João Lages notifications@github.com wrote:

Leaving this issue open as someone (or me) might have some doubts about the implementation (which is not explained in detail in the paper)

—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHubhttps://github.com/google-research/bert/issues/386#issuecomment-460994435, or mute the threadhttps://github.com/notifications/unsubscribe-auth/AWBz_C5HTZdbNwzPGRg1NqIH-eFhq0w-ks5vKsADgaJpZM4aMcK8.

IntelOSt on 6 Feb 2019

@hsm207 would you care to explain the implementation of these embeddings in detail? I.e., what are each embeddings (token, segment and position) and how they are combined? Thanks :)

JoaoLages on 18 Feb 2019

@JoaoLages I wrote a blog post explaining the embeddings. I hope its detailed enough :)

hsm207 on 19 Feb 2019

❤3 🎉1

@hsm207 Once again, thank you A LOT for the amazing blog post. I just have another question: BERT mentions BPE in their paper. But for what I see the embedding table is actually a lookup table, that does not deal with OOV problems.

To my understanding, BPE is used to train the word piece embeddings. Idk if BERT retrained those embeddings previously or just used the pretrained ones in the lookup table for the token embeddings.

JoaoLages on 25 Mar 2019

@JoaoLages I'm glad it helped.

Where in the paper was BPE mentioned?

hsm207 on 25 Mar 2019

Oh sorry, I was confused with the openAI GPT-2 model. All good :)

JoaoLages on 25 Mar 2019

👍1

Was this page helpful?

0 / 5 - 0 ratings