Reference

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2018). Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.

https://paperswithcode.com/method/bert

0

1

Updated 2021-08-12

Tags

Data Science

Related