DeepPavlov/rubert-base-cased-conversational

1年前发布 4 00

rubert-base-cased-conversat...

收录时间：

2025-06-02

打开网站手机查看

DeepPavlov/rubert-base-cased-conversational

打开网站

rubert-base-cased-conversational

Conversational RuBERT (Russian, cased, 12‑layer, 768‑hidden, 12‑heads, 180M parameters) was trained on OpenSubtitles[1], Dirty, Pikabu, and a Social Media segment of Taiga corpus[2]. We assembled a new vocabulary for Conversational RuBERT model on this data and initialized the model with RuBERT.
08.11.2021: upload model with MLM and NSP heads
[1]: P. Lison and J. Tiedemann, 2016, OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016)
[2]: Shavrina T., Shapovalova O. (2017) TO THE METHODOLOGY OF CORPUS CONSTRUCTION FOR MACHINE LEARNING: «TAIGA» SYNTAX TREE CORPUS AND PARSER. in proc. of “CORPORA2017”, international conference , Saint-Petersbourg, 2017.

数据统计

暂无评论

您必须登录才能参与评论！

立即登录

暂无评论...

DeepPavlov/rubert-base-cased-conversational

rubert-base-cased-conversational

数据统计

相关导航

暂无评论

网址

青苹果影院

老弟影视

YY直播

GI加速器

樱花动漫

大西瓜影视

热门推荐