Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Updated May 9, 2024 - Python
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
🤖 A PyTorch library of curated Transformer models and their composable components
This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla", accepted in Findings of the Annual Conference of the North American Chap…
CINO: Pre-trained Language Models for Chinese Minority Languages
Unattended Lightweight Text Classifiers with LLM Embeddings
PyTorch implementation of sentiment analysis for long texts written in Serbian (a low-resource language), using a pretrained multilingual RoBERTa-based model (XLM-R) on a small dataset.
An implementation of drophead regularization for pytorch transformers
Deep-learning system proposed by HFL for SemEval-2022 Task 8: Multilingual News Similarity
Resources and tools for the Tutorial - "Hate speech detection, mitigation and beyond" presented at ICWSM 2021
This is a PyTorch (+ Hugging Face transformers) implementation of a "simple" text classifier built on BERT-based models. In this lab we will see how simple it is to use BERT for a sentence classification task, obtaining state-of-the-art results in a few lines of Python code.
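As that blurb suggests, inference with a BERT-style sentence classifier takes only a few lines via the Hugging Face `transformers` pipeline API. This is a minimal sketch, not the repository's actual code; the checkpoint name is an illustrative, publicly available model chosen here for the example.

```python
# Minimal sketch: sentence classification with a BERT-family model via the
# Hugging Face `transformers` pipeline API. The checkpoint below is an
# illustrative public model, not necessarily the one the repository uses.
from transformers import pipeline

classifier = pipeline(
    "text-classification",
    model="distilbert-base-uncased-finetuned-sst-2-english",
)

result = classifier("BERT makes sentence classification straightforward.")
print(result)  # a list of dicts with 'label' and 'score' keys
```

Swapping in an XLM-R checkpoint fine-tuned for classification works the same way, since the pipeline abstracts over tokenizer and model architecture.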
Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning
A transformer-based language detection model that recognizes the language in which a given text is written.
Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.
Notebooks to fine-tune `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models on an Amharic text classification dataset using the transformers library
Improving Bilingual Lexicon Induction with Cross-Encoder Reranking (Findings of EMNLP 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.
1st place solution to AI IJC Customer Service task
NLP Workshop - ML India
Our source code for the EACL 2021 workshop task Offensive Language Identification in Dravidian Languages. We ranked 4th, 4th, and 3rd in the Tamil, Malayalam, and Kannada tracks of this task! 🥳