Fast Augmentation library for NLP
-
Updated
Aug 19, 2024 - Rust
Fast Augmentation library for NLP
augmented reinforment learning in tensorflow with open gym and blender or unity
A PyPI package for augmenting text data using NLP techniques directly in your pandas dataframe.
[WIP] Fast text augmentation for small text corvus
AAAI Knowledge NLP Submission
solution for https://www.kaggle.com/c/google-quest-challenge
This repository contains the data and code for the paper "Self-training with Two-phase Self-augmentation for Few-shot Dialogue Generation" (EMNLP2022-Findings).
NLP based Industrial Accident Severity Assessment
This repo offers a Python script using NLPAug library & RTT to augment text datasets. It processes TXT files in "data/" folder, translating text and creating augmented versions. Augmented data enhances NLP tasks like chatbot training & text classification. Includes overview of techniques, applications & implementation.
MSc Thesis Code
Source Code, data, and results for my paper titled Linguistic Knowledge in Data Augmentation for Natural Language Processing: An Example on Chinese Question Matching.
Feature space Augmentation
Common approaches to text augmentation, from random text-editing perturbations, back translation, to model-based transformations.
A study that aims to unfold what emotions did Filipino students manifest during a year of Covid-19 quarantines.
Dritributed Text Augmentation Techniques (Appeared AAAI 2023)
Use online translation tool to effectively generates new datasets in other language from original datasets, especially from those popular standard baseline datasets for specific tasks.
ANSI and Unicode are encoding standards used across the world by writers and common users. ANSI is an older encoding version and is used in operating systems like Windows 95/ 98 and much older systems. Unicode is a newer version of encoding used in the current day operating systems
Bangla Text Augmentation
This library helps you to create random words i.e noise in text data. Helpful in many tasks like the generation of random authorization token generation of constant or variable length, text data augmentation, etc.
Add a description, image, and links to the text-augmentation topic page so that developers can more easily learn about it.
To associate your repository with the text-augmentation topic, visit your repo's landing page and select "manage topics."