Augmenting Your Text Data

Data augmentation is an important step in machine learning.

When it comes to image processing, rotating images and cropping pieces out of them can really increase the capabilities of your model.

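But you cannot cut random pieces out of a text and expect it to keep the same meaning. For example, cropping "No evidence of pneumothorax" down to "evidence of pneumothorax" reverses the finding.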

So augmenting text-based data is a bit more problematic.

In this repo we will look at a dataset extracted from medical radiology reports. Our input will be the findings described in an X-ray scan, and our output will be the diagnosis made from those findings.

This dataset is prepared for training a seq2seq model, similar to text summarization.
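To make the input/output shape concrete, here is a minimal sketch of reading such a dataset as (findings, diagnosis) pairs. The column names `findings` and `diagnosis`, the helper `load_pairs`, and the two sample rows are made up for illustration; the real headers in `dataset.csv` may differ.

```python
import csv
import io

# Made-up rows for illustration only. The column names "findings" and
# "diagnosis" are assumptions, not the confirmed headers of dataset.csv.
SAMPLE_CSV = """findings,diagnosis
"Heart size is normal. Lungs are clear.",No acute cardiopulmonary disease
"Right lower lobe opacity. No pleural effusion.",Right lower lobe pneumonia
"""


def load_pairs(csv_file):
    """Return a list of (source, target) text pairs for seq2seq training."""
    reader = csv.DictReader(csv_file)
    return [(row["findings"].strip(), row["diagnosis"].strip()) for row in reader]


pairs = load_pairs(io.StringIO(SAMPLE_CSV))
for source, target in pairs:
    print(f"{source!r} -> {target!r}")
```

With a real file you would pass `open("dataset.csv", newline="", encoding="utf-8")` instead of the in-memory sample, after adjusting the column names.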

You can find out how to prepare a similar dataset here -> https://github.com/onuralpArsln/dataPrepWithPandas

If you have a dataset that is already well structured, you can augment it directly and start training afterwards.
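As one possible illustration of augmenting a well-structured dataset directly, the sketch below shuffles the order of whole sentences in the input text while leaving the target untouched. This is not necessarily the technique used in the notebook in this repo; it assumes the sentences in a findings text are independent of each other, and the function names (`split_sentences`, `shuffle_sentences`, `augment_pairs`) are hypothetical.

```python
import random
import re


def split_sentences(text):
    """Split on ., ! or ? followed by whitespace (a rough heuristic)."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def shuffle_sentences(text, rng):
    """Return the text with sentence order shuffled; each sentence stays intact."""
    sentences = split_sentences(text)
    rng.shuffle(sentences)
    return " ".join(sentences)


def augment_pairs(pairs, copies=2, seed=0):
    """Keep every original pair and add up to `copies` shuffled variants of it.

    The target text is never modified, and exact duplicates are dropped.
    """
    rng = random.Random(seed)  # fixed seed so runs are reproducible
    augmented = []
    seen = set()
    for source, target in pairs:
        candidates = [source] + [shuffle_sentences(source, rng) for _ in range(copies)]
        for variant in candidates:
            if (variant, target) not in seen:
                seen.add((variant, target))
                augmented.append((variant, target))
    return augmented


# Made-up example pair, for illustration only.
example = [("Heart size is normal. Lungs are clear. No pleural effusion.",
            "No acute cardiopulmonary disease")]
for source, target in augment_pairs(example, copies=3):
    print(source, "->", target)
```

Unlike random cropping, this keeps every sentence whole, so negations such as "No pleural effusion." survive. The sentence splitter is naive and will also split after abbreviations like "approx.", so check a sample of the output before training on it.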
