Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GH-678: Data Samplers #908

Merged
merged 6 commits into from
Jul 19, 2019
Merged

GH-678: Data Samplers #908

merged 6 commits into from
Jul 19, 2019

Conversation

alanakbik
Copy link
Collaborator

This PR adds a set of Samplers that can be used in the ModelTrainer:

The ImbalancedClassificationDatasetSampler will upsample rare classes and downsample common classes in a classification dataset. It may potentially help with imbalanced datasets.

The two chunk samplers split a dataset into chunks and shuffle them. This preserves some ordering of the original data while also randomizing the data.

@alanakbik
Copy link
Collaborator Author

@bluesheeptoken thanks! I've made the changes.

@alanakbik
Copy link
Collaborator Author

👍

1 similar comment
@yosipk
Copy link
Collaborator

yosipk commented Jul 19, 2019

👍

@yosipk yosipk merged commit 2f04c94 into master Jul 19, 2019
@alanakbik alanakbik deleted the samplers branch July 22, 2019 10:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants