Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Describe two datasets with same fields list (train/test) in one report #173

Closed
lukyanenkomax opened this issue Jun 17, 2019 · 7 comments
Closed
Labels
feature request 💬 Requests for new features

Comments

@lukyanenkomax
Copy link

ML tasks often require comparison of train and test datasets. It It would be nice to see description of two datasets in one report

@lukyanenkomax lukyanenkomax added the feature request 💬 Requests for new features label Jun 17, 2019
@sbrugman
Copy link
Collaborator

Cool request, anyone who wants to implement this is welcome to contribute a pull request!

@Trollgeir
Copy link

I was just about to make a feature request of the exact same thing. In addition to checking train/test, it could also be insightful to check the state of new data (like compare the 2017 data vs 2018 data).

@sbrugman
Copy link
Collaborator

I am interested in how you see the presentation of this feature in the report. Any ideas?

@lukyanenkomax
Copy link
Author

It could be two columns with reports. Report template should be narrow (less spaces, charts below text)

@Trollgeir
Copy link

I also think there should be some summary in regards to distribution differences.
User story: "I want to easily see/understand which variables vary most between dataset A and B"

@javiergodoy
Copy link

javiergodoy commented Aug 7, 2019

What about a plus-minus chart ?
plus-minus

@github-actions
Copy link

Stale issue

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request 💬 Requests for new features
Projects
None yet
Development

No branches or pull requests

4 participants