[FEATURE] A Universal ML command in PPL for train/predict/trainandpredict #849

jngz-es · 2022-09-20T17:23:58Z

Is your feature request related to a problem?
Currently for each new algorithm in ml-commons, we have to add a new command in PPL which means we have to implement an entire PPL command process including syntax parser, logical plan and physical plan. It is very inefficient for development.
From user interface perspective, it is more clean and reasonable to have one command for all algorithms than each command for each algorithm.

What solution would you like?
We want to provide only one PPL command (ml) for all algorithms in ml-commons about train/predict/trainandpredict. So for new algorithm launch, we just need to add some changes in ml-commons part, don't need to touch PPL plugin part any more.

What alternatives have you considered?
We considered add 3 commands each for train, predict and trainandpredict in terms of ml-commons APIs.

Do you have any additional context?
We plan to keep the existing algorithms PPL commands at this moment, but want to deprecate them in the future.

Example
ml action=train algo=kmeans centroids=3 iterations=2 distance_type='cosine'

dai-chen · 2022-09-20T22:14:37Z

@jngz-es I've marked this as RFC. If needed, you can put some query examples for community feedback. Thanks!

jngz-es · 2022-09-20T22:38:41Z

@jngz-es I've marked this as RFC. If needed, you can put some query examples for community feedback. Thanks!

Added an example query, thanks!

ahopp · 2022-10-26T19:21:49Z

Looks like I'm very late on this, but I think its important to provide justification on why this feature is being developed/have been developed in PPL but not in parity SQL. I assume you all chose PPL because it was easier and/or because you all needed it downstream (e.g., PPL in observability) but I think it's important to share this justification with the community given the adoption (i.e., SQL versus PPL). I realize it is used heavily in the observability plugin experience and if that's the justification, we should be clear.

dai-chen · 2022-11-07T19:40:51Z

I assume we can close this. If anything else, please open new issue labeled with future release version. Thanks!

jngz-es added enhancement New feature or request untriaged labels Sep 20, 2022

dai-chen added PPL Piped processing language RFC Request For Comments and removed untriaged labels Sep 20, 2022

jngz-es mentioned this issue Oct 26, 2022

A Generic ML Command in PPL #971

Merged

6 tasks

dai-chen added the v2.4.0 'Issues and PRs related to version v2.4.0' label Oct 31, 2022

jngz-es mentioned this issue Nov 1, 2022

Add document for ml command. #1001

Merged

6 tasks

anirudha added v2.4.0 'Issues and PRs related to version v2.4.0' and removed v2.4.0 'Issues and PRs related to version v2.4.0' labels Nov 2, 2022

dai-chen closed this as completed Nov 7, 2022

dai-chen added feature and removed enhancement New feature or request labels Nov 7, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] A Universal ML command in PPL for train/predict/trainandpredict #849

[FEATURE] A Universal ML command in PPL for train/predict/trainandpredict #849

jngz-es commented Sep 20, 2022 •

edited

Loading

dai-chen commented Sep 20, 2022

jngz-es commented Sep 20, 2022

ahopp commented Oct 26, 2022

dai-chen commented Nov 7, 2022

[FEATURE] A Universal ML command in PPL for train/predict/trainandpredict #849

[FEATURE] A Universal ML command in PPL for train/predict/trainandpredict #849

Comments

jngz-es commented Sep 20, 2022 • edited Loading

dai-chen commented Sep 20, 2022

jngz-es commented Sep 20, 2022

ahopp commented Oct 26, 2022

dai-chen commented Nov 7, 2022

jngz-es commented Sep 20, 2022 •

edited

Loading