Skip to content
This repository has been archived by the owner on Feb 1, 2022. It is now read-only.

project setup #3

Merged
merged 5 commits into from
May 6, 2019
Merged

project setup #3

merged 5 commits into from
May 6, 2019

Conversation

sperlingxx
Copy link
Member

@sperlingxx sperlingxx commented Apr 30, 2019

I'm trying to build the skeleton of the project by following works:

  • setup Gopkg.toml
  • setup code-gen tools
  • define XGBoostJob and XGBoostJobSpec
  • generate boilerplate codes from XGBoostJob and XGBoostJobSpec

This change is Reviewable

@richardsliu
Copy link

Since this is a new repo, should the API version start with v1alpha1?

@richardsliu
Copy link

Also should the repo be named xgboost-operator to be consistent with others?

@terrytangyuan
Copy link
Member

terrytangyuan commented Apr 30, 2019

+1 to rename this repo for consistency. We also mentioned this in kubeflow/community#247 (comment). Maybe @jlewi @richardsliu or someone else with access could help change it?

// active.
RunPolicy *commmonv1.RunPolicy `json:"runPolicy,omitempty"`

// XGBReplicaSpec specifies the PyTorch replicas to run.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

pytorch -> typo here

RunPolicy *commmonv1.RunPolicy `json:"runPolicy,omitempty"`

// XGBReplicaSpec specifies the PyTorch replicas to run.
XGBReplicaSpec *commmonv1.ReplicaSpec `json:"xgbReplicaSpec"`
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A couple things here:

  • XGBReplicaSpec should be plural XGBReplicaSpecs
  • You might want to define the corresponding XGBReplicaType so that XGBReplicaSpecs is a map, e.g. map[XGBReplicaType]*ReplicaSpec
  • Should we consider using full name XGBoost or XGB here? I'd suggest we keep it consistent.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think since distributed XGBoost run in AllReduce way, users are not expected to define master spec and worker spec separately. So, we only need a single ReplicaSpec.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I saw master and worker replicas in the original proposal kubeflow/community#247 though.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The master is selected from the cluster like the pytorch job, thus, we can leave the replica for the master like the pytorch job here. it is possible that users want to more than one replica of master. For the worker. the replica is required for fault-tolerance .

@@ -0,0 +1,84 @@
ignored = ["github.com/kubeflow/xgboost"]
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do you need all these dependencies here yet? I'd suggest we add them when needed.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'll remove unused dep.

// Read-only.
Status commmonv1.JobStatus `json:"status,omitempty"`

commmonv1.CleanPodPolicy
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Isn't CleanPodPolicy also part of commmonv1.RunPolicy? Why is it separately added?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh~Its a mistake, I'll remove it.

@merlintang
Copy link
Contributor

LGTM! good start !

Copy link
Member

@terrytangyuan terrytangyuan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

@merlintang
Copy link
Contributor

lgtm

Copy link
Member

@terrytangyuan terrytangyuan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/approve

@k8s-ci-robot
Copy link

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: terrytangyuan

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot merged commit 21a77d1 into kubeflow:master May 6, 2019
xfate123 pushed a commit to xfate123/xgboost-operator that referenced this pull request May 16, 2020
* build bolierplate

* add verify-codegen.sh

* some modifications

* fix update-codegen.sh

* add register/defaults/constants for v1alpha1
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants