Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update kubernetes_deployment.md #398

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

Rajakavitha1
Copy link
Contributor

@Rajakavitha1 Rajakavitha1 commented Sep 23, 2024

fixes: #383

"Fixes #383 " or "Add description of the problem this PR solves"

Proposed Changes

fixes: kserve#383

Signed-off-by: Rajakavitha Kodhandapani <krajakavitha@gmail.com>
Copy link

netlify bot commented Sep 23, 2024

Deploy Preview for elastic-nobel-0aef7a ready!

Name Link
🔨 Latest commit bf1d4f9
🔍 Latest deploy log https://app.netlify.com/sites/elastic-nobel-0aef7a/deploys/66f102b1a4da0c0008e79c6d
😎 Deploy Preview https://deploy-preview-398--elastic-nobel-0aef7a.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

fixed formatting

Signed-off-by: Rajakavitha Kodhandapani <krajakavitha@gmail.com>
format

Signed-off-by: Rajakavitha Kodhandapani <krajakavitha@gmail.com>
Copy link
Member

@terrytangyuan terrytangyuan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for your contribution!

"disableIngressCreation": true,
"ingressDomain": "svc.cluster.local",
"ingressGateway" : "disabled",
"ingressService" : "disabled"
Copy link

@spolti spolti Sep 23, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

here we also need to set the default ingressClass otherwise you will need to do everything manually later.
If the reason is to decouple from Istio, most of k8s cluster have a default one, we need to give this info to the user e.g. minikube uses nginx.
Instead disabling the ingress would be better to explain both options.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please suggest what change needs to be made to the yaml file.

@@ -1,52 +1,55 @@
# Kubernetes Deployment Installation Guide
KServe supports `RawDeployment` mode to enable `InferenceService` deployment with Kubernetes resources [`Deployment`](https://kubernetes.io/docs/concepts/workloads/controllers/deployment), [`Service`](https://kubernetes.io/docs/concepts/services-networking/service), [`Ingress`](https://kubernetes.io/docs/concepts/services-networking/ingress) and [`Horizontal Pod Autoscaler`](https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale). Comparing to serverless deployment it unlocks Knative limitations such as mounting multiple volumes, on the other hand `Scale down and from Zero` is not supported in `RawDeployment` mode.

Kubernetes 1.22 is the minimally required version and please check the following recommended Istio versions for the corresponding
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not sure if we need to remove it.

Copy link
Contributor Author

@Rajakavitha1 Rajakavitha1 Sep 24, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

My understanding was that Istio was not required The current installation guide for raw deployment mode is not accurate. For example, Istio is not needed if disableIngressCreation is enabled. See https://github.com/kserve/kserve/pull/3436

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Improve raw deployment documentation
3 participants