Questions tagged [google-kubernetes-engine]

Google Kubernetes Engine (formerly known as Container Engine) takes care of provisioning and maintaining the underlying virtual machine cluster, scaling your application, and operational logistics like logging, monitoring, and health management.

Google Kubernetes Engine takes care of provisioning and maintaining the underlying virtual machine cluster, scaling your application, and operational logistics like logging, monitoring, and health management.

6757 questions
2
votes
1 answer

GKE autopilot not scaling up at all

I can use GKE Autopilot to run arbitrary workloads on a sandbox project (with default networks, default service account, default firewall rules) just fine. But I need to create a GKE Autopilot cluster in an existing project which isn't using the…
Boris
  • 601
  • 3
  • 13
2
votes
1 answer

Should I disable Cloud Build and GCR after building an app on Kubernetes Engine?

I am a first time cloud user: I have created an application using Docker Image, then stored it in Google Container Registry (GCR) and then deployed it in Google Kubernetes Engine (GKE). Now the application is working fast and properly. I was…
Sticky
  • 151
  • 1
  • 10
2
votes
2 answers

Unschedulable GPU workload on GKE from node pool

I am running a GPU intensive workload on demand on GKE Standard, where I have created the appropriate node pool with minimum 0 and maximum 5 nodes. However, when a Job is scheduled on the node pool, GKE presents the following error: Events: Type …
2
votes
1 answer

getting error while implementing metric server inside the Kubernetes

"Failed to scrape node" ` err="GET \"https://10.128.0.17:10250/stats/summary? only_cpu_and_memory=true\": bad status code \"403 Forbidden\"" node="gke-zipydev-cluster-zipy-pool-b4bfa53a-t575" I1215 10:33:03.405180 1 server.go:188] "Failed…
Nikhil Verma
  • 21
  • 1
  • 2
2
votes
1 answer

How to get IAM/service account used by juicefs to access GCS in GKE?

I'm using a juicefs-csi in GKE. I use postgre as meta-store and GCS as storage. The corresponding setting is as follow: node: # ... storageClasses: - name: juicefs-sc enabled: true reclaimPolicy: Retain backend: …
goFrendiAsgard
  • 4,016
  • 8
  • 38
  • 64
2
votes
1 answer

Failed to export the service using GKE Multi-Cluster Service

I'm configuring the GKE Multi-Cluster Service according to document1 and document2 and inspired by the multi-cluster-serice-communication-in-gke tutorial. Somehow I'm failing on the "Registering a Service for export" on second cluster. I'm using the…
2
votes
1 answer

GKE Pod download speed of BigQuery API is very slow

I have an image that performs a query on BigQuery and then downloads the result (about 8GB) with the following code: query_job = client.query(my_sql_query) query_result = query_job.result() data_table =…
2
votes
1 answer

How to allow GKE internal Pod to communicate through VPN to an internal IP in another VPC?

I have a GKE cluster (private one) with a NAT that I need to put in networking with a legacy VPC (in another GCP project). I built a classic VPN between Project B (new) and Project A (old): all VM can talk to each other (nc -vz is my friend). The…
2
votes
0 answers

PubSub GKE Connectivity Issue - Failed to create producer binding; retrying in 30 seconds

I am using Spring Cloud Stream and trying to deploy it onto GKE and connect it to PubSub to send messages. Application works perfectly fine locally, sending messages to a specific topic on GCP but once deployed on GKE, I get following error: ERROR 1…
2
votes
1 answer

Can you redirect HTTP to HTTPS with a k8s Ingress?

I've tried everything on stack overflow and beyond and can't find a solution that works to redirect http to https. My current config is below. My ingress is: apiVersion: networking.k8s.io/v1 kind: Ingress metadata: name: ingress annotations: …
nickponline
  • 25,354
  • 32
  • 99
  • 167
2
votes
1 answer

How to reserve a node in GKE Autopilot

Is it possible to keep always one additional node in GKE autopilot reserved besides of the currently used nodes to help reduce scaling time? For example if we currently have 5 nodes and there is a spike in the application traffic, it often happens…
2
votes
2 answers

Error 403: Storage objects forbidden in GCP

I’m trying to create all new sandbox project in GCP for easy deployment and upgrade project. Using Terraform I am creating a GKE cluster. Issue is, the terraform scripts are written for the service accounts of a project named let’s say NP-H. Now, I…
2
votes
0 answers

Ambassador Host {hostname}: invalid TLS secret ssl-cert, marking inactive

I am using the Ambassador in my GKE cluster. I am getting below issue but I am having valid ssl certificate as secret in my configuration. Secret myapp-ssl-cert.ambassadorpoc unknown Host myapp: invalid TLS secret myapp-ssl-cert, marking…
2
votes
1 answer

OSError: Timed out trying to connect to Dask Scheduler from GCE to GKE - externalIP

I'm trying connect my dask Client to the dask scheduler service deployed on my GKE. The deployment has been done with Helm and the dask-scheduler service has LoadBalancer as type, so it has an externalIP. When trying to connect from inside the…
2
votes
1 answer

Is it possible to get GCP's ANY distribution for Kubernetes GKE node pool?

I have a GKE Kubernetes cluster running on GCP. This cluster has multiple node pools set with autoscale ON and placed at us-central1-f. Today we started getting a lot of errors on these Node pools' Managed Instance Groups saying that us-central1-f…