news 2026/6/15 16:56:43

k8s部署metrics-server

作者头像

张小明

前端开发工程师

1.2k 24
文章封面图
k8s部署metrics-server

k8s部署metrics-server是 Kubernetes 实现资源监控(如kubectl top、HPA 自动扩缩容)的核心组件,在部署过程中遇到过以下问题

  • 镜像拉取失败(k8s.gcr.io镜像国内无法访问);
  • 证书验证问题(需跳过 TLS 验证或配置正确证书);
  • API Server 连接问题(需指定kubelet-insecure-tls)。

部署步骤如下

1.步骤 1:下载官方部署文件(并修改)

# 下载官方 yaml(也可手动创建) wget https://github.com/kubernetes-sigs/metrics-server/releases/latest/download/components.yaml -O metrics-server.yaml

2.步骤 2:修改metrics-server.yaml关键配置

打开metrics-server.yaml,做以下 3 处核心修改:

# 原镜像(国内无法访问) # image: k8s.gcr.io/metrics-server/metrics-server:v0.7.0 # 替换为阿里云镜像(适配 v0.7.0 版本) image: registry.cn-hangzhou.aliyuncs.com/google_containers/metrics-server:v0.7.0

3. 添加启动参数(解决证书 / 连接问题)

Deploymentargs部分,新增以下参数(关键!):

spec: template: spec: containers: - name: metrics-server args: - --cert-dir=/tmp - --secure-port=4443 # 新增以下 3 个参数 - --kubelet-insecure-tls # 跳过 kubelet TLS 验证(测试环境推荐,生产建议配置证书) - --kubelet-preferred-address-types=InternalIP,ExternalIP,Hostname # 指定 kubelet 地址类型 - --metric-resolution=15s # 监控数据采集间隔

4.可选:调整资源限制(根据集群规模)

resources: requests: cpu: 100m memory: 100Mi limits: cpu: 500m memory: 512Mi

5.部署metrics-server

kubectl apply -f metrics-server.yaml

6.验证部署

kubectl get pods -n kube-system -l k8s-app=metrics-server # 正常输出(STATUS 为 Running): # NAME READY STATUS RESTARTS AGE # metrics-server-7f987d68c4-9x8zl 1/1 Running 0 5m
检查 Pod 日志(排查启动失败)
kubectl logs -n kube-system $(kubectl get pods -n kube-system -l k8s-app=metrics-server -o name) # 常见日志错误及解决: # - "x509: certificate signed by unknown authority" → 确认已加 --kubelet-insecure-tls # - "unable to reach kubelet" → 检查 --kubelet-preferred-address-types 参数 # - "image pull failed" → 确认镜像地址正确
验证 API 可用性(核心!)

metrics-server会注册metrics.k8s.ioAPI,检查是否正常:

# 查看节点资源使用 kubectl top nodes # 输出示例: # NAME CPU(cores) CPU% MEMORY(bytes) MEMORY% # k8s-master 123m 6% 1200Mi 30% # k8s-node1 89m 4% 980Mi 25% # 查看 Pod 资源使用 kubectl top pods -n kube-system # 输出包含 metrics-server 自身的资源占用

二.本次部署环境使用修改后的yaml文件如下,可直接使用

apiVersion: v1 kind: ServiceAccount metadata: labels: k8s-app: metrics-server name: metrics-server namespace: kube-system --- apiVersion: rbac.authorization.k8s.io/v1 kind: ClusterRole metadata: labels: k8s-app: metrics-server rbac.authorization.k8s.io/aggregate-to-admin: "true" rbac.authorization.k8s.io/aggregate-to-edit: "true" rbac.authorization.k8s.io/aggregate-to-view: "true" name: system:aggregated-metrics-reader rules: - apiGroups: - metrics.k8s.io resources: - pods - nodes verbs: - get - list - watch --- apiVersion: rbac.authorization.k8s.io/v1 kind: ClusterRole metadata: labels: k8s-app: metrics-server name: system:metrics-server rules: - apiGroups: - "" resources: - pods - nodes - nodes/stats - namespaces - configmaps verbs: - get - list - watch --- apiVersion: rbac.authorization.k8s.io/v1 kind: RoleBinding metadata: labels: k8s-app: metrics-server name: metrics-server-auth-reader namespace: kube-system roleRef: apiGroup: rbac.authorization.k8s.io kind: Role name: extension-apiserver-authentication-reader subjects: - kind: ServiceAccount name: metrics-server namespace: kube-system --- apiVersion: rbac.authorization.k8s.io/v1 kind: ClusterRoleBinding metadata: labels: k8s-app: metrics-server name: metrics-server:system:auth-delegator roleRef: apiGroup: rbac.authorization.k8s.io kind: ClusterRole name: system:auth-delegator subjects: - kind: ServiceAccount name: metrics-server namespace: kube-system --- apiVersion: rbac.authorization.k8s.io/v1 kind: ClusterRoleBinding metadata: labels: k8s-app: metrics-server name: system:metrics-server roleRef: apiGroup: rbac.authorization.k8s.io kind: ClusterRole name: system:metrics-server subjects: - kind: ServiceAccount name: metrics-server namespace: kube-system --- apiVersion: v1 kind: Service metadata: labels: k8s-app: metrics-server name: metrics-server namespace: kube-system spec: ports: - name: https port: 443 protocol: TCP targetPort: 8443 selector: k8s-app: metrics-server --- apiVersion: apps/v1 kind: Deployment metadata: labels: k8s-app: metrics-server name: metrics-server namespace: kube-system spec: selector: matchLabels: k8s-app: metrics-server strategy: rollingUpdate: maxUnavailable: 0 template: metadata: labels: k8s-app: metrics-server spec: containers: - args: - --cert-dir=/tmp - --secure-port=8443 - --kubelet-preferred-address-types=InternalIP,ExternalIP,Hostname - --kubelet-use-node-status-port - --metric-resolution=15s - --kubelet-insecure-tls - --authorization-always-allow-paths=/livez,/readyz image: swr.cn-east-2.myhuaweicloud.com/kuboard-dependency/metrics-server:v0.5.0 imagePullPolicy: IfNotPresent livenessProbe: failureThreshold: 3 httpGet: path: /livez port: https scheme: HTTPS periodSeconds: 10 name: metrics-server ports: - containerPort: 8443 name: https protocol: TCP readinessProbe: failureThreshold: 3 httpGet: path: /readyz port: https scheme: HTTPS initialDelaySeconds: 20 periodSeconds: 10 resources: requests: cpu: 100m memory: 200Mi securityContext: readOnlyRootFilesystem: true runAsNonRoot: true runAsUser: 1000 volumeMounts: - mountPath: /tmp name: tmp-dir nodeSelector: kubernetes.io/os: linux priorityClassName: system-cluster-critical serviceAccountName: metrics-server volumes: - emptyDir: {} name: tmp-dir --- apiVersion: apiregistration.k8s.io/v1 kind: APIService metadata: labels: k8s-app: metrics-server name: v1beta1.metrics.k8s.io spec: group: metrics.k8s.io groupPriorityMinimum: 100 insecureSkipTLSVerify: true service: name: metrics-server namespace: kube-system version: v1beta1 versionPriority: 100
版权声明: 本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若内容造成侵权/违法违规/事实不符,请联系邮箱:809451989@qq.com进行投诉反馈,一经查实,立即删除!
网站建设 2026/6/15 13:16:09

别墅地源热泵安装公司

专业别墅地源热泵安装,瑞冬集团为您打造恒温舒适生活在追求高品质生活的今天,别墅业主对室内环境舒适度的要求越来越高。传统空调系统往往难以满足大面积、多空间的温度调控需求,且运行成本高昂。地源热泵系统凭借其卓越的能效表现和稳定的运…

作者头像 李华
网站建设 2026/6/15 12:17:42

blender新手入门--常用的各类插件详细介绍

核心建模与流程 (Hard Surface & Workflow) BoxCutter & Hard Ops 9 (HOps): * 介绍: 这是 Blender 硬表面建模的“黄金搭档”。BoxCutter 专注于极致流畅的布尔运算(切削、切割、抽取);Hard Ops 则提供了一整套工具栏和快…

作者头像 李华
网站建设 2026/6/15 12:17:10

2025 数通 HCIE 改革后还值不值?

身边不少网工朋友都在纠结:2025年数通HCIE新增排错模块、通过率骤降,现在考HCIE数通认证还值不值?毕竟备考要花不少时间精力,谁都怕考了白忙活。结合今年的改革细节和招聘市场实情,今天就用大白话捋清楚这个问题。一、…

作者头像 李华
网站建设 2026/6/15 13:14:02

【顶级开发者私藏】:VSCode对接量子处理器的7个隐秘测试流程

第一章:VSCode 量子硬件的适配测试在探索量子计算开发环境的过程中,VSCode 凭借其强大的插件生态和可扩展性,成为连接经典编程与量子硬件的重要桥梁。通过集成 Q#、Qiskit 等量子开发框架,VSCode 能够实现对真实量子处理器&#x…

作者头像 李华
网站建设 2026/6/15 6:30:47

MCU+AT,必将让位于OpenCPU【第三章】

第三章:OpenCPU架构的原理、运行机制与演进逻辑能否让功能日益强大的通信模组自己承担所有计算与控制任务,从而开启一个更高效,让模组“自己思考”的新时代?这正是OpenCPU架构所实现的革命性跨越。3.1从“外设”到“主机”&#x…

作者头像 李华
网站建设 2026/6/14 14:17:13

【稀缺资源】资深工程师私藏的Azure QDK API文档阅读方法论

第一章:Azure QDK API文档的核心价值与应用场景 Azure Quantum Development Kit(QDK)API文档为量子计算开发者提供了构建、仿真和优化量子算法的关键支持。它不仅定义了语言级抽象与运行时接口,还统一了经典计算与量子操作的交互范…

作者头像 李华