news 2026/5/1 11:40:09

【CANN训练营】体验基于Caffe ResNet-50网络实现图片分类实践操作

作者头像

张小明

前端开发工程师

1.2k 24
文章封面图
【CANN训练营】体验基于Caffe ResNet-50网络实现图片分类实践操作

实例功能

很简单的一个实例,功能就是一个实现图片分类的功能,然后拓展实现以下

  • 将一张YUV420SP格式的图片编码为*.jpg格式的图片。
  • 将两张*.jpg格式的解码成两张YUV420SP NV12格式的图片,缩放,再进行模型推理,分别得到两张图片的推理结果后,处理推理结果,输出最大置信度的类别标识以及top5置信度的总和。
  • 将两张*.jpg格式的解码成两张YUV420SP NV12格式的图片,抠图,再进行模型推理,分别得到两张图片的推理结果后,处理推理结果,输出最大置信度的类别标识以及top5置信度的总和。
  • 将两张*.jpg格式的解码成两张YUV420SP NV12格式的图片,抠图贴图,再进行模型推理,分别得到两张图片的推理结果后,处理推理结果,输出最大置信度的类别标识以及top5置信度的总和。
  • 将YUV420SP NV12格式的图片(分辨率8192*8192)缩放,得到4000*4000。

环境及环境版本介绍

NPU:Ascend910(32GB)

CANN版本:CANN-8.0.RC3.alpha001

开始实践

创建conda环境

conda create -n cann_demopython=3.8-y conda activate cann_demo

安装CANN

wgethttps://ascend-repo.obs.cn-east-2.myhuaweicloud.com/Milan-ASL/Milan-ASL%20V100R001C20SPC703/Ascend-cann-toolkit_8.0.0.alpha003_linux-aarch64.runbashAscend-cann-toolkit_8.0.0.alpha003_linux-aarch64.run --full

激活环境变量

source/home/ma-user/Ascend/ascend-toolkit/set_env.sh

下载体验代码仓

gitclone -b v0.3-8.0.0.alpha003 https://gitee.com/Ascend/samples

进入示例文件夹

cdsamples/cplusplus/level2_simple_inference/1_classification/vpc_jpeg_resnet50_imagenet_classification

获取ResNet-50原始模型

下载模型CAFFE文件

cdcaffe_modelwgethttps://obs-9be7.obs.cn-east-2.myhuaweicloud.com/003_Atc_Models/AE/ATC%20Model/resnet50/resnet50.caffemodelwgethttps://obs-9be7.obs.cn-east-2.myhuaweicloud.com/003_Atc_Models/AE/ATC%20Model/resnet50/resnet50.prototxtcd..

安装依赖

pipinstalldecorator attrs psutil sympy scipy

转换模型

atc --model=caffe_model/resnet50.prototxt --weight=caffe_model/resnet50.caffemodel --framework=0--soc_version=Ascend910 --insert_op_conf=caffe_model/aipp.cfg --output=model/resnet50_aipp

准备测试图片

cddatawgethttps://obs-9be7.obs.cn-east-2.myhuaweicloud.com/models/aclsample/dvpp_vpc_8192x8192_nv12.yuvwgethttps://obs-9be7.obs.cn-east-2.myhuaweicloud.com/models/aclsample/persian_cat_1024_1536_283.jpgwgethttps://obs-9be7.obs.cn-east-2.myhuaweicloud.com/models/aclsample/wood_rabbit_1024_1061_330.jpgwgethttps://obs-9be7.obs.cn-east-2.myhuaweicloud.com/models/aclsample/wood_rabbit_1024_1068_nv12.yuvcd..

编译运行

安装依赖

condainstall-c conda-forge cmake condainstall-c conda-forge binutils

创建目录

mkdir-p build/intermediates/host

设置环境变量

source/home/ma-user/Ascend/ascend-toolkit/set_env.shexportDDK_PATH=$HOME/Ascend/ascend-toolkit/latestexportNPU_HOST_LIB=$DDK_PATH/runtime/lib64/stub

生成编译文件

cdbuild/intermediates/host cmake../../../src -DCMAKE_CXX_COMPILER=g++ -DCMAKE_SKIP_RPATH=TRUEmake

运行

设置main文件权限为可运行

cd ../../../out chmod +x main

将两张*.jpg格式的解码成两张YUV420SP NV12格式的图片,缩放,再进行模型推理,分别得到两张图片的推理结果

./main0

运行结果

[INFO] ./main param, param represents a vpc feature and must be set [INFO] start check result fold:./result [INFO] make directory successfully. [INFO] check result success, fold exist [INFO] acl init success [INFO] set device 0 success [INFO] create context success [INFO] create stream success [INFO] get run mode success [INFO] dvpp init resource success [INFO] load model ../model/resnet50_aipp.om success [INFO] create model description success [INFO] create model output success [INFO] model input width 224, input height 224 [INFO] ------------------------------------------- [INFO] start to process picture:../data/persian_cat_1024_1536_283.jpg [INFO] call JpegD [INFO] call vpcResize [INFO] Process dvpp success [INFO] create model input success [INFO] model execute success [INFO] destroy model input success [INFO] result : classType[283], top1[0.969727], top5[0.979855] [INFO] ------------------------------------------- [INFO] start to process picture:../data/wood_rabbit_1024_1061_330.jpg [INFO] call JpegD [INFO] call vpcResize [INFO] Process dvpp success [INFO] create model input success [INFO] model execute success [INFO] destroy model input success [INFO] result : classType[331], top1[0.895508], top5[1.000134] [INFO] ------------------------------------------- [INFO] unload model success, modelId is 1 [INFO] destroy model description success [INFO] destroy model output success [INFO] execute sample success [INFO] end to destroy stream [INFO] end to destroy context [INFO] end to reset device 0 [INFO] end to finalize acl

将两张*.jpg格式的解码成两张YUV420SP NV12格式的图片,抠图,再进行模型推理,分别得到两张图片的推理结果。

./main1

运行结果

[INFO] ./main param, param represents a vpc feature and must be set [INFO] start check result fold:./result [INFO] check result success, fold exist [INFO] acl init success [INFO] set device 0 success [INFO] create context success [INFO] create stream success [INFO] get run mode success [INFO] dvpp init resource success [INFO] load model ../model/resnet50_aipp.om success [INFO] create model description success [INFO] create model output success [INFO] model input width 224, input height 224 [INFO] ------------------------------------------- [INFO] start to process picture:../data/persian_cat_1024_1536_283.jpg [INFO] call JpegD [INFO] call vpcCrop [INFO] Process dvpp success [INFO] create model input success [INFO] model execute success [INFO] destroy model input success [INFO] result : classType[283], top1[0.996094], top5[0.999629] [INFO] ------------------------------------------- [INFO] start to process picture:../data/wood_rabbit_1024_1061_330.jpg [INFO] call JpegD [INFO] call vpcCrop [INFO] Process dvpp success [INFO] create model input success [INFO] model execute success [INFO] destroy model input success [INFO] result : classType[330], top1[0.859863], top5[1.000106] [INFO] ------------------------------------------- [INFO] unload model success, modelId is 1 [INFO] destroy model description success [INFO] destroy model output success [INFO] execute sample success [INFO] end to destroy stream [INFO] end to destroy context [INFO] end to reset device 0 [INFO] end to finalize acl

将两张*.jpg格式的解码成两张YUV420SP NV12格式的图片,抠图贴图,再进行模型推理,分别得到两张图片的推理结果。

./main2

运行结果

[INFO] ./main param, param represents a vpc feature and must be set [INFO] start check result fold:./result [INFO] check result success, fold exist [INFO] acl init success [INFO] set device 0 success [INFO] create context success [INFO] create stream success [INFO] get run mode success [INFO] dvpp init resource success [INFO] load model ../model/resnet50_aipp.om success [INFO] create model description success [INFO] create model output success [INFO] model input width 224, input height 224 [INFO] ------------------------------------------- [INFO] start to process picture:../data/persian_cat_1024_1536_283.jpg [INFO] call JpegD [INFO] call vpcCropAndPaste [INFO] Process dvpp success [INFO] create model input success [INFO] model execute success [INFO] destroy model input success [INFO] result : classType[283], top1[0.431885], top5[0.751892] [INFO] ------------------------------------------- [INFO] start to process picture:../data/wood_rabbit_1024_1061_330.jpg [INFO] call JpegD [INFO] call vpcCropAndPaste [INFO] Process dvpp success [INFO] create model input success [INFO] model execute success [INFO] destroy model input success [INFO] result : classType[330], top1[0.685059], top5[0.969410] [INFO] ------------------------------------------- [INFO] unload model success, modelId is 1 [INFO] destroy model description success [INFO] destroy model output success [INFO] execute sample success [INFO] end to destroy stream [INFO] end to destroy context [INFO] end to reset device 0 [INFO] end to finalize acl

将一张YUV420SP格式的图片编码为*.jpg格式的图片。

./main3

运行结果

[INFO] ./main param, param represents a vpc feature and must be set [INFO] start check result fold:./result [INFO] check result success, fold exist [INFO] acl init success [INFO] set device 0 success [INFO] create context success [INFO] create stream success [INFO] get run mode success [INFO] dvpp init resource success [INFO] start to jpege picture ../data/wood_rabbit_1024_1068_nv12.yuv [INFO] end to destroy stream [INFO] end to destroy context [INFO] end to reset device 0 [INFO] end to finalize acl

将一张分辨率为8192*8192的YUV420SP格式的图片缩放至4000*4000。

./main4

运行结果

[INFO] ./main param, param represents a vpc feature and must be set [INFO] start check result fold:./result [INFO] check result success, fold exist [INFO] acl init success [INFO] set device 0 success [INFO] create context success [INFO] create stream success [INFO] get run mode success [INFO] dvpp process 8k resize begin [INFO] dvpp init resource success [INFO] dvpp process 8k resize success [INFO] end to destroy stream [INFO] end to destroy context [INFO] end to reset device 0 [INFO] end to finalize acl

整体运行结果

执行可执行文件成功后,同时会在main文件同级的result目录下生成结果文件,便于后期查看。结果文件如下:

  • dvpp_output_0:persian_cat_1024_1536_283.jpg:图片经过缩放或抠图或抠图贴图之后的结果图片。
  • dvpp_output_1:wood_rabbit_1024_1061_330.jpg:图片经过缩放或抠图或抠图贴图之后的结果图片。
  • model_output_0:persian_cat_1024_1536_283.jpg:图片的模型推理结果,二进制文件。
  • model_output_0.txt:persian_cat_1024_1536_283.jpg:图片的模型推理结果,txt文件。
  • model_output_1:wood_rabbit_1024_1061_330.jpg:图片的模型推理结果,二进制文件。
  • model_output_1.txt:wood_rabbit_1024_1061_330.jpg:图片的模型推理结果,txt文件。
  • jpege_output_0.jpg:wood_rabbit_1024_1068_nv12.yuv:图片结果编码后的结果图片。
  • dvpp_vpc_4000x4000_nv12.yuv:dvpp_vpc_8192x8192_nv12.yuv:图片缩放后的结果图片。
版权声明: 本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若内容造成侵权/违法违规/事实不符,请联系邮箱:809451989@qq.com进行投诉反馈,一经查实,立即删除!
网站建设 2026/5/1 6:12:10

Vue3小白入门:官方文档这样读最有效

快速体验 打开 InsCode(快马)平台 https://www.inscode.net输入框内输入如下内容: 开发一个Vue3新手学习路径引导应用,包含:1) 分阶段学习路线图 2) 每个知识点的简化解释 3) 互动式代码练习区 4) 学习进度跟踪。设计要友好活泼&#xff0c…

作者头像 李华
网站建设 2026/5/1 7:20:20

DIFY安装与快速原型开发:1小时搞定AI应用

快速体验 打开 InsCode(快马)平台 https://www.inscode.net输入框内输入如下内容: 展示如何通过DIFY在1小时内快速开发一个AI应用原型。选择一个具体的应用场景(如情感分析或推荐系统),详细介绍从安装DIFY到完成原型的全过程。重…

作者头像 李华
网站建设 2026/5/1 8:42:13

开源图像识别模型部署难题破解:conda环境一键激活

开源图像识别模型部署难题破解:conda环境一键激活 万物识别-中文-通用领域:阿里开源的视觉理解新范式 在当前AI大模型快速发展的背景下,多模态理解能力正成为智能系统的核心竞争力之一。阿里巴巴近期开源的「万物识别-中文-通用领域」图像识别…

作者头像 李华
网站建设 2026/5/1 7:32:14

跨平台实战:在手机APP中集成中文万物识别模型的完整流程

跨平台实战:在手机APP中集成中文万物识别模型的完整流程 作为一名移动开发者,你是否想过在自己的APP中加入实时物体识别功能?比如让用户用手机摄像头扫一扫就能识别出周围的物体类别。听起来很酷对吧?但当你真正开始尝试时&#x…

作者头像 李华
网站建设 2026/5/1 8:38:22

小白必看:Chrome109离线版下载安装图解教程

快速体验 打开 InsCode(快马)平台 https://www.inscode.net输入框内输入如下内容: 设计一个交互式教程应用,通过分步引导帮助用户完成Chrome109离线版的获取和安装。包含:1) 官方下载源识别 2) 文件验证方法 3) 安装过程截图指导 4) 常见问…

作者头像 李华