
MMSeg cannot train on a single-class custom dataset

news 2025/7/9 16:31:15

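First published, with later updates, at https://mwhls.top/4423.html. If images or the table of contents are missing, or the formatting is broken, see the original page, which also has more related posts.
For new updates, see mwhls.top.
Questions and criticism are welcome. Thank you!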

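Abstract: Convert three-channel label images to single-channel images and unify the per-class pixel values to 0, 1, 2, to fix the MMSeg error and the failure to train.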

Description

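Training on a custom dataset raised an error. In theory everything else was correct, so the problem had to be the images.
I had prepared two datasets this time. The earlier one raised the same error, but I had worked around it somehow. The dataset that worked has color GT images and the failing one has black-and-white GT images. On inspection the black-and-white images were also three-channel, so it should not have been a channel problem.
However, after checking the official issues, I found that they recommend single-channel labels: https://github.com/open-mmlab/mmsegmentation/issues/1625#issuecomment-1140384065
After switching to single-channel labels, the error below disappeared, but a new problem appeared: abnormal metrics and loss.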

ValueError: Input and output must have the same number of spatial dimensions, but got input with with spatial dimensions of [128, 128] and output size of torch.Size([512, 512, 3]). Please provide input tensor in (N, C, d1, d2, ...,dK) format and output size in (o1, o2, ...,oK) format.
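The `torch.Size([512, 512, 3])` in the message suggests that the label still carried a channel axis. A minimal sketch of a check one could run on a label before training follows; `inspect_label` and the toy array are hypothetical names for illustration and are not part of MMSeg.

```python
import numpy as np


def inspect_label(label):
    """Return (is_single_channel, sorted unique pixel values) for a label array."""
    label = np.asarray(label)
    is_single_channel = label.ndim == 2
    values = np.unique(label).tolist()
    return is_single_channel, values


# Hypothetical 4x4 black-and-white ground truth stored with three channels.
rgb_label = np.zeros((4, 4, 3), dtype=np.uint8)
rgb_label[1:3, 1:3] = 255

print(inspect_label(rgb_label))           # three channels, values 0 and 255
print(inspect_label(rgb_label[:, :, 0]))  # single channel, same values
```

A label that reports three channels, or values other than consecutive class indices, is a likely candidate for the conversion described in this post.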

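After many tests, I split the single class into two classes, background and target, with pixel values 0 and 1 respectively (on the 0-255 scale), and that solved it.
The target class then turned out to be hard to train; changing the loss weights solved that.
- ref: https://blog.patrickcty.cc/2021/05/21/mmsegmentation%E4%BA%8C%E5%88%86%E7%B1%BB%E9%85%8D%E7%BD%AE/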
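As a rough illustration of the loss-weight change, an MMSeg config can pass `class_weight` to `CrossEntropyLoss` in the decode head. The weight values below are assumptions chosen only for illustration; the weights actually used are not given in this post.

```python
# Sketch of a config fragment; the weight values are hypothetical.
num_classes = 2              # background (0) + target (1)
class_weight = [0.2, 1.8]    # assumed weights, background first

decode_head = dict(
    num_classes=num_classes,
    loss_decode=dict(
        type='CrossEntropyLoss',
        use_sigmoid=False,
        loss_weight=1.0,
        class_weight=class_weight,  # one weight per class
    ),
)
```

In a full config these keys would be merged into the `decode_head` of the `model` dict; a larger weight on the target class makes errors on it count more in the loss.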
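By the way, MMSeg says it updated the handling for the case where the number of classes is 1, but after upgrading to the latest version it still behaved the same as the old version for me.
- See: https://github.com/open-mmlab/mmsegmentation/pull/2016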

Code

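The sorting code is a bit hard to write and I did not feel like thinking, so there is only a tailor-made script.
Given images whose background value is (0, 0, 0) and target value is (255, 255, 255), the code changes them to (0) and (1).
Put the two files below in the same folder and run channel3to1.py. Enter the folder to process (subfolders are searched recursively); the results are written to the log folder.
- I have actually written many small tools like this, but only uploaded the very first version to GitHub. Too lazy…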
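For labels with more than two colors, the general case that the script below does not cover, one possible approach is a palette lookup that maps each color to a class index 0, 1, 2. This is only a sketch: `colors_to_indices` and the palette are hypothetical, and the colors must be listed in the same channel order as the loaded array (`cv2.imread` returns BGR).

```python
import numpy as np


def colors_to_indices(label_rgb, palette):
    """Map an (H, W, 3) color label to an (H, W) uint8 class-index map.

    palette[i] is the color of class i; pixels matching no color stay 0.
    """
    label_rgb = np.asarray(label_rgb)
    index_map = np.zeros(label_rgb.shape[:2], dtype=np.uint8)
    for class_index, color in enumerate(palette):
        # True where all three channel values equal this class color.
        matches = np.all(label_rgb == np.array(color, dtype=label_rgb.dtype), axis=-1)
        index_map[matches] = class_index
    return index_map


# Hypothetical palette: class 0 black, class 1 white, class 2 (255, 0, 0).
palette = [(0, 0, 0), (255, 255, 255), (255, 0, 0)]
label = np.zeros((2, 3, 3), dtype=np.uint8)
label[0, 1] = (255, 255, 255)
label[1, 2] = (255, 0, 0)
indices = colors_to_indices(label, palette)
print(indices.tolist())  # [[0, 1, 0], [0, 0, 2]]
```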

# channel3to1.py
from base_model import BaseModel
import cv2
import numpy as np


class Channels3to1(BaseModel):
    def __init__(self):
        super().__init__()
        # Write results into ./log/Channels3to1/<timestamp>/
        self.change_log_path()

    def run(self):
        path = input("Input path: ")
        # Collect every file under path, including subfolders.
        files_path = self.get_path_content(path, 'allfile')
        self.log(f"Path: {path}")
        for i, file_path in enumerate(files_path):
            self.log(f"{i+1}: {file_path}")
        for i, file_path in enumerate(files_path):
            img = cv2.imread(file_path)
            H, W, C = img.shape
            # Keep only the first channel, then map 0 -> 0 and non-zero -> 1.
            img = img[:, :, 0].tolist()
            for h in range(H):
                for w in range(W):
                    if img[h][w] != 0:
                        img[h][w] = [1]
                    else:
                        img[h][w] = [0]
            # Use uint8 explicitly so cv2.imwrite gets an 8-bit image on every platform.
            img = np.array(img, dtype=np.uint8)
            save_path = self.log_dir + "/" + self.path2name(file_path, keep_ext=True)
            cv2.imwrite(save_path, img, [int(cv2.IMWRITE_PNG_COMPRESSION), 0])
            self.log(f"{i+1}: {file_path} converted (H, W, 3) -> (H, W, 1) to {save_path}")


if __name__ == "__main__":
    Channels3to1().run()
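The per-pixel loop in channel3to1.py can probably be replaced by a single vectorized NumPy expression, which should be much faster on large images. The helper name `binarize_first_channel` is hypothetical, and the result has shape (H, W) rather than (H, W, 1).

```python
import numpy as np


def binarize_first_channel(img):
    """(H, W, 3) array -> (H, W) uint8 mask: 0 stays 0, anything else becomes 1."""
    img = np.asarray(img)
    return (img[:, :, 0] != 0).astype(np.uint8)


# Toy 2x2 image: one white pixel on a black background.
toy = np.zeros((2, 2, 3), dtype=np.uint8)
toy[0, 1] = (255, 255, 255)
mask = binarize_first_channel(toy)
print(mask.tolist())  # [[0, 1], [0, 0]]
```

The resulting mask can be passed to `cv2.imwrite` in the same way as the array built by the loop.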

# base_model.py
import os
import os.path as osp
import re
import json
import time
import datetime


class BaseModel():
    """
    BaseModel, call it "utils" is OK.
    """

    def __init__(self, log_dir='', lang='en'):
        if log_dir == '':
            self.log_root = f"./log/{self.__class__.__name__}"
        else:
            self.log_root = log_dir
        self.log_dir = self.log_root
        self.timestamp = time.time()
        self.log_file = f"{self.__class__.__name__}_{self.timestamp}.log"
        # self.lang_path = "./languages"
        # self.lang_dict = {
        #     "en": "English.json",
        #     "zh": "Chinese.json"
        # }
        # self.lang_encoding = {
        #     "en": "utf-8",
        #     "zh": "gb18030"
        # }
        # self.lang = {}
        # self.parse_from_language("zh")

    def help(self):
        """ Help function
        Print the help message
        """
        self.log(self.__doc__)

    def change_log_path(self, mode="timestamp"):
        if mode == "timestamp":
            self.log_dir = osp.join(self.log_root, str(self.timestamp))
        elif mode == "root":
            self.log_dir = self.log_root

    def init_log_file(self):
        self.log_file = f"{self.__class__.__name__}_{time.time()}.log"

    def get_path_content(self, path, mode='allfile'):
        """
        mode:
            allfile: All files in path, including files in subfolders.
            file: Files in path, only including files in this dir: path
            dir: Dirs in path, only including Dir in this dir: path
        """
        path_content = []
        index = 0
        for root, dirs, files in os.walk(path):
            index += 1
            if mode == 'allfile':
                for file in files:
                    file_path = osp.join(root, file)
                    path_content.append(file_path)
            if mode == 'file':
                for file in files:
                    file_path = osp.join(root, file)
                    path_content.append(file_path)
                break
            if mode == 'dir':
                for dir in dirs:
                    dir_path = osp.join(root, dir)
                    path_content.append(dir_path)
                break
        return path_content

    def is_file_meet(self, file_path,
                     condition={'size_max': '10M', 'size_min': '10M',
                                'ext_allow': ['pth', 'pt', 't'],
                                'ext_forbid': ['pth', 'pt', 't'],
                                'name_allow': ['epoch_99.t'],
                                'name_forbid': ['epoch_99.t']}):
        meet = True
        for k, v in condition.items():
            if k == 'size_max':
                # file size should <= size_max
                max_value = self.unit_conversion(v, 'B')
                file_size = os.path.getsize(file_path)
                if not file_size <= max_value:
                    meet = False
            elif k == 'size_min':
                # file size should >= size_min
                min_value = self.unit_conversion(v, 'B')
                file_size = os.path.getsize(file_path)
                if not file_size >= min_value:
                    meet = False
            elif k == 'ext_allow':
                # file's extension name should in ext_allow[]
                _, file_name = os.path.split(file_path)
                _, ext = os.path.splitext(file_name)
                ext = ext[1:]
                if not ext in v:
                    meet = False
            elif k == 'ext_forbid':
                # file's extension name shouldn't in ext_forbid[]
                _, file_name = os.path.split(file_path)
                _, ext = os.path.splitext(file_name)
                ext = ext[1:]
                if ext in v:
                    meet = False
            elif k == 'name_allow':
                # file's name should in name_allow[]
                _, file_name = os.path.split(file_path)
                if not file_name in v:
                    meet = False
            elif k == 'name_forbid':
                # file's name shouldn't in name_forbid[]
                _, file_name = os.path.split(file_path)
                if file_name in v:
                    meet = False
        return meet

    def unit_conversion(self, size, output_unit='B'):
        # convert [GB, MB, KB, B] to [GB, MB, KB, B]
        if not isinstance(size, str):
            return size
        # to Byte
        size = size.upper()
        if 'GB' == size[-2:] or 'G' == size[-1]:
            size = size.replace("G", '')
            size = size.replace("B", '')
            size_num = float(size)
            size_num = size_num * 1024 * 1024 * 1024
        elif 'MB' == size[-2:] or 'M' == size[-1]:
            size = size.replace("M", '')
            size = size.replace("B", '')
            size_num = float(size)
            size_num = size_num * 1024 * 1024
        elif 'KB' == size[-2:] or 'K' == size[-1]:
            size = size.replace("K", '')
            size = size.replace("B", '')
            size_num = float(size)
            size_num = size_num * 1024
        elif 'B' == size[-1]:
            size = size.replace("B", '')
            size_num = float(size)
        else:
            raise
        # to output_unit
        if output_unit in ['GB', 'G']:
            size_num = size_num / 1024 / 1024 / 1024
        if output_unit in ['MB', 'M']:
            size_num = size_num / 1024 / 1024
        if output_unit in ['KB', 'K']:
            size_num = size_num / 1024
        if output_unit in ['B']:
            size_num = size_num
        # return
        return size_num

    def mkdir(self, path):
        if not osp.exists(path):
            os.makedirs(path)

    def split_content(self, content):
        if isinstance(content[0], str):
            content_split = []
            for path in content:
                content_split.append(osp.split(path))
            return content_split
        elif isinstance(content[0], list):
            contents_split = []
            for group in content:
                content_split = []
                for path in group:
                    content_split.append(osp.split(path))
                contents_split.append(content_split)
            return contents_split

    def path_to_last_dir(self, path):
        dirname = osp.dirname(path)
        last_dir = osp.basename(dirname)
        return last_dir

    def path2name(self, path, keep_ext=False):
        _, filename = osp.split(path)
        if keep_ext:
            return filename
        file, _ = osp.splitext(filename)
        return file

    def sort_list(self, list):
        # copy from: https://www.modb.pro/db/162223
        # To make 1, 10, 2, 20, 3, 4, 5 -> 1, 2, 3, 4, 5, 10, 20
        list = sorted(list, key=lambda s: [int(s) if s.isdigit() else s for s in sum(re.findall(r'(\D+)(\d+)', 'a'+s+'0'), ())])
        return list

    def file_last_subtract_1(self, path, mode='-'):
        """
        Just for myself.
        file:
            xxx.png 1
            ccc.png 2
        ---> mode='-' --->
        file:
            xxx.png 0
            ccc.png 1
        """
        with open(path, 'r') as f:
            lines = f.readlines()
        res = []
        for line in lines:
            last = -2 if line[-1] == '\n' else -1
            line1, line2 = line[:last], line[last]
            if mode == '-':
                line2 = str(int(line2) - 1)
            elif mode == '+':
                line2 = str(int(line2) + 1)
            line = line1 + line2 + "\n"
            if last == -1:
                line = line1 + line2
            res.append(line)
        with open(path, 'w') as f:
            f.write("".join(res))

    def log(self, content):
        time_now = datetime.datetime.now()
        content = f"{time_now}: {content}\n"
        self.log2file(content, self.log_file, mode='a')
        print(content, end='')

    def append2file(self, path, text):
        with open(path, 'a') as f:
            f.write(text)

    def log2file(self, content, log_path='log.txt', mode='w', show=False):
        self.mkdir(self.log_dir)
        path = osp.join(self.log_dir, log_path)
        with open(path, mode, encoding='utf8') as f:
            if isinstance(content, list):
                f.write("".join(content))
            elif isinstance(content, str):
                f.write(content)
            elif isinstance(content, dict):
                json.dump(content, f, indent=2, sort_keys=True, ensure_ascii=False)
            else:
                f.write(str(content))
        if show:
            self.log(f"Log save to: {path}")

    def list2tuple2str(self, list):
        return str(tuple(list))

    def dict_plus(self, dict, key, value=1):
        if key in dict.keys():
            dict[key] += value
        else:
            dict[key] = value

    def sort_by_label(self, path_label_list):
        """
        list:[
            "mwhls.jpg 1",                      # path and label
            "mwhls.png 0",                      # path and label
            "mwhls.gif 0"]                      # path and label
        -->
        list:[
            ["0", "1"],                         # label
            ["mwhls.png 0", "mwhls.gif 0"],     # class 0
            ["mwhls.jpg 1"]]                    # class 1
        """
        label_list = []
        for path_label in path_label_list:
            label = path_label.split()[-1]
            label_list.append(label)
        label_set = set(label_list)
        res_list = []
        res_list.append(list(label_set))
        for label in label_set:
            index_equal = []    # why index_equal = label_list == label isn't working?
            for i, lab in enumerate(label_list):
                if lab == label:
                    index_equal.append(i)
            res = [path_label_list[i] for i in index_equal] # why path_label_list[index_equal] isn't working either??
            res_list.append(res)
        return res_list

    def clear_taobao_link(self, text):
        # try:
        link = "https://item.taobao.com/item.htm?"
        try:
            id_index_1 = text.index('&id=') + 1
            id_index = id_index_1
        except:
            pass
        try:
            id_index_2 = text.index('?id=') + 1
            id_index = id_index_2
        except:
            pass
        try:
            id = text[id_index: id_index+15]
            text = link + id
        except:
            pass
        return text
        # except:
        #     return text

    def parse_from_language(self, lang='en'):
        path = osp.join(self.lang_path, self.lang_dict[lang])
        with open(path, "rb") as f:
            self.lang = json.load(f)


if __name__ == '__main__':
    # .py to .exe
    # os.system("pyinstaller -F main.py")
    # print(get_path_content("test2"))
    # file_last_subtract_1("path_label.txt")
    pass
