anytable / README.md

qianliyx

Update README.md

223e029 verified 10 months ago

3.93 kB

	---
	license: apache-2.0
	pipeline_tag: object-detection
	tags:
	- table-detect
	- table-recong
	---
	# AnyTable

	<a href="https://huggingface.co/anyforge/anytable" target="_blank"><img src="https://img.shields.io/badge/%F0%9F%A4%97-HuggingFace-blue"></a>
	<a href="https://www.modelscope.cn/models/anyforge/anytable" target="_blank"><img alt="Static Badge" src="https://img.shields.io/badge/%E9%AD%94%E6%90%AD-ModelScope-blue"></a>
	<a href=""><img src="https://img.shields.io/badge/Python->=3.6-aff.svg"></a>
	<a href=""><img src="https://img.shields.io/badge/OS-Linux%2C%20Win%2C%20Mac-pink.svg"></a>
	<a href=""><img alt="Static Badge" src="https://img.shields.io/badge/engine-cpu_gpu_onnxruntime-blue"></a>

	```
	___ ______ __ __
	/ \| ____ __ _/_ __/___ _/ /_ / /__
	/ /\| \| / __ \/ / / // / / __ `/ __ \/ / _ \
	/ ___ \|/ / / / /_/ // / / /_/ / /_/ / / __/
	/_/ \|_/_/ /_/\__, //_/ \__,_/_.___/_/\___/
	/____/

	```

	简体中文 \| [English](./README_en.md)

	<div align="left">
	<img src="./assets/sample1.jpg">
	</div>


	## 1. 简介

	AnyTable是一个专注于从文档或者图片中表格解析的模型工具，主要分成两个部分：
	- anytable-det：用于表格区域检测（已开放）
	- anytable-rec：用于表格结构识别（未来开放）

	项目地址：
	- github地址：[AnyTable](https://github.com/anyforge/anytable)
	- Hugging Face: [AnyTable](https://huggingface.co/anyforge/anytable)
	- ModelScope: [AnyTable](https://www.modelscope.cn/models/anyforge/anytable)

	## 2. 缘起

	目前市面上表格数据非常多且混杂，很难有一个干净的完整数据和模型，为此我们收集并整理了很多表格数据，训练了我们的模型。

	检测数据集分布：

	- pubtables: 947642
	- synthtabnet.marketing: 149999
	- tablebank: 278582
	- fintabnet.c: 97475
	- pubtabnet: 519030
	- synthtabnet.sparse: 150000
	- synthtabnet.fintabnet: 149999
	- docbank: 24517
	- synthtabnet.pubtabnet: 150000
	- cTDaRTRACKA: 1639
	- SciTSR: 14971
	- doclaynet.large: 21185
	- IIITAR13K: 9905
	- selfbuilt: 121157

	数据集总计：大于`2.6M`(大约2633869张图片)。

	### 扩展训练

	- 训练集：`2.6M（大于10万的部分只抽样了42000，没办法因为贫穷，卡有限。）`
	- 测试集：`4k`
	- python: 3.12
	- pytorch: 2.6.0
	- cuda: 12.3
	- ultralytics: 8.3.128

	### 模型介绍

	表格检测模型位于det文件夹下：
	- yolo系列：使用ultralytics训练yolo检测
	- rt-detr：使用ultralytics训练rt-detr检测

	注释：您可以直接模型预测，也可以作为预训练模型微调私有数据集

	### 评估

	自建评估集：`4K`

	\| model \| imgsz \| epochs \| metrics/precision \|
	\|---\|---\|---\|---\|
	\|rt-detr-l\|960\|10\|0.97\|
	\|yolo11s\|960\|10\|0.97\|
	\|yolo11m\|960\|10\|0.964\|
	\|yolo12s\|960\|10\|0.978\|


	## 3. 使用方法

	### 安装依赖

	```bash
	pip install ultralytics pillow
	```

	### 使用方法

	```python
	## simple
	## 下载模型后直接使用ultralytics即可

	from ultralytics import YOLO,RTDETR

	# Load a model
	model = YOLO("/path/to/download_model") # pretrained YOLO11n model

	# Run batched inference on a list of images
	results = model(["/path/to/your_image"],imgsz = 960) # return a list of Results objects

	# Process results list
	for result in results:
	boxes = result.boxes # Boxes object for bounding box outputs
	masks = result.masks # Masks object for segmentation masks outputs
	keypoints = result.keypoints # Keypoints object for pose outputs
	probs = result.probs # Probs object for classification outputs
	obb = result.obb # Oriented boxes object for OBB outputs
	result.show() # display to screen
	result.save(filename="result.jpg") # save to disk

	```

	## Buy me a coffee

	- 微信(WeChat)

	<div align="left">
	<img src="./zanshan.jpg" width="30%" height="30%">
	</div>

	## 特别鸣谢
	- ultralytics公开的训练模型和文档
	- 各种数据集提供者