Document Segmentation Assemble - DOSA

Installation and requirements

Tested on Ubuntu 18.04 and 20.04.

Use of a GPU significantly speeds up generation of detection outputs, but it is possible to run the inference demo code on CPU.

Python Virtual Environment

Set up a Python 3.7.x environment:
    pyenv install 3.7.12
    pyenv virtualenv 3.7.12 dosa-env
Activate the environment:
    pyenv shell dosa-env
Update pip and setuptools:
    python -m pip install --upgrade pip setuptools
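
Before installing the remaining packages, it can help to confirm that the activated interpreter really is a 3.7.x build. The following is a minimal sketch, not part of the repository; the helper name is_supported is hypothetical and only the major and minor version numbers are compared.

```python
import sys

# The README targets Python 3.7.x (3.7.12 is the version installed with pyenv).
REQUIRED = (3, 7)


def is_supported(version_info=None):
    """Return True when the major.minor version equals REQUIRED.

    Hypothetical helper for illustration; pass a tuple such as (3, 7, 12)
    or leave it empty to check the running interpreter.
    """
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) == REQUIRED


def report():
    """Print a short message about the running interpreter."""
    current = ".".join(str(part) for part in sys.version_info[:3])
    if is_supported():
        print("Python " + current + " matches the 3.7.x requirement.")
    else:
        print("Python " + current + " found, but 3.7.x is expected.")
```

Calling report() inside the activated dosa-env environment prints which of the two cases applies.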

Models required

Install requirements:
    pip install -r requirements.txt
- (for a GPU-enabled installation: pip install -r requirements_gpu.txt)
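
The choice between the two requirements files can be sketched as a small helper. This is an illustration under stated assumptions, not code from the repository: the function names are hypothetical, and treating the presence of nvidia-smi on PATH as a sign of a usable GPU is only a heuristic.

```python
import shutil


def pick_requirements(gpu_available):
    """Return the requirements file named in the README for this setup."""
    return "requirements_gpu.txt" if gpu_available else "requirements.txt"


def looks_like_gpu_host():
    """Heuristic assumption: NVIDIA driver installs put nvidia-smi on PATH."""
    return shutil.which("nvidia-smi") is not None


def install_command(gpu_available):
    """Build the pip command as an argument list (nothing is executed here)."""
    return ["python", "-m", "pip", "install", "-r", pick_requirements(gpu_available)]
```

For example, install_command(looks_like_gpu_host()) yields the argument list to pass to subprocess.run if the installation is scripted.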

Mask R-CNN & DocParser

Install Mask R-CNN:
    pip install -e ./Mask_RCNN
Install DocParser:
    pip install -e ./DocParser
Download the model weights by following the instructions in DocParser/docparser/default_models/README.md

PaddleOCR

Install PaddlePaddle:
    pip install paddlepaddle==2.1.3
- (for a GPU-enabled installation: pip install paddlepaddle-gpu==2.1.3)

Installing paddlepaddle prints a dependency-conflict warning: tensorflow==1.15.5 expects gast==0.2.2, while paddlepaddle==2.1.3 expects gast==0.4.0. This warning can be ignored.
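
To see which gast version actually ended up in the environment after this warning, the installed package metadata can be queried. A minimal sketch, assuming setuptools is available on Python 3.7 (it is upgraded in the virtual environment step); installed_version is a hypothetical helper, not part of the repository.

```python
def installed_version(package):
    """Return the installed version string of a package, or None if absent."""
    try:
        # importlib.metadata exists only on Python 3.8 and later.
        from importlib import metadata
    except ImportError:
        metadata = None

    if metadata is not None:
        try:
            return metadata.version(package)
        except metadata.PackageNotFoundError:
            return None

    # Fallback for Python 3.7: pkg_resources ships with setuptools.
    import pkg_resources
    try:
        return pkg_resources.get_distribution(package).version
    except pkg_resources.DistributionNotFound:
        return None


def summarize(packages=("gast", "tensorflow", "paddlepaddle")):
    """Map each package name to its installed version (None when missing)."""
    return {name: installed_version(name) for name in packages}
```

Running print(summarize()) in the activated environment shows the versions pip resolved for the three packages named in the warning.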

Install PaddleOCR:
    pip install -e ./PaddleOCR

FastAPI server

Install Poetry by following the instructions at https://github.com/python-poetry/poetry#osx--linux--bashonwindows-install-instructions
Install the server dependencies:
    poetry install

Run Server and Demo

Try each model with the scripts in ./demos, or run the API server in ./server together with ./demos/server_api.py
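
This README does not list the server's endpoints. Since FastAPI publishes an OpenAPI schema at /openapi.json by default, one way to discover them once the server is running is sketched below. The base URL http://127.0.0.1:8000 is an assumption, as is the schema route still being enabled in this project; both function names are hypothetical.

```python
import json
from urllib.request import urlopen

HTTP_METHODS = {"get", "post", "put", "patch", "delete", "options", "head"}


def list_routes(schema):
    """Return sorted (METHOD, path) pairs from an OpenAPI schema dict."""
    routes = []
    for path, operations in schema.get("paths", {}).items():
        for method in operations:
            # Path items may also hold non-method keys such as "parameters".
            if method in HTTP_METHODS:
                routes.append((method.upper(), path))
    return sorted(routes)


def fetch_routes(base_url="http://127.0.0.1:8000"):
    """Download the schema from a running server and list its routes.

    base_url is an assumption; use the host and port your server listens on.
    """
    with urlopen(base_url + "/openapi.json", timeout=5) as response:
        return list_routes(json.load(response))
```

With the server running, fetch_routes() returns the method and path pairs it exposes, which can then be compared against the calls made in ./demos/server_api.py.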

rednam-ntn/dosa
