# Installation Guide

**General Environment**
* Linux system
* CUDA-enabled GPU with more than 8 GB of memory (evaluation)
* CUDA-enabled GPU with more than 12 GB of memory (training)

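To check a machine against the memory requirements listed above, something like the following sketch may help. It is not part of the repository: `classify_gpus` is a hypothetical helper, and it assumes that 8 GB and 12 GB correspond to 8192 MiB and 12288 MiB as reported by `nvidia-smi`.

```sh
# Hypothetical helper (not part of this repository): read "name, memory_MiB"
# lines on stdin and report which workload each GPU is large enough for.
# Assumption: 8 GB = 8192 MiB and 12 GB = 12288 MiB.
classify_gpus() {
    while IFS=, read -r name mem; do
        # Strip the space that nvidia-smi prints after the comma.
        mem=$(echo "$mem" | tr -d ' ')
        # Skip lines whose memory field is not a plain integer.
        case "$mem" in
            ''|*[!0-9]*) continue ;;
        esac
        if [ "$mem" -gt 12288 ]; then
            echo "$name: training and evaluation"
        elif [ "$mem" -gt 8192 ]; then
            echo "$name: evaluation only"
        else
            echo "$name: below the stated requirement"
        fi
    done
}

# Query the driver only when nvidia-smi is available on this machine.
if command -v nvidia-smi >/dev/null 2>&1; then
    nvidia-smi --query-gpu=name,memory.total --format=csv,noheader,nounits | classify_gpus
fi
```

The thresholds are strict comparisons, so a card that reports exactly 8192 MiB is flagged as below the requirement; adjust them if your setup is known to work with less.
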
**Installation**

```sh
# Python Package Installation
pip install -r assets/requirements/requirements.txt
pip install -r assets/requirements/requirements_custom.txt

# Custom Operator [only needed for training the deformable vision encoder]
cd modeling/vision/encoder/ops && sh make.sh && cd ../../../../

# System Package [only needed for the SEEM demo]
sudo apt update
sudo apt install ffmpeg
```

**Dataset Preparation**

Please refer to [DATASET.md](assets/readmes/DATASET.md).

**Evaluation Tool**
```sh
# save coco_caption.zip to .xdecoder_data
wget https://huggingface.co/xdecoder/X-Decoder/resolve/main/coco_caption.zip
unzip coco_caption.zip
```

**Environment Variables**
```sh
export DETECTRON2_DATASETS=/path/to/xdecoder_data
export DATASET=/path/to/xdecoder_data
export DATASET2=/path/to/xdecoder_data
export VLDATASET=/path/to/xdecoder_data
export PATH=$PATH:/path/to/xdecoder_data/coco_caption/jre1.8.0_321/bin
export PYTHONPATH=$PYTHONPATH:/path/to/xdecoder_data/coco_caption
```

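Because the four dataset variables above all need to point at an existing directory, a quick check before launching training or evaluation can catch a typo in the path. The following is a minimal sketch, not something shipped with the repository; `check_xdecoder_env` is a hypothetical helper name.

```sh
# Hypothetical helper (not part of this repository): report which of the
# dataset variables are unset or do not point at an existing directory.
# Returns 0 when all four are usable and 1 otherwise.
check_xdecoder_env() {
    status=0
    for name in DETECTRON2_DATASETS DATASET DATASET2 VLDATASET; do
        # Indirectly read the variable called $name; empty if it is unset.
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            echo "missing: $name is not set"
            status=1
        elif [ ! -d "$value" ]; then
            echo "missing: $name points to '$value', which is not a directory"
            status=1
        else
            echo "ok: $name=$value"
        fi
    done
    return $status
}
```

After exporting the variables, run `check_xdecoder_env` in the same shell; a non-zero exit status means at least one variable still needs fixing.
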
**Pretrained Checkpoint**