add readme for explainations
parent
049566bdc9
commit
dbe0ad8f21
Binary file not shown.
After Width: | Height: | Size: 472 KiB |
Binary file not shown.
After Width: | Height: | Size: 456 KiB |
|
@ -57,7 +57,14 @@ Marrying <a href="https://github.com/IDEA-Research/GroundingDINO">Grounding DINO
|
||||||
<img src="https://huggingface.co/ShilongLiu/GroundingDINO/resolve/main/GD_GLIGEN.png" alt="gd_gligen" width="100%">
|
<img src="https://huggingface.co/ShilongLiu/GroundingDINO/resolve/main/GD_GLIGEN.png" alt="gd_gligen" width="100%">
|
||||||
</details>
|
</details>
|
||||||
|
|
||||||
|
## :star: Explanation/Tips for Grounding DINO Inputs and Outputs
|
||||||
|
- Grounding DINO accepts with a `(image, text)` pair as inputs.
|
||||||
|
- It will outputs `900` (by default) object boxes. Each box has a similarity scores across all input words.
|
||||||
|
- We defaultly choose the boxes whose highest similarities are higher than a `box_threshold`.
|
||||||
|
- We clip the words whose similarities are higher than the `text_threshold` as predicted labels.
|
||||||
|
- If you want to obtain objects of certain phrases, like the `dogs` in the sentence `two dogs with a stick.`, you can select the boxes with highest text similarities with `dogs` as final outputs.
|
||||||
|

|
||||||
|

|
||||||
|
|
||||||
## :label: TODO
|
## :label: TODO
|
||||||
|
|
||||||
|
|
Loading…
Reference in New Issue