Segment-Everything-Everywhe.../README.md

24 lines
1.3 KiB
Markdown
Raw Normal View History

2023-04-12 07:46:08 +08:00
# Segment-Everything-Everywhere-All-at-Once
2023-04-12 12:31:46 +08:00
![SEEM design](assets/model.jpg?raw=true)
2023-04-12 07:46:08 +08:00
This paper presents a model **SEEM** that can Segment Everything Everywhere all at once. Our SEEM allows users to easily segment visual an image using prompts of different types including visual prompts (points, marks, boxes, scribbles and image segments) and language prompts (text and audio), etc. It can also handle any combination of prompts or generalize to custom prompts.
We emphasize $4$ important features of **SEE** here.
1. Versatility: work on various types of prompts;
2. Compositionaliy: deal with any compositions of prompts;
3. Interactive: dealmulti-round interactions with human because **SEE** has a memory prompt to store the session history;
4. Semantic awareness: give a semantic label to any predicted mask;
2023-04-12 12:47:48 +08:00
![SEEM design](assets/compare_with_sam.jpg?raw=true)
## :robot: Run click to mask demo
<!-- should show an example image here -->
## :golfing: Run text to mask demo
<!-- should show an example image here -->
## :skier: Run example image to mask demo
<!-- should show an example image here -->
## :cupid: Acknowledgements
- [X-Decoder](https://github.com/microsoft/X-Decoder)
## Citation
If you find this project helpful for your research, please consider citing the following BibTeX entry.
```BibTex
```