Instructions to use theaiinstitute/theia-tiny-patch16-224-cddsv with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use theaiinstitute/theia-tiny-patch16-224-cddsv with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("feature-extraction", model="theaiinstitute/theia-tiny-patch16-224-cddsv", trust_remote_code=True)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("theaiinstitute/theia-tiny-patch16-224-cddsv", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
| library_name: transformers | |
| license: other | |
| # Theia | |
| [The AI Institute](https://theaiinstitute.com/) | |
| Theia is a vision foundation model for robot learning that distills multiple off-the-shelf vision foundation models trained on varied vision tasks. Theia’s rich visual representations encode diverse visual knowledge, enhancing downstream robot learning. It was introduced in the paper [Theia: Distilling Diverse Vision Foundation Models for Robot Learning](https://arxiv.org/abs/2407.20179), which also includes experiments demonstrating that Theia outperforms its teacher | |
| models and prior robot learning models using less training data and smaller model sizes. Demo videos can be found on the [project page](http://theia.theaiinstitute.com/). | |
| <img src="https://raw.githubusercontent.com/bdaiinstitute/theia/main/doc/theia_overview.gif" height="300px"> | |
| ## Model Details | |
| The `theia-tiny-patch16-224-cddsv` model, uses [DeiT-Tiny](https://huggingface.co/facebook/deit-tiny-patch16-224) as a backbone, and simulatenously distills [CLIP](https://github.com/openai/CLIP), [Depth Anything](https://github.com/LiheYoung/Depth-Anything), [DINOv2](https://github.com/facebookresearch/dinov2), [Segment Anything](https://github.com/facebookresearch/segment-anything) and [ViT](https://github.com/google-research/vision_transformer). For more information on usage, please visit the [Theia repository](https://github.com/bdaiinstitute/theia/tree/main). | |
| ## Citation | |
| If you use Theia in your research, please use the following BibTeX entry: | |
| ```bibtex | |
| @article{shang2024theia, | |
| author = {Shang, Jinghuan and Schmeckpeper, Karl and May, Brandon B. and Minniti, Maria Vittoria and Kelestemur, Tarik and Watkins, David and Herlant, Laura}, | |
| title = {Theia: Distilling Diverse Vision Foundation Models for Robot Learning}, | |
| journal = {arXiv}, | |
| year = {2024}, | |
| } | |
| ``` | |
| ## Usage | |
| The pre-trained model weights and code released with Theia are available for use under [The AI Institute License](https://raw.githubusercontent.com/bdaiinstitute/theia/main/LICENSE), reproduced in full below: | |
| ``` | |
| Copyright (c) 2024 Boston Dynamics AI Institute LLC | |
| Redistribution and use in source and binary forms, with or without | |
| modification, are permitted provided that the following conditions are met: | |
| 1. Redistributions of source code must retain the copyright notice included | |
| with the software, this list of conditions and the following disclaimer. | |
| 2. Redistributions in binary form must reproduce the copyright notice, this | |
| list of conditions and the following disclaimer in the documentation and/or | |
| other materials provided with the distribution. | |
| 3. Modified versions of the software must be conspicuously marked as such. | |
| 4. The software may only be used for non-commercial research purposes. | |
| For profit enterprises may use the software, subject to this limitation. | |
| THIS SOFTWARE IS PROVIDED BY THE AI INSTITUTE AND CONTRIBUTORS "AS IS" AND | |
| ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, NON- | |
| INFRINGEMENT,TITLE, MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE | |
| DISCLAIMED. IN NO EVENT SHALL THE AI INSTITUTE OR CONTRIBUTORS BE LIABLE FOR | |
| ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, PUNITIVE OR CONSEQUENTIAL | |
| DAMAGES (INCLUDING, BUT NOT LIMITED TO, DAMAGES ARISING OUT OF CLAIMS OF | |
| INTELLECTUAL PROPERTY RIGHTS INFRINGEMENT; PROCUREMENT OF SUBSTITUTE GOODS OR | |
| SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER | |
| CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, | |
| OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE | |
| OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. | |
| ``` |