Instructions to use google/pix2struct-base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use google/pix2struct-base with Transformers:
# Use a pipeline as a high-level helper # Warning: Pipeline type "image-to-text" is no longer supported in transformers v5. # You must load the model directly (see below) or downgrade to v4.x with: # 'pip install "transformers<5.0.0' from transformers import pipeline pipe = pipeline("image-to-text", model="google/pix2struct-base")# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("google/pix2struct-base") model = AutoModelForImageTextToText.from_pretrained("google/pix2struct-base") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -74,7 +74,7 @@ This model was originally contributed by Kenton Lee, Mandar Joshi et al. and add
|
|
| 74 |
# Citation
|
| 75 |
|
| 76 |
If you want to cite this work, please consider citing the original paper:
|
| 77 |
-
|
| 78 |
@misc{https://doi.org/10.48550/arxiv.2210.03347,
|
| 79 |
doi = {10.48550/ARXIV.2210.03347},
|
| 80 |
|
|
@@ -92,3 +92,4 @@ If you want to cite this work, please consider citing the original paper:
|
|
| 92 |
|
| 93 |
copyright = {Creative Commons Attribution 4.0 International}
|
| 94 |
}
|
|
|
|
|
|
| 74 |
# Citation
|
| 75 |
|
| 76 |
If you want to cite this work, please consider citing the original paper:
|
| 77 |
+
```
|
| 78 |
@misc{https://doi.org/10.48550/arxiv.2210.03347,
|
| 79 |
doi = {10.48550/ARXIV.2210.03347},
|
| 80 |
|
|
|
|
| 92 |
|
| 93 |
copyright = {Creative Commons Attribution 4.0 International}
|
| 94 |
}
|
| 95 |
+
```
|