Instructions to use google/pix2struct-screen2words-large with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use google/pix2struct-screen2words-large with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("visual-question-answering", model="google/pix2struct-screen2words-large")# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("google/pix2struct-screen2words-large") model = AutoModelForImageTextToText.from_pretrained("google/pix2struct-screen2words-large") - Notebooks
- Google Colab
- Kaggle
Commit History
Update README.md a77a669
Update config.json 518443f
Update config.json b1e796c
Update README.md ab31c4f
Create README.md 4b0966b
Upload processor dc08cb4
Upload Pix2StructForConditionalGeneration fe0fc02
initial commit 391d7d4
Younes Belkada commited on