fal-vision
Analyze images with fal.ai vision models: segmentation, object detection, OCR, captioning, and visual question answering (QA).
Scripts
| Script | Purpose |
|--------|---------|
| analyze.sh | Analyze an image (segment, detect, OCR, describe, QA) |
Usage
Segment Objects
./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation segment --query "the red car"
Detect Objects
./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation detect
Extract Text (OCR)
./scripts/analyze.sh --image-url "https://example.com/document.jpg" --operation ocr
Describe Image
./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation describe
Visual QA
./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation qa --query "How many people are in this image?"
Arguments
| Argument | Description | Required |
|----------|-------------|----------|
| --image-url | URL of image to analyze | Yes |
| --operation | segment, detect, ocr, describe, qa | Yes |
| --query / -q | Text prompt for segment/qa operations | For segment/qa |
| --model / -m | Override model endpoint | No |
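The rules in the table can be checked before calling the script. The sketch below is a hypothetical pre-flight helper (`validate_args` is an illustrative name, not part of `analyze.sh`). It assumes only what the table states: `--operation` must be one of the five listed values, and `--query` is needed for `segment` and `qa`. How `analyze.sh` itself handles invalid input is not documented here.

```shell
#!/usr/bin/env bash
# Hypothetical helper, not part of analyze.sh: mirrors the Arguments table.
# Usage: validate_args OPERATION [QUERY]
validate_args() {
  local operation="$1" query="${2:-}"
  case "$operation" in
    segment|qa)
      # The table marks --query as required for these two operations.
      if [ -z "$query" ]; then
        echo "error: --query is required for '$operation'" >&2
        return 1
      fi
      ;;
    detect|ocr|describe)
      # No query needed according to the table.
      ;;
    *)
      echo "error: unknown operation '$operation'" >&2
      return 1
      ;;
  esac
  return 0
}
```

A wrapper could call `validate_args "$operation" "$query"` and invoke `./scripts/analyze.sh` only when it returns success.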
Finding Models
To discover current vision and analysis models, query the search API with the search-models.sh script:
# Search for segmentation models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "segmentation"
# Search for object detection models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "object detection"
# Search for OCR models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "ocr"
# Search for image captioning / visual QA models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "caption"
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "visual question"
Or use the search_models MCP tool with keywords like "segmentation", "detection", "ocr", "caption", "vision".
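The individual shell searches above can also be batched. The following is a minimal sketch, assuming the `search-models.sh` path shown in the commands above. `search_all` and the `DRY_RUN` variable are hypothetical names introduced for illustration. By default the function only prints the commands it would run, so it can be tried without the skill installed.

```shell
#!/usr/bin/env bash
# Path copied from the search commands above; adjust if the skill lives elsewhere.
SEARCH_SCRIPT="/mnt/skills/user/fal-generate/scripts/search-models.sh"

# Hypothetical helper: run one search per keyword.
# With DRY_RUN unset or set to 1, print each command instead of executing it.
search_all() {
  local keyword
  for keyword in "$@"; do
    if [ "${DRY_RUN:-1}" = "1" ]; then
      printf 'bash %s --query "%s"\n' "$SEARCH_SCRIPT" "$keyword"
    else
      bash "$SEARCH_SCRIPT" --query "$keyword"
    fi
  done
}
```

For example, `DRY_RUN=0 search_all "segmentation" "object detection" "ocr"` would run the three searches in sequence.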