Description
Estrae testo da immagini e PDF scansionati usando Tesseract.js (pure JS, no native binary, no API key). Lingue: italiano + inglese di default, configurabile. Output con bounding boxes, confidence score per blocco, threshold filter.
integration_ocr · action · SaaS Integrations · Available · v1.0.0
Estrae testo da immagini e PDF scansionati usando Tesseract.js (pure JS, no native binary, no API key). Lingue: italiano + inglese di default, configurabile. Output con bounding boxes, confidence score per blocco, threshold filter.
Fields shown in the editor when configuring the node. Generated directly from the NodeDefconfigFields.
| Field | Type | Required | Default | Description |
|---|---|---|---|---|
actionAzione | enumextract_text | yes | extract_text | — |
sourceTipo sorgente | enumfile_pathbase64 | yes | — | file_path = path nel volume workspace. base64 = contenuto inline. |
contentPath o base64 | string (multiline) | yes | — | — |
languagesLingue (separate da virgola) | string | no | ita,eng | Codici Tesseract: ita, eng, fra, deu, spa, ... |
confidenceThresholdSoglia confidence (0-100) | number | no | 30 | Filtra blocchi sotto questa soglia. |
Fields available to downstream nodes via $node.<alias>.json.<field>:
textconfidenceblocksJSON snippet of the node as it appears in the workflow. Values are derived fromdefaultValue and from required parameters.
{
"id": "node-integration_ocr-1",
"defId": "integration_ocr",
"label": "OCR (Tesseract)",
"config": {
"action": "extract_text",
"source": "file_path",
"content": "<content>",
"languages": "ita,eng",
"confidenceThreshold": 30
}
}Local OCR (no cloud upload). Data stays on the tenant container, never sent to third parties. Higher-quality Google Vision OCR (cloud) option coming.
Available now on all FlowForge plans. Try it free without a credit card.
Start freeBrowse all nodes