Hi Stikker Uipko,
Yes, this is a common real-world scenario when working with scanned delivery notes or invoices that include both printed and handwritten information.
You’re successfully extracting printed content using Azure Document Intelligence. However, you're encountering difficulties with:
Identifying and isolating handwritten notes such as “Not received”.
Mapping these handwritten annotations to the corresponding table rows or items.
Interpreting handwritten overwrites or strike-throughs in context.
To address this scenario effectively, you can consider the following approach:
- Enable Handwriting Extraction
Ensure you're using the latest Azure Document Intelligence API version (e.g., 2023-10-31-preview
) and enable the handwriting
feature in your request:
"features": ["handwriting"]
This allows the model to extract handwritten content in addition to printed text.
- Use Bounding Box Geometry for Association
Both table elements and handwritten annotations come with bounding box coordinates. You can use spatial analysis to match each handwritten note to the nearest table row or cell:
Compare the handwritten text’s position with the table row’s bounding box.
If it overlaps or is near a specific row, assign the note to that row programmatically.
This can be achieved by writing post-processing logic that evaluates the geometric overlap.
- Add Custom Post-Processing Logic
Since the model does not natively associate handwriting with specific rows:
Build a custom script or logic layer that appends a "handwritten_notes"
field to each row.
Use the bounding box comparison results to determine the correct mapping.
Example Output:
"tableRows": [
{
"rowId": 5,
"fields": {
"ItemNo": "330703",
"Qty": "2"
},
"handwritten_notes": ["Not received"]
}
]
- Optional Enhancements
Use Azure Form Recognizer Studio to visualize bounding boxes and fine-tune your logic.
If working with large volumes, consider training a custom layout model to recognize handwritten overrides directly.
For advanced use cases, integrate with other services like Ink Analysis API (if you’re capturing digital pen strokes).
Apart from the above methods please ensure that you have
- Make sure handwriting is legible and has good contrast in the scan.
- Strike-throughs or overwritten digits may be inconsistently picked up, depending on clarity.
Let us know if you need further assistance—we’d be happy to help!
Best regards,
Chakravarthi Rangarajan Bhargavi
If this answer was helpful, please click "Accept Answer" and upvote it so others in the community can find it more easily.