Improving Scientific Notation Accuracy with Azure Document Intelligence Layout Model

Question

Hongqian Li 65 Reputation points

We’re currently using Azure Document Intelligence’s Prebuilt Layout model to extract content from structured documents. It works well for layout and structure, but it does not accurately preserve scientific notation. For example, expressions like 2.0 × 10⁻⁵ are extracted as 2.0 X 10-5, and formatting such as superscripts and subscripts is lost.

Our goal is to retain the layout extraction capabilities of the Prebuilt Layout model, but enhance it with better accuracy for mathematical/scientific expressions.

1. Is it possible to build a custom model that extends the Prebuilt Layout model? That is, we want the same layout detection, but with improved text extraction (especially for scientific formats).
2. If it is not directly extendable, what is the recommended approach for combining layout recognition with better scientific notation handling? For example, how can we combine it with Azure Vision OCR (Read API v4) to infer superscripts/subscripts more accurately?

We’re looking for implementation guidance or documentation that can help us bridge the gap without losing the benefits of the prebuilt layout model.

Thanks in advance for any insights or references!

Accepted answer

Answer 1

Hello Hongqian Li,

Welcome to the Microsoft Q&A and thank you for posting your questions here.

I understand that you would like to improve scientific notation accuracy when using the Azure Document Intelligence Layout model.

My advice is to start with a hybrid pipeline that combines:

Read API v4 for OCR
Prebuilt Layout for structure
Custom post-processing for notation formatting

Move to a custom model if the layout is consistent and the notation is critical.
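As a minimal sketch of the custom post-processing step, assuming the OCR output flattens expressions the way the question describes (2.0 × 10⁻⁵ becoming 2.0 X 10-5), a regular expression can restore the multiplication sign and the superscript exponent. The function name and the pattern are illustrative only and are not part of any Azure SDK. The pattern rewrites only exponents that carry an explicit sign, because an unsigned form such as 3 x 105 is ambiguous in plain text and needs position information to resolve.

```python
import re

# Map ASCII digits and signs to their Unicode superscript forms.
_SUPERSCRIPTS = str.maketrans("0123456789+-", "⁰¹²³⁴⁵⁶⁷⁸⁹⁺⁻")

# Mantissa, a multiplication sign read as x, X, × or *, the base 10,
# then an exponent with an explicit sign (ASCII +/- or the Unicode minus).
_SCI_PATTERN = re.compile(r"(\d+(?:\.\d+)?)\s*[xX×*]\s*10\s*([-+−]\d+)")


def normalize_scientific_notation(text: str) -> str:
    """Rewrite flattened forms such as '2.0 X 10-5' as '2.0 × 10⁻⁵'."""

    def _repl(match: re.Match) -> str:
        mantissa = match.group(1)
        # Treat the Unicode minus like the ASCII hyphen before translating.
        exponent = match.group(2).replace("−", "-")
        return f"{mantissa} × 10{exponent.translate(_SUPERSCRIPTS)}"

    return _SCI_PATTERN.sub(_repl, text)


if __name__ == "__main__":
    print(normalize_scientific_notation("Concentration: 2.0 X 10-5 mol/L"))
```

Running this on the text content of each extracted line or table cell leaves the layout structure untouched and only changes the strings that match the pattern.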

The requirements of your scenario map to tools and solutions as follows:

To preserve document layout, we use Azure DI's Prebuilt Layout Model.
For detecting superscript/scientific notations, we combine Azure Read API v4 with heuristics or leverage MathPix's Math OCR.
Accurate fusion of elements is achieved through custom logic implementing IoU matching with coordinate normalization.
LaTeX/MathML export capability is enabled via math-aware reconstruction using MathPix or custom solutions.
Handling custom templates involves field tagging and regex patterns through Azure DI's Custom Model.
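The IoU matching with coordinate normalization mentioned above can be sketched as follows. This is only an illustration under stated assumptions: each word polygon is available as a flat list of x, y coordinates in its page's own unit (for example inches from one service and pixels from the other), the page width and height are known in that same unit, and both services processed the same page. The function names and the 0.5 threshold are hypothetical choices, not part of any SDK.

```python
from typing import List, Optional, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max), all in [0, 1]


def normalize_polygon(polygon: Sequence[float], page_width: float, page_height: float) -> Box:
    """Convert a flat [x1, y1, x2, y2, ...] polygon to a unit-free bounding box."""
    xs = polygon[0::2]
    ys = polygon[1::2]
    return (
        min(xs) / page_width,
        min(ys) / page_height,
        max(xs) / page_width,
        max(ys) / page_height,
    )


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def match_boxes(
    layout_boxes: Sequence[Box], ocr_boxes: Sequence[Box], threshold: float = 0.5
) -> List[Optional[int]]:
    """For each layout box, return the index of the best-overlapping OCR box, or None."""
    matches: List[Optional[int]] = []
    for layout_box in layout_boxes:
        best_index, best_score = None, threshold
        for index, ocr_box in enumerate(ocr_boxes):
            score = iou(layout_box, ocr_box)
            if score >= best_score:
                best_index, best_score = index, score
        matches.append(best_index)
    return matches


if __name__ == "__main__":
    # The same word region, once in inches on an 8.5 x 11 page, once in pixels at 100 px/inch.
    layout_word = normalize_polygon([1.0, 1.0, 2.0, 1.0, 2.0, 1.5, 1.0, 1.5], 8.5, 11.0)
    ocr_word = normalize_polygon([100, 100, 200, 100, 200, 150, 100, 150], 850, 1100)
    print(match_boxes([layout_word], [ocr_word]))
```

Dividing by the page dimensions first means the two results can be compared even when they use different units. Each matched pair then lets you keep the structure from the layout result while taking the text from whichever source handled the notation better.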

References as requested:

Azure Read API: https://learn.microsoft.com/en-us/azure/ai-services/computer-vision/how-to/call-read-api
Document Intelligence Layout Model: https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/concept-layout
MathPix OCR (Optional): https://docs.mathpix.com/
MathML for Scientific Publishing: https://developer.mozilla.org/en-US/docs/Web/MathML
Coordinate Alignment for OCR: https://docs.opencv.org/4.x/da/d6e/tutorial_py_geometric_transformations.html

I hope this is helpful! Do not hesitate to let me know if you have any other questions or clarifications.

Please don't forget to close the thread by upvoting this answer and accepting it if it is helpful.

Hongqian Li 65 Reputation points

2025-08-21T17:38:13.9666667+00:00

Hi Sina,

Thanks for the response. It helps a lot!

Are there any tutorials or documentations that can help with building the hybrid pipeline using Read API v4 for OCR + Prebuilt Layout for structure?

Thanks again for your help!

Sina Salam 24,096 Reputation points Volunteer Moderator

2025-08-22T10:37:47.7366667+00:00

Hi

Thank you for your feedback.

Regarding your new question:

Are there any tutorials or documentations that can help with building the hybrid pipeline using Read API v4 for OCR + Prebuilt Layout for structure?

Yes, there are several excellent tutorials and resources available to help you build a hybrid pipeline using Azure Read API v4 for OCR and Prebuilt Layout Model for structure.

Official Microsoft Documentation for Read API v4: Read model OCR data extraction - Azure Document Intelligence

GitHub Code Samples for Document Intelligence: Azure-Samples/document-intelligence-code-samples

REST API Tutorial for Document Intelligence: Calling Azure AI Document Intelligence using the REST API

End-to-End Pipeline Template Using Durable Functions: Azure AI Document Processing Pipeline using Python Durable Functions

Please do not forget to close the thread by upvoting this answer and accepting it, for the benefit of this community.

Thank you.
