Hi Andrei Pruteanu,
I sincerely apologize for the delayed response.
Thank you for your question and for catching that detail, you are absolutely correct. The Custom Neural model is not currently visible or available in the Document Intelligence Studio interface. This capability is still in private/limited preview, meaning only select customers or internal testing environments have access to it at this time.
At present, the Document Intelligence Studio supports two main custom model types: Custom Extraction (for structured data field extraction) and Custom Classification (for document sorting). The Custom Extraction model can recognize and extract labeled fields, including signature areas, but it supports only one signature per page when using the built-in "Signature" label type. The object detection–style flexibility you had with Custom Vision (detecting multiple stamps or signatures per page) is not yet part of the Studio UI.
If your current goal is to detect multiple visual elements like stamps or signatures on a single page, there are two paths you can take. The first option is to continue using Custom Extraction models, but instead of the “Signature” label type, define multiple labeled fields such as Signature_1, Signature_2, Stamp_1, etc. This allows you to tag and train the model to recognize several regions per page. While this isn’t full object detection, it can still help identify and extract multiple regions effectively.
Another option is to use Azure AI Foundry or Azure Machine Learning if you need a more advanced object detection approach. You can train a model externally (for example, using a YOLO-based architecture) to detect stamps or signatures, and then integrate that detection output with Document Intelligence for text extraction. Azure AI Foundry makes this integration easier and provides flexibility similar to what you had with Custom Vision.
If you only need to identify multiple stamps or signatures on a page, the Custom Extraction workaround should meet your needs. However, if you require true object detection or more complex visual analysis, combining Azure AI Foundry or Azure ML with Document Intelligence is the better choice. This approach allows you to handle visual detection and text extraction together in one automated workflow.
Microsoft is actively working on making the Custom Neural model generally available. This new capability will bring back advanced object detection features similar to Custom Vision but optimized for document scenarios, enabling you to train models that detect multiple visual elements per page directly within Document Intelligence Studio.
Thank you!