Using the Android ML Kit Document Scanner API

Feature Requests / Funktionswünsche

Using the Android ML Kit Document Scanner API
DBoho
29.12.2025 13:42 #1

Hey,

This project seems like a great start for a user-centric Paperless-ngx mobile client. One of the core challenges is designing a high-quality UX and UI for the scanning process.

I have also started experimenting with building a client to simplify uploading documents to a Paperless-ngx instance. I found the Android ML Kit Document Scanner API [1, 2] to be an easy-to-use API for Android. The UX is excellent, and most features—such as Black & White/Color filters and PDF/JPEG export—are already implemented. It might be worth giving it a try!

You could then expose the settings to predefine SCANNER_MODE, CAPTURE_MODE, and RESULT_FORMAT within the user settings.

Thanks for this product; I’m excited to see future releases.

Best regards,

 

[1]: https://android-developers.googleblog.com/2024/02/ml-kit-document-scanner-api.html

[2]: https://developers.google.com/ml-kit/vision/doc-scanner 

 

 

Re: Using the Android ML Kit Document Scanner API
Björn Ternes Team
30.12.2025 11:37 #2

Hey DBoho,

Thank you so much for your suggestion and the positive feedback – I'm really glad you like the project!

I took a closer look at the ML Kit Document Scanner API right after reading your post, and I have to say: you're absolutely right. The API is really impressive and takes a huge amount of work off your hands. The UX is indeed excellent, and features like the color/black & white filters and the various export formats are already solidly implemented.

I've already integrated it into my current development build and I'm currently experimenting with it.

Here is a demo video: Demo Video (share.rebtsoft.com)

I am also going a step further and currently looking into the "Apple VisionKit" to also improve the scanner experience on apple devices. 

Thanks
Björn