API to retrieve images from within an image or pdf - Artificial Intelligence

blueberrr
November 11, 2022
158 views
0 votes
2 Answers

I am looking for a way to extract images from within another image. For example:

Here is a picture taken of a paper. It includes text, an image of a camera, and an image of a qr code. Is there an API that can possibly extract those two(camera and qr code) from this larger image and separate them into their own individual images. I know this is doable with the text(OCR), but I need to find some way to do Image Recognition if that even exists. For now, I cant find any reference to doing this besides extracting images from pdf’s, which none of those softwares have the capability to extract them from a non-perfect pdf.

Price for the API(node.js prefered, but i can adapt to use any language) is not a big concern, I’m just not sure this is even possible to due without programming a legitable artificial intelligence using machine learning, which I would no doubt cause a global internet shutdown from breaking everything if I attempted to do so.

Anyway, any suggestions would be great and much appreciated. Thanks!

EDIT: the images aren’t always those, it can be an image of anything, from potatoes to flags

Answers

- Arda
- November 11, 2022 at 9:34 pm
- 0 votes
0
For the QR code, you can simply use a QR code scanner library and convert the output back into a QR code. As for the camera, you are going to need an image recognition service like Google Cloud Vision or train your own neural network with something like TensorFlow to recognize pictures of cameras.

Login or Signup to reply.

- KJ
- November 12, 2022 at 9:33 pm
- 0 votes
0
QR detectors abound around the web and some are on github but for single objects you could try hotpot API https://hotpot.ai/docs/api
your code example linked into https://hotpot.ai/remove-background

for striping back you may need a secondary autocrop task

Login or Signup to reply.

Please signup or login to give your own answer.

Click here to cancel reply.

API to retrieve images from within an image or pdf – Artificial Intelligence

Answers