Skip to main content
4. Future Consideration

Pictures or image files summarization by the assistant

Related products:Agent StudioAI Assistant
  • February 6, 2026
  • 2 replies
  • 31 views

Forum|alt.badge.img+3

Hi Community,

We had a request where for example a person is locked out of their computer - the assistant takes an image and based on the image extract the key of the machine to unblock the computer.

The idea would be to be able to include image files, screenshots and pictures taken from Microsoft Teams and moveworks would be able to act - summarize - extract relevant information for the user.

Furthermore this would also assist if a user is having a big error message  and he could take a print screen and the assistant would convert this to text  and query the knowledge base.

thank you

2 replies

Ajay Merchia
Forum|alt.badge.img+3
  • Community Manager
  • March 23, 2026

Thanks for the suggestion, João. Native image understanding is something we're actively thinking about, though it's not immediately on our roadmap. We'll let you know when it is.

In the meantime, you can work around this today by passing the file content from a File Slot to an external vision API (like GPT-4o Vision) via an HTTP Action in a Compound Action. It's extra setup but it works.

For the locked computer scenario — is the key something readable via OCR, or does it require more complex image understanding?


Ajay Merchia
Forum|alt.badge.img+3
  • Community Manager
  • March 23, 2026
Updated idea status 1. New4. Future Consideration