By combining OutSystems Architecture with Gen AI, we can greatly optimise manual document processing efforts. We will compare and contrast in real-time how Microsoft and Google differ for processing documents, images, and screenshots, ultimately assisting customers to gain meaningful insights quickly. We will also show how to expand this to include multilingual capabilities, as well as incorporation of the microphone and speakers.