A collection of some of the projects I have worked on.
A multimodal AI application that converts static images into narrated audio stories by chaining Computer Vision, LLMs, and Text-to-Speech models.