I'm Artemis, a PhD student at the Computer Vision Center (CVC), in the Vision and Language group where I’m doing research on Multimodal Models for Document Understanding.
📅 Sep 2024 – present
- Vision-Language Models for Efficient Long Context Document Understanding
📅 Sep 2022 – Aug 2024
- Research Multimodal Transformers for Multi-Page Document Understanding.
- Visual and Language Fusion for Automatic Inventory of Libraries and Supermarkets.
- Automatic Verification of Multimodal Social Media Posts.
📅 Nov 2021 – Aug 2022 · Part Time
- Developing Multi-Task Models for Document Understanding: Document Classification, Information Extraction, DocVQA.
📅 Jan 2021 – Nov 2023 · Part Time
- Research and Development of Coding Techniques for Satellite Imagery.
- Development of Deep Learning Algorithms for Remote Sensing.
- Designed and trained machine learning algorithms for animal welfare in the Clearfarm project.