Organizations

@LLACorp

llabres/README.md

Hi there 👋

I'm Artemis, a PhD student in the Vision and Language group at the Computer Vision Center (CVC), where I research Multimodal Models for Document Understanding.

EXPERIENCE

Computer Vision Center (CVC)

Predoctoral Researcher

📅 Sep 2024 – present

  • Vision-Language Models for Efficient Long-Context Document Understanding.

Research Engineer

📅 Sep 2022 – Aug 2024

  • Research on Multimodal Transformers for Multi-Page Document Understanding.
  • Visual and Language Fusion for Automatic Inventory of Libraries and Supermarkets.
  • Automatic Verification of Multimodal Social Media Posts.

Internship

📅 Nov 2021 – Aug 2022 · Part Time

  • Development of Multi-Task Models for Document Understanding: Document Classification, Information Extraction, DocVQA.

Group of Interactive Coding of Images (GICI)

Research Engineer

📅 Jan 2021 – Nov 2023 · Part Time

  • Research and Development of Coding Techniques for Satellite Imagery.
  • Development of Deep Learning Algorithms for Remote Sensing.
  • Design and Training of Machine Learning Algorithms for Animal Welfare in the Clearfarm project.

Education

Pinned

  1. DocT5 (Public)

    Multimodal Document Understanding Model

    Python

  2. library-dataset (Public)

    Dataset and code from the paper "Library Dataset: Automatic Inventory as a Many to Many Matching Task"