Includes code for the contrastive user encoder from the EMNLP Findings paper:
- Rocca, R., & Yarkoni, T. (2022). Language models as user encoders: Self-supervised learning of user encodings using transformers. To appear in Findings of the Association for Computational Linguistics: EMNLP 2022 (link coming soon)
- This repository does not include data, but the dataset can be recreated entirely using the scripts made available under `reddit/preprocessing`
- Model classes, the trainer, and other utils can be found under `reddit`
- `notebooks` includes the code needed to replicate the plots presented in the paper, as well as baseline fitting
- `scripts` contains Python training scripts for both triplet-loss training and downstream tasks (a minimal sketch of the triplet objective follows this list)
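Below is a minimal, hedged sketch of what a triplet-loss training step looks like; it uses PyTorch for brevity, and every name in it (`encoder`, `training_step`, the post batches) is hypothetical rather than taken from this repo's actual model classes under `reddit`.

```python
# Illustrative sketch only; the real model classes and training loop
# live under `reddit` and `scripts`. All names here are hypothetical.
import torch
import torch.nn as nn

triplet_loss = nn.TripletMarginLoss(margin=1.0)

def training_step(encoder, anchor_posts, positive_posts, negative_posts):
    """One step: pull encodings of the same user together, push other users apart.

    anchor_posts / positive_posts: batches of posts from the same user
    negative_posts: posts from a different user
    Each is assumed to be a pre-tokenized tensor batch the encoder accepts.
    """
    anchor = encoder(anchor_posts)      # (batch, dim) user encoding
    positive = encoder(positive_posts)  # same user, different posts
    negative = encoder(negative_posts)  # a different user
    return triplet_loss(anchor, positive, negative)
```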
Note: triplet-loss training could be streamlined using HuggingFace's `transformers` library; future refactoring may simplify the current code in this direction (see the sketch below).
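As one hedged illustration of that direction (not the paper's exact setup; the model choice and mean pooling here are assumptions), posts could be encoded with an off-the-shelf `transformers` model and pooled into a single user vector:

```python
# Sketch of a HuggingFace-based user encoder; model name and pooling
# strategy are illustrative assumptions, not this repo's implementation.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
model = AutoModel.from_pretrained("distilbert-base-uncased")

def encode_user(posts: list[str]) -> torch.Tensor:
    """Encode a user's posts and average them into one user vector."""
    inputs = tokenizer(posts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state  # (n_posts, seq_len, dim)
    # Mean-pool over tokens, then over posts, to get one user encoding.
    return hidden.mean(dim=1).mean(dim=0)
```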