Beautiful visualizations of how language differs among document types.
-
Updated
Sep 23, 2024 - Python
Beautiful visualizations of how language differs among document types.
Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Annotated Stylistic Language Dataset with Multiple Personas"
Stylometric Data Mining Library with a focus on identifying Satoshi Nakamoto as a case study.
Python package to deal with PAN corpora and extract stylometric features from text documents.
A command-line tool for masking authorship of text, by changing the writing style with a Large Language Model.
A tool that predicts the dialect of English of an SMS message using recurrent neural networks supplemented with data from Google Trends.
The Python Graphical Authorship Attribution Program — An experimental Python port of the Duquesne University Evaluating Variations in Language Lab's JGAAP.
Usage of stylometry and machine learning in computer forensics - real tools used in 2019 by the polish police. Everything in/for polish language.
Stylometric analysis of poetic texts based on their versification
Comparison of classification power (literary authorship attribution case) of word-based, lemma-based, POS-based and mBERT-based document embeddings, as well as their combinations.
I like the name bu, but I called this User Stylometry Association, or UStylA, in my paper. In short, this just clusters users based on their stylometry - how they write stuff. This ended up as my Senior Honours project at The University of St Andrews. I had more ambitious plans but I didn't have enough time for them. This isn't half bad either t…
Writeprints-Static Feature Set exctraction for Adversarial Stylometry
Project exploring the feasibility of an automated and extensible anti-stylometry tool written in Python.
R+Python code for stylometric analysis on a corpus of Anglophone novels.
On Anonymous Commenting: A Greedy Approach to Balance Utilization and Anonymity for Instagram Users - Accepted at SIGIR 2019
This git repository documents the code base used in a custom argument retrieval system. This git repository documents the code base used in a custom argument retrieval system. The system was build as a part of the Information Retrieval module at the University of Leipzig.
Covers wide range of industry implemented topics. (Course on JOC by IIT Ropar via NPTEL)
A toolkit for analyzing register, genre and style
Add a description, image, and links to the stylometry topic page so that developers can more easily learn about it.
To associate your repository with the stylometry topic, visit your repo's landing page and select "manage topics."