-
Notifications
You must be signed in to change notification settings - Fork 4
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
9 changed files
with
88 additions
and
179 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,19 +1,13 @@ | ||
[data:image/s3,"s3://crabby-images/cf07c/cf07c84c369f1737be3023a0217c194643589d75" alt="Build Status"](https://travis-ci.org/meyersbs/SPLAT) [data:image/s3,"s3://crabby-images/cd905/cd905e0a2ca7bdcc1e24610cd29a025951ccf9ef" alt="License"](/LICENSE.md) [data:image/s3,"s3://crabby-images/cf446/cf44603cde9463c4cdae7cd693520502600c0aa3" alt="codecov"](https://codecov.io/gh/meyersbs/SPLAT) | ||
[data:image/s3,"s3://crabby-images/26623/266237b6f07833bf8dc6ba6f778e6033103b5627" alt="PyPI"](https://pypi.python.org/pypi/SPLAT-library/0.3.7) [data:image/s3,"s3://crabby-images/c1005/c1005356b537059ea5b34daf24a6845701a54486" alt="PyPI"](https://pypi.python.org/pypi/SPLAT-library/0.3.7) [data:image/s3,"s3://crabby-images/e85a5/e85a59d28300a965f034d06a69a26cbaee04d318" alt="Website"](http://splat-library.org/) | ||
|
||
<img src="https://cdn.rawgit.com/meyersbs/SPLAT/master/logo.svg" width="20%"> | ||
<img src="https://cdn.rawgit.com/meyersbs/SPLAT/master/docs/logo.svg" width="20%"> | ||
<br> | ||
<img src="https://cdn.rawgit.com/meyersbs/SPLAT/master/tag.svg" width="60%"> | ||
|
||
# <em>WARNING!</em> | ||
Errors in the calculation of some metrics have been found. Calculations of the following metrics may be innaccurate: | ||
* Idea Density | ||
* Content Density | ||
* Syllable Counts | ||
<img src="https://cdn.rawgit.com/meyersbs/SPLAT/master/docs/tag.svg" width="60%"> | ||
|
||
- - - - | ||
## Contact Information | ||
Benjamin S. Meyers < <[email protected]> > | ||
Benjamin S. Meyers <[[email protected]](mailto:[email protected])> | ||
|
||
- - - - | ||
## Project Description | ||
|
@@ -22,7 +16,7 @@ SPLAT is a command-line application designed to make it easy for linguists (both | |
SPLAT is designed to help you gather linguistic features from text files and it is assumed that most input files will not be already annotated. In order for SPLAT to function properly, you should ensure that the input files that you provide do not contain any annotations. Because there are so many variations of linguistic annotation schemes, it would simply be impossible to account for all of them in the initial parsing of input files; it is easier for you to remove any existing annotations than it is for me to do so. | ||
|
||
- - - - | ||
## System Requirementsgit | ||
## System Requirements | ||
SPLAT is being developed and tested on 64-bit Ubuntu 15.10 with Python 3.4.3. Minimum requirements include: | ||
* Python 3.4 or Later | ||
* NLTK 3.1 or Later | ||
|
@@ -34,11 +28,6 @@ SPLAT is being developed and tested on 64-bit Ubuntu 15.10 with Python 3.4.3. Mi | |
2. Run the following in a command line: | ||
``` bash | ||
pip3 install SPLAT-library | ||
|
||
# Recommended, but not required. | ||
echo 'alias splat="splat-cli"' >> ~/.bashrc | ||
echo 'alias splat="splat-cli"' >> ~/.bash_profile | ||
source .bashrc | ||
``` | ||
|
||
To uninstall run the following in a command line. | ||
|
@@ -53,95 +42,18 @@ To uninstall run the following in a command line. | |
splat --help # Provide helpful information | ||
splat --info # Display version and copyright information | ||
splat --usage # Display basic command line structure | ||
splat bubble filename # Display the raw text from the file | ||
splat splat filename # Display the raw text from the file | ||
``` | ||
|
||
- - - - | ||
## Analysis Functionality \& Usage | ||
#### Types \& Tokens | ||
```bash | ||
splat tokens filename # List all Tokens | ||
splat types filename # List all Types | ||
splat ttr filename # Calculate Type-Token Ratio | ||
splat wc filename # Word Count (Token Count) | ||
splat uwc filename # Unique Word Count (Type Count) | ||
``` | ||
##### Parts-Of-Speech | ||
```bash | ||
splat pos filename # List Tokens with their Parts-Of-Speech | ||
splat poscounts filename # List Part-Of-Speech Tags with their Frequencies | ||
``` | ||
#### Syntactic Complexity | ||
```bash | ||
splat cdensity filename # Calculate Content-Density | ||
splat idensity filename # Calculate Idea Density | ||
splat flesch filename # Calculate Flesch Readability Ease | ||
splat kincaid filename # Calculate Flesch-Kincaid Grade Level | ||
splat yngve filename # Calculate Yngve-Score | ||
splat frazier filename # Calculate Frazier-Score | ||
``` | ||
#### Listing Content \& Function Words | ||
```bash | ||
splat function filename # List all Function Words | ||
splat content filename # List all Content Words | ||
splat ufunction filename # Unique Function Words | ||
splat ucontent filename # Unique Content Words | ||
splat cfr filename # Calculate Content-Function Ratio | ||
``` | ||
#### Utterances \& Sentences | ||
```bash | ||
splat utts filename # List all Utterances | ||
splat sents filename # List all Sentences | ||
splat alu filename # Average Utterance Length | ||
splat als filename # Average Sentence Length | ||
splat uttcount filename # Utterance Count | ||
splat sentcount filename # Sentence Count | ||
splat syllables filename # Display Number of Syllables | ||
splat wpu filename # List the Number of Words in each Utterance | ||
splat wps filename # List the number of Words in each Sentence | ||
``` | ||
#### Frequency Distributions | ||
```bash | ||
splat mostfreq filename x # List the x Most Frequent Words | ||
splat leastfreq filename x # List the x Least Frequent Words | ||
splat plotfreq filename x # Draw and Display a Frequency Graph | ||
``` | ||
#### Disfluencies | ||
```bash | ||
splat disfluencies filename # Calculate various Disfluency Counts | ||
splat dpa filename # List the Number of Disfluencies per each Dialog Act | ||
splat dpu filename # List the Number of Disfluencies in each Utterance | ||
splat dps filename # List the Number of Disfluencies in each Sentence | ||
``` | ||
#### Syntactic Parsing | ||
```bash | ||
splat trees filename # List Parse-Tree Strings for each Utterance | ||
splat maxdepth filename # Calculate Max Tree Depth | ||
splat drawtrees filename # Draw Parse Trees | ||
``` | ||
#### Language Modeling | ||
```bash | ||
splat unigrams filename # List all Unigrams | ||
splat bigrams filename # List all Bigrams | ||
splat trigrams filename # List all Trigrams | ||
splat ngrams filename n # List all n-grams | ||
``` | ||
## Functionality \& Usage | ||
|
||
- - - - | ||
## Annotation Functionality \& Usage | ||
```bash | ||
splat annotate filename # Semi-Automatically annotate the Utterances | ||
``` | ||
Coming Soon! | ||
|
||
- - - - | ||
## Acknowledgments | ||
I would like to thank Emily Prud'hommeaux and Cissi Ovesdotter-Alm for their guidance during my initial development process. I would also like to thank Bryan Meyers, my brother, letting me bounce ideas off of him, and for giving me wake-up calls when I was doing something in the less-than-intelligent (stupid) way. | ||
|
||
| Name | Email | Website | GitHub | | ||
|-----|-----|-----|-----| | ||
| Emily Prud'hommeaux | < <[email protected]> > | < [CLaSP](http://www.rit.edu/clasp/people.html) > | | | ||
| Cissi O. Alm | < <[email protected]> > | < [CLaSP](http://www.rit.edu/clasp/people.html) > | | | ||
| Bryan T. Meyers | < <[email protected]> > | < [DataDrake](http://www.datadrake.com/) > | < [GitHub](https://github.com/DataDrake) > | | ||
See [Acknowledgments](http://splat-library.org/#section5). | ||
|
||
- - - - | ||
## Licensing | ||
|
File renamed without changes
File renamed without changes
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,2 @@ | ||
nltk | ||
matplotlib | ||
jsonpickle |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.