Name		Name	Last commit message	Last commit date
parent directory ..
images		images
README.md		README.md
ThothSecurityDataset.ipynb		ThothSecurityDataset.ipynb

README.md

Thoth Security Datasets

Thoth Security Datasets contain outputs from two Thoth Security Indicators (SI) Analyzers and aggregated results from those two:

SI-bandit is an analyzer for security indicators based on bandit Python package, a tool designed to find common security issues in Python code. This Python package has different classes of tests:
- B1xx misc tests
- B2xx application/framework misconfiguration
- B3xx blacklists (calls)
- B4xx blacklists (imports)
- B5xx cryptography
- B6xx injection
- B7xx XSS
Each test in a group has two assigned parameters:
- level of SEVERITY.
- level of CONFIDENCE.
that are manually assigned.
SI-cloc is an analyzer for security indicators based on cloc RPM package that counts blank lines, comment lines, and physical lines of source code in many programming languages. It's important to take into account some of the known limitations for this package:
- Lines containing both source code and comments are counted as lines of code.
- Python docstrings can serve several purposes. They may contain documentation, comment out blocks of code, or they can be regular strings (when they appear on the right hand side of an assignment or as a function argument). cloc is unable to infer the meaning of docstrings by context; by default, cloc treats all docstrings as comments. The switch --docstring-as--code treats all docstrings as code.
- Language definition files read with --read-lang-def or --force-lang-def must be plain ASCII text files.

Thoth Security Dataset v2.0

This dataset is made by ~1 SI-bandit reports and ~6385 SI-cloc reports in json format: ~368.4Mb once extracted and it is described in the notebook called Thoth Security Dataset.

This notebook explore results from the two analyzer run in Security Indicator workflow and show the type of analysis and information that Thoth is learning to advice on security.

Some of the results you can find:

If you want to know more just run the notebook!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

thoth-security-dataset

thoth-security-dataset

README.md

Thoth Security Datasets

Thoth Security Dataset v2.0

Files

thoth-security-dataset

Directory actions

More options

Directory actions

More options

Latest commit

History

thoth-security-dataset

Folders and files

parent directory

README.md

Thoth Security Datasets

Thoth Security Dataset v2.0