0.8.0 changes #1

oliverchang · 2021-08-05T06:48:30Z

Rename "affects" to "affected", and make it a list of objects.
Represent version ranges using "events" rather than ranges of [introduced, fixed)
Move "package", "ecosystem_specific", "database_specific" to "affected".

- Rename "affects" to "affected", and make it a list of objects.
- Move "package", "ecosystem_specific", "database_specific" to "affected".
- Add "routines" and "platforms" to "affected" as well.
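To make the restructuring concrete, here is a minimal Python sketch of how a pre-0.8.0 entry might be rewritten into the new shape. The old layout used here (a top-level "package" plus an "affects" object whose ranges carry "introduced"/"fixed" directly) is an assumption, `upgrade_entry` is a hypothetical helper and not part of any OSV tooling, and emitting each event as its own single-key object is also an assumption about the final event format.

```python
def upgrade_entry(old):
    """Rewrite an assumed pre-0.8.0 entry so package data lives under "affected"."""
    moved = ("package", "ecosystem_specific", "database_specific")
    # Keep every top-level field that is not moved or renamed.
    new = {k: v for k, v in old.items() if k not in moved and k != "affects"}

    affects = old.get("affects", {})
    ranges = []
    for r in affects.get("ranges", []):
        # An [introduced, fixed) pair becomes a list of events.
        events = []
        if "introduced" in r:
            events.append({"introduced": r["introduced"]})
        if "fixed" in r:
            events.append({"fixed": r["fixed"]})
        new_range = {"type": r["type"], "events": events}
        if "repo" in r:
            new_range["repo"] = r["repo"]
        ranges.append(new_range)

    # "affected" is now a list of objects, one per package.
    item = {k: old[k] for k in moved if k in old}
    item["ranges"] = ranges
    item["versions"] = list(affects.get("versions", []))
    new["affected"] = [item]
    return new
```

A converter like this always yields a single-element "affected" list; entries that cover several packages would append further objects to that list.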

schema.md

oliverchang · 2021-08-09T23:26:45Z

Also moved repo as discussed: 67693a7

joshbressers · 2021-08-18T12:56:03Z

@oliverchang To follow up on our conversation yesterday (or probably your today):

I mentioned I didn't like the new "affected", and I realize why now.

	"affected": [ {
		"package": {
			"ecosystem": string,
			"name": string,
			"purl": string,
		},
		"ranges": [ {
			"type": string,
			"repo": string,
			"events": [ {
				"introduced": string,
				"fixed": string
			} ]
		} ],
		"versions": [ string ],
		"ecosystem_specific": { see description },
		"database_specific": { see description },
	} ],

I think the way CVE overloads the package field in an ID is wrong in the modern world. It made sense 20 years ago.

Here is the example that made me realize this. It's the same problem for copy and pasting code, or embedding source files, so I ask whoever reads this not to nitpick it too much. It's a hard problem.

Let's say I have an npm project. npm will embed the dependencies, so it's VERY common for a project to ship multiple copies of the lodash library, since different packages can depend on different versions.

If there is one vulnerability in lodash, today it would get one ID and you would see your application is affected by the same ID multiple times. This means you actually have to key off package:ID, because just the ID isn't enough useful information.

Now, package:ID as the key may not be the worst thing in the world, but it probably creates some new challenges we don't understand. Today I think just using an ID is the most common use case. I want to think about this more, but I wanted to suggest that allowing an array of packages should not be taken lightly.
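The package:ID keying described here can be sketched in Python as follows. This is an illustration only: the install paths and the IDs `EXAMPLE-0001`/`EXAMPLE-0002` are made up, and `group_findings` is a hypothetical helper rather than part of any scanner.

```python
from collections import defaultdict


def group_findings(findings):
    """Group (install_path, package, vuln_id) tuples by the (package, vuln_id) key."""
    grouped = defaultdict(list)
    for path, package, vuln_id in findings:
        grouped[(package, vuln_id)].append(path)
    return dict(grouped)


# Hypothetical scan result: two embedded copies of lodash hit the same ID.
findings = [
    ("node_modules/a/node_modules/lodash", "lodash", "EXAMPLE-0001"),
    ("node_modules/b/node_modules/lodash", "lodash", "EXAMPLE-0001"),
    ("node_modules/minimist", "minimist", "EXAMPLE-0002"),
]
for (package, vuln_id), paths in group_findings(findings).items():
    print(f"{package}:{vuln_id} found in {len(paths)} location(s)")
```

Keying on the ID alone would collapse the per-package information that the (package, ID) pair preserves.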

kaniini · 2021-08-18T18:10:00Z

I don't follow this argument, any scanner worth anything should automatically dedup any reported vulnerabilities, I think.

joshbressers · 2021-08-18T18:24:37Z

> I don't follow this argument, any scanner worth anything should automatically dedup any reported vulnerabilities, I think.

It's not about the dupes so much as it is about having a way to understand what is vulnerable to a given problem. I could be off my rocker here as well, this makes sense in my head.

Let me just make something up to illustrate my thinking. This is a little unrealistic I know.

Let's say I have a project, and there are two different libraries that run as webservers to service REST requests. The two libraries each share one common source file that is vulnerable to an RCE. In today's model, that would be one ID that both libraries share, even though they are unrelated.

Is that acceptable, or would it be better to give each library their own ID?

From a purely automated perspective, it doesn't really matter. But if I have to discuss this problem with a fellow human, I now have to clarify which library we are talking about as part of the conversation, so the identifier is really package:ID, not just the ID.

kaniini · 2021-08-18T18:31:37Z

I get what you're saying there. In that case, I think a scanner should present it as two vulnerabilities, but prefixing it as subcomponent:ID. But the scanner can do that itself, can't it?

oliverchang · 2021-08-18T22:37:02Z

Thanks for the feedback!

I completely agree that in most cases, there should only ever be one package specified in a single entry (and this is what we strongly recommend), and we really wanted to keep this.

But for ease of adoption, I think we need to make a tradeoff to support interoperability with other schemas (such as CVE and GHSA), which do support, and in rare cases actually have, multiple packages listed (example: https://github.com/advisories/GHSA-wh77-3x4m-4q9g). Otherwise, this adds a burden on them to generate new IDs/entries in a way that may not fit easily into their existing workflows.

We also got feedback from the Go vulnerability DB team, who pointed out that the same package may have different import paths/names (vanity URLs, versioned imports), and they don't want to create separate entries for each because it adds complexity for them.

> I get what you're saying there. In that case, I think a scanner should present it as two vulnerabilities, but prefixing it as subcomponent:ID. But the scanner can do that itself, can't it?

I agree! The scanner would already know about what package it's scanning for, and then it just has to correlate that with what's listed in the vulnerability entry.
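A rough Python sketch of that correlation step, assuming an entry shaped like the "affected" list quoted earlier in this thread. The entry ID, the module paths, and the `matching_packages` helper are all hypothetical.

```python
def matching_packages(entry, scanned):
    """Return 'name:ID' labels for scanned packages that the entry lists."""
    listed = {
        (a["package"]["ecosystem"], a["package"]["name"])
        for a in entry.get("affected", [])
    }
    return [
        f"{name}:{entry['id']}"
        for ecosystem, name in scanned
        if (ecosystem, name) in listed
    ]


# Hypothetical entry covering one Go module under two import paths.
entry = {
    "id": "EXAMPLE-0003",
    "affected": [
        {"package": {"ecosystem": "Go", "name": "example.com/mod"}},
        {"package": {"ecosystem": "Go", "name": "example.com/mod/v2"}},
    ],
}
scanned = [("Go", "example.com/mod/v2"), ("npm", "lodash")]
print(matching_packages(entry, scanned))
```

The scanner supplies the package context, so a single entry with several packages still yields per-package labels on the reporting side.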

joshbressers · 2021-08-18T22:52:16Z

I think this is all fair, I appreciate your explanation

pombredanne · 2021-08-24T11:23:02Z

schema.md

+software to answer the question "is this specific version affected?" without
+having to contain code specific to every different ecosystem. The one exception
+is if the affected versions are valid SemVer 2.0 versions which can be
+accurately summarized by one or more non-overlapping SemVer ranges. In that


@oliverchang Singling out semver as being a perfect version scheme assumes that humans handling semver versions will never break semver. I think that this cannot hold at scale.
IMHO there is no reason to treat SEMVER specially and I explained some rationale in the related CVEProject/cve-schema#87 (comment)

Instead, a version range may best be treated as what it is: a hint for tools and humans to help form a proper versions enumeration.

oliverchang · 2021-08-25

Thanks for the feedback @pombredanne. I agree an explicit list of versions makes this unambiguous in all cases.

I think tooling/validators can help with making sure the SemVer values are valid. One other reason why we need this is that in ecosystems like Go (which uses and enforces SemVer), it's infeasible to enumerate every possible version. Every commit can be a "pseudoversion" (valid SemVer) in Go.

pombredanne · 2021-08-25

Thank you for the clarification. This makes sense now. I had not internalized this aspect of Go modules versioning. That's actually pretty sleek and clean. It creates some extra work when trying to enumerate these when compared to other package types... but that's an issue for tools/db to deal with..!
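As an illustration of how the explicit version list and SEMVER ranges can complement each other, here is a minimal Python sketch. It checks the enumerated "versions" first and only then walks the range events. The `parse` helper is a deliberate simplification that assumes plain MAJOR.MINOR.PATCH strings (no pre-release or build metadata), only "introduced" and "fixed" events are considered, and `is_affected` is a hypothetical name, not the specification's evaluation algorithm.

```python
def parse(version):
    # Assumption: plain MAJOR.MINOR.PATCH only; real SemVer needs a full parser.
    return tuple(int(part) for part in version.split("."))


def is_affected(version, affected):
    """Check the explicit version list first, then any SEMVER range events."""
    if version in affected.get("versions", []):
        return True
    v = parse(version)
    for rng in affected.get("ranges", []):
        if rng.get("type") != "SEMVER":
            continue
        # Flatten events into (version, kind) pairs so they can be sorted.
        events = [
            (parse(value), kind)
            for event in rng["events"]
            for kind, value in event.items()
            if kind in ("introduced", "fixed")
        ]
        vulnerable = False
        for point, kind in sorted(events):
            if v < point:
                break
            # The last event at or before v decides the state.
            vulnerable = kind == "introduced"
        if vulnerable:
            return True
    return False
```

With this shape, a database can enumerate versions where that is feasible and fall back to ranges where it is not, as in the Go pseudoversion case.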

rsc · 2021-08-26T21:09:22Z

schema.md

- `WEB`: A web page of some unspecified kind. 
+The values of "introduced", "fixed" and "limit" are version strings as defined
+by the `affected[].ranges[].type` field. Additionally,
+  - `"introduced"` allows a version of the value `"\*"` to represent a version that


For consistency with CVE and easier conversion I think we should use 0 here.
* meaning zero in one field and * meaning infinity in another is odd anyway.
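A small Python sketch of how a tool might treat "0" as the beginning of time when ordering event versions. The sort key is an assumption for illustration, it reuses the simplifying premise that other versions are plain dotted integers, and `event_sort_key` is a hypothetical name.

```python
def event_sort_key(value):
    """Order "0" before every real version; other values sort numerically."""
    if value == "0":
        # Sentinel: "0" means the range starts before any known version.
        return (0, ())
    # Assumption: remaining values are plain dotted integers such as "1.2.0".
    return (1, tuple(int(part) for part in value.split(".")))


print(sorted(["1.2.0", "0", "1.0.0"], key=event_sort_key))
```

Handling "0" as an explicit sentinel keeps the ordering independent of how a particular version scheme would otherwise parse the string.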

rsc

Other than the 0 I'm happy.

rsc · 2021-08-26T21:50:40Z

LGTM

An implementation of the specification change proposed by ossf/osv-schema#1. The significant change here is that instead of generating multiple entries for reports with multiple packages (in the additional_packages section), we instead generate a single entry that covers all of the packages, and write the same entry for each module path.

Change-Id: Ia9d8e0a82081ab7f5becd20c6adf976f4d6966db
Reviewed-on: https://go-review.googlesource.com/c/vulndb/+/340210
Reviewed-by: kokoro
Reviewed-by: Filippo Valsorda
Trust: Roland Shoemaker
Run-TryBot: Roland Shoemaker
TryBot-Result: Go Bot
Vulndb-Deploy: Roland Shoemaker

oliverchang added 4 commits August 5, 2021 16:41

Support multiple packages in one entry.

1d62adf


various small typos

fc88d28

consistent tense

d91ae70

add clarification

9faf546

oliverchang force-pushed the packages-move branch from fd52520 to 9faf546 Compare August 5, 2021 06:58

nit

af61777

oliverchang force-pushed the packages-move branch from 2701ac0 to af61777 Compare August 5, 2021 07:00

oliverchang requested a review from inferno-chromium August 5, 2021 07:00

changes

03cde24

oliverchang requested a review from rsc August 6, 2021 02:07

rsc reviewed Aug 6, 2021


address comments

8b8793b

oliverchang requested a review from rsc August 6, 2021 04:11

inferno-chromium approved these changes Aug 6, 2021


oliverchang force-pushed the packages-move branch from 48d2b87 to 7a9972b Compare August 9, 2021 23:25

move repo

67693a7

oliverchang force-pushed the packages-move branch from 7a9972b to 67693a7 Compare August 9, 2021 23:26

events

a302f9e

oliverchang force-pushed the packages-move branch from b96053e to a302f9e Compare August 17, 2021 04:20

update date

45dca31

oliverchang changed the title ~~Support multiple packages in one entry.~~ 0.8.0 changes Aug 17, 2021

oliverchang added 5 commits August 17, 2021 14:44

clarify changelog

3c35d8e

tentative earlier_affected field.

9bff9d5

clarifications

bb2ea1b

more clarifications

6d2c44d

typo

73c9ffe

oliverchang force-pushed the packages-move branch from da2d382 to 73c9ffe Compare August 18, 2021 04:33

oliverchang added 4 commits August 19, 2021 12:13

tentative: "reverse" flag

6a3160b

add git graph

c25efee

clarification

be67419

minor nit

9537676

oliverchang force-pushed the packages-move branch 2 times, most recently from 88f212b to f5ebcd3 Compare August 24, 2021 08:15

Remove "reverse". Use "limit" instead.

5b39f49

oliverchang force-pushed the packages-move branch from f5ebcd3 to 5b39f49 Compare August 24, 2021 08:25

pombredanne reviewed Aug 24, 2021


small clarifications

0a6e1c9

rsc reviewed Aug 26, 2021


Use "0" for beginning of time instead.

1261cbf

rsc approved these changes Aug 26, 2021


oliverchang merged commit f1e13a7 into main Aug 26, 2021

Shnatsel mentioned this pull request Aug 27, 2021

Support OSV v0.8 rustsec/rustsec#421

Merged

oliverchang deleted the packages-move branch September 13, 2021 23:12
