Update handling of quoteStart to prevent sanitization bypass #201

TomAnthony · 2020-07-23T22:29:55Z

There is a sanitization failure scenario impacting cases where the onTag function is used to customise how certain tags are handled. The second parameter to this function is the HTML of the opening tag, complete with attributes, and is susceptible to malformed HTML which means it is not sanitized before being passed to the function. If the onTag customisation function then uses this HTML as part of the output then an XSS is possible.

Sanitizer logic error

The error is at lines 88-91 of the parser:

if ((c === '"' || c === "'")) {
	if (html.charAt(currentPos - i) === "=") {
		quoteStart = c;
		continue;
	}
}

The tokenizer goes into the 'inside quoted value' state when it encounters a " or ' character that is immediately preceded by a =.

However if we supply this input:

<a target= " href="><script>alert(2)</script>">

The target attribute has a space after the = character, so the tokenizer fails to enter the 'inside quoted value' state. It then continues parsing until it meets href=" at which point if erroneously enters the 'inside quoted value' state and parses the subsequent <script> tags as beloning to that attribute, until the " following those tags.

This allows smuggling the <script> tags or other malicious content into the html variable that is passed as the second parameter to the custom onTag function. When the custom tag function uses that value (presuming it has been correctly sanitized) the malicious payload is injected back in the output:

<a target= " href="><script>alert(2)</script>"><span>

Browsers are robust to the target= " snippet including a space, so parse the DOM like so:

<a target=" href=">
	<script>alert(2)</script>
	"&gt;
	<span></span>
</a>

Minimum reproduction

This is a fairly minimal reproduction of the failure case:

var xss = require('xss');

inputData = `<a target= " href="><script>alert(1)</script>"><span>`;

var O = {
	onTag: function(_, E, S) {
		if (S.isWhite && "a" === _) {
			if (S.isClosing)
				return "</span></a>";
			return "".concat(E, '<span>')
		}
	}
};

var html = xss(inputData, O);
console.log(html);

Patch

My suggested fix (bear in mind I'm not a JS person!) is to add a while loop that allows for spaces between = and the quote starting an attribute value. All the tests pass still. I have a PR prepared that also adds a test case for this scenario, but didn't want to submit the PR publicly until I spoke to you.

An alternative approach would try to keep track of whether the last character, ignoring whitespace was an =. However, when I mapped this out it added a lot of complexity.

Tests

All the tests pass still, but I have also added a new test. Let me know if you think more would be sensible.

…new test case for this failure scenario.

lib/parser.js

test/test_custom_method.js

leizongmin · 2020-07-27T02:29:24Z

I just published a new version [email protected] including this changes. Thanks for your pull request.

Update handling of quoteStart to prevent sanitization bypass

Update handling of quoteStart to allow for whitespace after =. Add a …

cdd3e36

…new test case for this failure scenario.

leizongmin reviewed Jul 24, 2020

View reviewed changes

lib/parser.js Outdated Show resolved Hide resolved

test/test_custom_method.js Outdated Show resolved Hide resolved

test/test_custom_method.js Outdated Show resolved Hide resolved

Make coding style project consistent.

433dbd7

leizongmin merged commit 212883e into leizongmin:master Jul 24, 2020

leizongmin added a commit that referenced this pull request May 6, 2021

Merge pull request #201 from TomAnthony/fix-bypass-issue

353ffdc

Update handling of quoteStart to prevent sanitization bypass

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update handling of quoteStart to prevent sanitization bypass #201

Update handling of quoteStart to prevent sanitization bypass #201

TomAnthony commented Jul 23, 2020

leizongmin commented Jul 27, 2020

Update handling of quoteStart to prevent sanitization bypass #201

Update handling of quoteStart to prevent sanitization bypass #201

Conversation

TomAnthony commented Jul 23, 2020

Sanitizer logic error

Minimum reproduction

Patch

Tests

leizongmin commented Jul 27, 2020