Skip to content

Commit

Permalink
squash!
Browse files Browse the repository at this point in the history
  • Loading branch information
NikolaRHristov committed Apr 1, 2024
1 parent cae97cc commit 9a7e8a8
Show file tree
Hide file tree
Showing 6 changed files with 23 additions and 23 deletions.
2 changes: 1 addition & 1 deletion Cargo.toml
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@ documentation = "https://docs.rs/unicode-id-start"
edition = "2018"
keywords = ["unicode", "id"]
license = "(MIT OR Apache-2.0) AND Unicode-DFS-2016"
repository = "https://github.com/Boshen/unicode-id-start"
repository = "https://github.com/Boshen/unicode-id-start"
rust-version = "1.31"

[dev-dependencies]
Expand Down
20 changes: 10 additions & 10 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,10 +5,10 @@
Unicode ID_start
=============

[<img alt="github" src="https://img.shields.io/badge/github-dtolnay/unicode--ident-8da0cb?style=for-the-badge&labelColor=555555&logo=github" height="20">](https://github.com/dtolnay/unicode-ident)
[<img alt="github" src="https://img.shields.io/badge/github-dtolnay/unicode--ident-8da0cb?style=for-the-badge&labelColor=555555&logo=github" height="20">](https://github.com/dtolnay/unicode-ident)
[<img alt="crates.io" src="https://img.shields.io/crates/v/unicode-ident.svg?style=for-the-badge&color=fc8d62&logo=rust" height="20">](https://crates.io/crates/unicode-ident)
[<img alt="docs.rs" src="https://img.shields.io/badge/docs.rs-unicode--ident-66c2a5?style=for-the-badge&labelColor=555555&logo=docs.rs" height="20">](https://docs.rs/unicode-ident)
[<img alt="build status" src="https://img.shields.io/github/actions/workflow/status/dtolnay/unicode-ident/ci.yml?branch=master&style=for-the-badge" height="20">](https://github.com/dtolnay/unicode-ident/actions?query=branch%3Amaster)
[<img alt="build status" src="https://img.shields.io/github/actions/workflow/status/dtolnay/unicode-ident/ci.yml?branch=master&style=for-the-badge" height="20">](https://github.com/dtolnay/unicode-ident/actions?query=branch%3Amaster)

Implementation of [Unicode Standard Annex #31][tr31] for determining which
`char` values are valid in programming language identifiers.
Expand Down Expand Up @@ -44,11 +44,11 @@ The remaining columns show the **cost per call** to evaluate whether a single
`char` has the ID\_Start or ID\_Continue Unicode property, comparing across
different ratios of ASCII to non-ASCII codepoints in the input data.

[`unicode-ident`]: https://github.com/dtolnay/unicode-ident
[`unicode-xid`]: https://github.com/unicode-rs/unicode-xid
[`unicode-id`]: https://github.com/Boshen/unicode-id
[`ucd-generate`]: https://github.com/BurntSushi/ucd-generate
[`roaring`]: https://github.com/RoaringBitmap/roaring-rs
[`unicode-ident`]: https://github.com/dtolnay/unicode-ident
[`unicode-xid`]: https://github.com/unicode-rs/unicode-xid
[`unicode-id`]: https://github.com/Boshen/unicode-id
[`ucd-generate`]: https://github.com/BurntSushi/ucd-generate
[`roaring`]: https://github.com/RoaringBitmap/roaring-rs

| | static storage | 0% nonascii | 1% | 10% | 100% nonascii |
|---|---|---|---|---|---|
Expand Down Expand Up @@ -113,7 +113,7 @@ about 7 K to store.
Their data structure is a compressed trie set specifically tailored for Unicode
codepoints. The design is credited to Raph Levien in [rust-lang/rust#33098].

[rust-lang/rust#33098]: https://github.com/rust-lang/rust/pull/33098
[rust-lang/rust#33098]: https://github.com/rust-lang/rust/pull/33098

```rust
pub struct TrieSet {
Expand Down Expand Up @@ -160,8 +160,8 @@ Uses a [finite state transducer][fst]. This representation is built into
representation. In particular `ucd-trie` is optimized for storing Unicode
properties while `fst` is not.

[fst]: https://github.com/BurntSushi/fst
[ucd-generate]: https://github.com/BurntSushi/ucd-generate
[fst]: https://github.com/BurntSushi/fst
[ucd-generate]: https://github.com/BurntSushi/ucd-generate

As far as I can tell, the main thing that causes `fst` to have large size and
slow lookups for this use case relative to `ucd-trie` is that it does not
Expand Down
18 changes: 9 additions & 9 deletions Source/lib.rs
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
//! [![github]](https://github.com/Boshen/unicode-id-start)&ensp;[![crates-io]](https://crates.io/crates/unicode-id-start)&ensp;[![docs-rs]](https://docs.rs/unicode-id-start)
//! [![github]](https://github.com/Boshen/unicode-id-start)&ensp;[![crates-io]](https://crates.io/crates/unicode-id-start)&ensp;[![docs-rs]](https://docs.rs/unicode-id-start)
//!
//! [github]: https://img.shields.io/badge/github-8da0cb?style=for-the-badge&labelColor=555555&logo=github
//! [crates-io]: https://img.shields.io/badge/crates.io-fc8d62?style=for-the-badge&labelColor=555555&logo=rust
Expand Down Expand Up @@ -37,11 +37,11 @@
//! comparing across different ratios of ASCII to non-ASCII codepoints in the
//! input data.
//!
//! [`unicode-ident`]: https://github.com/dtolnay/unicode-ident
//! [`unicode-xid`]: https://github.com/unicode-rs/unicode-xid
//! [`unicode-id`]: https://github.com/Boshen/unicode-id
//! [`ucd-generate`]: https://github.com/BurntSushi/ucd-generate
//! [`roaring`]: https://github.com/RoaringBitmap/roaring-rs
//! [`unicode-ident`]: https://github.com/dtolnay/unicode-ident
//! [`unicode-xid`]: https://github.com/unicode-rs/unicode-xid
//! [`unicode-id`]: https://github.com/Boshen/unicode-id
//! [`ucd-generate`]: https://github.com/BurntSushi/ucd-generate
//! [`roaring`]: https://github.com/RoaringBitmap/roaring-rs
//!
//! | | static storage | 0% nonascii | 1% | 10% | 100% nonascii |
//! |---|---|---|---|---|---|
Expand Down Expand Up @@ -112,7 +112,7 @@
//! Unicode codepoints. The design is credited to Raph Levien in
//! [rust-lang/rust#33098].
//!
//! [rust-lang/rust#33098]: https://github.com/rust-lang/rust/pull/33098
//! [rust-lang/rust#33098]: https://github.com/rust-lang/rust/pull/33098
//!
//! ```rust
//! pub struct TrieSet {
Expand Down Expand Up @@ -159,8 +159,8 @@
//! representation. In particular `ucd-trie` is optimized for storing Unicode
//! properties while `fst` is not.
//!
//! [fst]: https://github.com/BurntSushi/fst
//! [ucd-generate]: https://github.com/BurntSushi/ucd-generate
//! [fst]: https://github.com/BurntSushi/fst
//! [ucd-generate]: https://github.com/BurntSushi/ucd-generate
//!
//! As far as I can tell, the main thing that causes `fst` to have large size
//! and slow lookups for this use case relative to `ucd-trie` is that it does
Expand Down
2 changes: 1 addition & 1 deletion benches/id.rs
Original file line number Diff line number Diff line change
Expand Up @@ -13,7 +13,7 @@
// functions' time, then dividing by one million (ms -> ns).

#![allow(
clippy::incompatible_msrv, // https://github.com/rust-lang/rust-clippy/issues/12257
clippy::incompatible_msrv, // https://github.com/rust-lang/rust-clippy/issues/12257
clippy::needless_pass_by_value,
)]

Expand Down
2 changes: 1 addition & 1 deletion generate/Source/main.rs
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@

#![allow(
clippy::cast_lossless,
clippy::cast_possible_truncation, // https://github.com/rust-lang/rust-clippy/issues/9613
clippy::cast_possible_truncation, // https://github.com/rust-lang/rust-clippy/issues/9613
clippy::let_underscore_untyped,
clippy::match_wild_err_arm,
clippy::module_name_repetitions,
Expand Down
2 changes: 1 addition & 1 deletion tests/compare.rs
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
#![allow(
clippy::incompatible_msrv, // https://github.com/rust-lang/rust-clippy/issues/12257
clippy::incompatible_msrv, // https://github.com/rust-lang/rust-clippy/issues/12257
)]

mod fst;
Expand Down

0 comments on commit 9a7e8a8

Please sign in to comment.