Releases: gagolews/stringi
stringi_1.1.3
A CRAN release.
stringi_1.1.2
CRAN bugfix release
stringi_1.0-1
v1.0-1 CRAN release
stringi_0.5-3
- [BACKWARD INCOMPATIBILITY] `stri_install_check` and `stri_install_icudt` are now deprecated. From now on they are supposed to be used only by the `stringi` installer.
- [BUGFIX] #176: a patch for `sys/feature_tests.h` is no longer included (the original file was copyrighted by Sun Microsystems); fixed the "Compiler or options invalid for pre-UNIX 03 X/Open applications and pre-2001 POSIX applications" error by forcing `_XPG6` conformance.
- [BUGFIX] #174: `stri_paste()` did not generate any warning when the recycling rule was violated and `sep==""`.
- [BUGFIX] #170: `setDataDirectory` is no longer called if our ICU source bundle is not used (this used to cause build problems on openSUSE).
- [BUILD TIME] #169: `./configure` now tries to switch to the "standard" C++ compiler if a C++11 one is not properly configured.
- [BUILD TIME] `configure.win` (`Biarch: TRUE`) now mimics `autoconf`'s `AC_SUBST` and `AC_CONFIG_FILES` so that the build process is more similar across different platforms.
- [NEW FEATURE] `stri_info()` now also gives information on which ICU4C is used (system or bundle).
stringi_0.5-2
- [NEW FUNCTIONS] #137: date-time formatting/parsing:
    - `stri_timezone_list()` - lists all known time zone identifiers
    - `stri_timezone_set()`, `stri_timezone_get()` - manage current default time zone
    - `stri_timezone_info()` - basic information on a given time zone
    - `stri_datetime_symbols()` - localizable date-time formatting data
    - `stri_datetime_fstr()` - convert a `strptime`-like format string to an ICU date/time format string
    - `stri_datetime_format()` - convert date/time to string
    - `stri_datetime_parse()` - convert string to date/time object
    - `stri_datetime_create()` - construct date-time objects from numeric representations
    - `stri_datetime_now()` - return current date-time
    - `stri_datetime_fields()` - get values for date-time fields
    - `stri_datetime_add()` - add specific number of date-time units to a date-time object
- [BUGFIX] #168: Build now fails if `icudt` is not available.
- [BACKWARD INCOMPATIBILITY] The second argument to `stri_pad_*()` has been renamed `width`.
- [GENERAL] #69: `stringi` is now bundled with ICU4C 55.1.
- [NEW FUNCTIONS] `stri_extract_*_boundaries()` extract text between text boundaries.
- [NEW FUNCTION] #46: `stri_trans_char()` is a `stringi`-flavoured `chartr()` equivalent.
- [NEW FUNCTION] #8: `stri_width()` approximates the width of a string in a more Unicodish fashion than `nchar(..., "width")`.
- [NEW FEATURE] #149: `stri_pad()` and `stri_wrap()` are now by default based on code point widths instead of the number of code points. Moreover, the default behavior of `stri_wrap()` is now such that it does not get rid of non-breaking, zero width, etc. spaces.
- [NEW FEATURE] #133: `stri_wrap()` silently allows for `width <= 0` (for compatibility with `strwrap()`).
- [NEW FEATURE] #139: `stri_wrap()` gained a new argument: `whitespace_only`.
- [GENERAL] #144: Performance improvements in handling ASCII strings (these affect `stri_sub()`, `stri_locate()` and other string index-based operations).
- [GENERAL] #143: Searching for short fixed patterns (`stri_*_fixed()`) now relies on the current `libC`'s implementation of `strchr()` and `strstr()`. This is very fast e.g. on `glibc` utilizing the `SSE2/3/4` instruction set.
- [GENERAL] #141: a local copy of `icudt*.zip` may be used on package install; see the `INSTALL` file for more information.
- [GENERAL] #165: the `./configure` option `--disable-icu-bundle` forces the use of system ICU when building the package.
- [BUGFIX] locale specifiers are now normalized in a more intelligent way: e.g. `@calendar=gregorian` expands to `DEFAULT_LOCALE@calendar=gregorian`.
- [BUGFIX] #134: `stri_extract_all_words()` did not accept `simplify=NA`.
- [BUGFIX] #132: incorrect behavior in `stri_locate_regex()` for matches of zero length.
- [BUGFIX] stringr/#73: `stri_wrap()` returned `CHARSXP` instead of `STRSXP` on empty string input with the `simplify=FALSE` argument.
- [BUGFIX] #164: libicu-dev usage used to fail on Ubuntu.
- [BUGFIX] #135: C++11 is now used by default (see the `INSTALL` file, however) to build stringi from sources. This is because ICU4C uses the `long long` type which is not part of the C++98 standard.
- [BUGFIX] #154: Dates and other objects with a custom class attribute were not coerced to the character type correctly.
- [BUGFIX] Force ICU `u_init()` call on stringi dynlib load.
- [BUGFIX] #157: many overfull hboxes in the package PDF manual have been corrected.
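
A few of the functions introduced in this release can be tried as follows. This is only a minimal sketch, assuming a reasonably recent `stringi` is installed; the sample strings, the date, and the variable names are illustrative and do not come from the release notes.

```r
library(stringi)

# stri_width(): display width rather than number of code points
w_ascii <- stri_width("abc")       # three narrow characters
w_cjk   <- stri_width("\u4e2d")    # one East Asian wide character

# stri_pad_*(): the second argument is now called `width`
padded <- stri_pad_left("abc", width = 6)

# stri_trans_char(): a chartr()-like character translation
translated <- stri_trans_char("hello", "el", "ip")

# Date-time helpers: build a date-time, shift it by one day, and format it
# with an ICU pattern (the default time zone is used throughout)
t0 <- stri_datetime_create(2015, 4, 17)
t1 <- stri_datetime_add(t0, 1, units = "days")
formatted <- stri_datetime_format(t1, "yyyy-MM-dd")

# Convert a strptime-like format string to an ICU date/time format string
icu_fmt <- stri_datetime_fstr("%Y-%m-%d")
```

Note that `stri_width("\u4e2d")` differs from the code point count, which is what makes width-based padding and wrapping behave differently from the earlier versions.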
stringi_0.4-1
CHANGELOG:
- [IMPORTANT CHANGE] `n_max` argument in `stri_split_*()` has been renamed `n`.
- [IMPORTANT CHANGE] `simplify=TRUE` in `stri_extract_all_*()` and `stri_split_*()` now calls `stri_list2matrix()` with `fill=""`. `fill=NA_character_` may be obtained by using `simplify=NA`.
- [IMPORTANT CHANGE, NEW FUNCTIONS] #120: `stri_extract_words` has been renamed `stri_extract_all_words`, `stri_locate_boundaries` has been renamed `stri_locate_all_boundaries`, and `stri_locate_words` has been renamed `stri_locate_all_words`. New functions are now available: `stri_locate_first_boundaries`, `stri_locate_last_boundaries`, `stri_locate_first_words`, `stri_locate_last_words`, `stri_extract_first_words`, `stri_extract_last_words`.
- [IMPORTANT CHANGE] #111: `opts_regex`, `opts_collator`, `opts_fixed`, and `opts_brkiter` can now be supplied individually via `...`. In other words, you may now simply call e.g. `stri_detect_regex(str, pattern, case_insensitive=TRUE)` instead of `stri_detect_regex(str, pattern, opts_regex=stri_opts_regex(case_insensitive=TRUE))`.
- [NEW FEATURE] #110: Fixed pattern search engine's settings can now be supplied via the `opts_fixed` argument in `stri_*_fixed()`, see `stri_opts_fixed()`. A simple (not suitable for natural language processing) yet very fast `case_insensitive` pattern matching can be performed now. `stri_extract_*_fixed` is again available.
- [NEW FEATURE] #23: `stri_extract_all_fixed`, `stri_count`, and `stri_locate_all_fixed` may now also look for overlapping pattern matches, see `?stri_opts_fixed`.
- [NEW FEATURE] #129: `stri_match_*_regex` gained a `cg_missing` argument.
- [NEW FEATURE] #117: `stri_extract_all_*()`, `stri_locate_all_*()`, `stri_match_all_*()` gained a new argument: `omit_no_match`. Setting it to `TRUE` makes these functions compatible with their `stringr` equivalents.
- [NEW FEATURE] #118: `stri_wrap()` gained `indent`, `exdent`, `initial`, and `prefix` arguments. Moreover, Knuth's dynamic word wrapping algorithm now assumes that the cost of printing the last line is zero, see #128.
- [NEW FEATURE] #122: `stri_subset()` gained an `omit_na` argument.
- [NEW FEATURE] `stri_list2matrix()` gained an `n_min` argument.
- [NEW FEATURE] #126: `stri_split()` is now also able to act just like `stringr::str_split_fixed()`.
- [NEW FEATURE] #119: `stri_split_boundaries()` now has `n`, `tokens_only`, and `simplify` arguments. Additionally, `stri_extract_all_words()` is now equipped with a `simplify` arg.
- [NEW FEATURE] #116: `stri_paste()` gained a new argument: `ignore_null`. Setting it to `TRUE` makes this function more compatible with `paste()`.
- [NEW FEATURE] #114: `stri_paste()`: `ignore_null` arg has been added.
- [OTHER] #123: `useDynLib` is used to speed up symbol look-up in the compiled dynamic library.
- [BUGFIX] #94: Run-time errors on Solaris caused by setting `-DU_DISABLE_RENAMING=1` -- memory allocation errors in i.a. ICU's UnicodeString. This setting also caused some ABSan sanity check failures within ICU code.
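
The option-passing changes listed above (#111, #23, #117) can be illustrated with a short sketch. It assumes a reasonably recent `stringi`; the input vector `x` and the result names are hypothetical and chosen only for illustration.

```r
library(stringi)

x <- c("Apple", "banana", "APPLE pie")

# #111: an engine option passed directly via `...` ...
short_form <- stri_detect_regex(x, "apple", case_insensitive = TRUE)
# ... is equivalent to building the options list explicitly
long_form <- stri_detect_regex(
  x, "apple",
  opts_regex = stri_opts_regex(case_insensitive = TRUE)
)

# #23: overlapping matches for fixed patterns ("aba" occurs at 1 and 3)
n_plain   <- stri_count_fixed("ababab", "aba")
n_overlap <- stri_count_fixed("ababab", "aba", overlap = TRUE)

# #117: omit_no_match = TRUE yields an empty vector instead of NA
# for strings with no match
no_match <- stri_extract_all_regex("abc", "[0-9]", omit_no_match = TRUE)
```

Passing options through `...` keeps common calls short, while the explicit `stri_opts_*()` form remains convenient when the same settings are reused across several calls.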
stringi_0.3-1
Another CRAN release.
stringi_0.2-5
Second CRAN release.
stringi_0.1-25
First CRAN release.