musl - musl - an implementation of the standard library for Linux-based systems

Age	Commit message (Collapse)	Author	Lines
2012-05-25	fix regex on arm	Rich Felker	-1/+1
	TRE has a broken assumption that wchar_t is signed, which is a sane expectation, but not required by the standard, and false on ARM's ABI. i leave tre_char_t as wchar_t for now, since a pointer to it is directly passed to functions that need pointer to wchar_t. it does not seem to break anything. and since the maximum unicode scalar value is 0x10ffff, just use that explicitly rather than using the max value of any particular C type.
2012-05-13	remove some no-op end of string tests from regex parser	Rich Felker	-4/+0
	these are cruft from the original code which used an explicit string length rather than null termination. i blindly converted all the checks to null terminator checks, without noticing that in several cases, the subsequent switch statement would automatically handle the null byte correctly.
2012-05-13	another BRE fix: in ^, is literal	Rich Felker	-0/+2
	i don't understand why this has to be conditional on being in BRE mode, but enabling this code unconditionally breaks a huge number of ERE test cases.
2012-05-07	fix error checking for \ at end of regex (this was broken previously)	Rich Felker	-1/+1

2012-05-07	fix copy and paste error in regex code causing mishandling of \) in BRE	Rich Felker	-1/+1

2012-05-07	fix regex breakage in last commit (failure to handle empty regex, etc.)	Rich Felker	-4/+1

2012-05-07	fix ugly bugs in TRE regex parser	Rich Felker	-60/+31
	1. * in BRE is not special at the beginning of the regex or a subexpression. this broke ncurses' build scripts. 2. \\( in BRE is a literal \ followed by a literal (, not a literal \ followed by a subexpression opener. 3. the ^ in \\(^ in BRE is a literal ^ only at the beginning of the entire BRE. POSIX allows treating it as an anchor at the beginning of a subexpression, but TRE's code for checking if it was at the beginning of a subexpression was wrong, and fixing it for the sake of supporting a non-portable usage was too much trouble when just removing this non-portable behavior was much easier. this patch also moved lots of the ugly logic for empty atom checking out of the default/literal case and into new cases for the relevant characters. this should make parsing faster and make the code smaller. if nothing else it's a lot more readable/logical. at some point i'd like to revisit and overhaul lots of this code...
2012-04-28	new fnmatch implementation	Rich Felker	-131/+273
	unlike the old one, this one's algorithm does not suffer from potential stack overflow issues or pathologically bad performance on certain patterns. instead of backtracking, it uses a matching algorithm which I have not seen before (unsure whether I invented or re-invented it) that runs in O(1) space and O(nm) time. it may be possible to improve the time to O(n), but not without significantly greater complexity.
2012-04-26	update fnmatch to POSIX 2008 semantics	Rich Felker	-4/+11
	an invalid bracket expression must be treated as if the opening bracket were just a literal character. this is to fix a bug whereby POSIX left the behavior of the "[" shell command undefined due to it being an invalid bracket expression.
2012-04-14	fix signedness error handling invalid multibyte sequences in regexec	Rich Felker	-2/+2
	the "< 0" test was always false due to use of an unsigned type. this resulted in infinite loops on 32-bit machines (adding -1U to a pointer is the same as adding -1) and crashes on 64-bit machines (offsetting the string pointer by 4gb-1b when an illegal sequence was hit).
2012-04-13	remove invalid code from TRE	Rich Felker	-14/+0
	TRE wants to treat + and ? after a +, ?, or * as special; ? means ungreedy and + is reserved for future use. however, this is non-conformant. although redundant, these redundant characters have well-defined (no-op) meaning for POSIX ERE, and are actually _literal_ characters (which TRE is wrongly ignoring) in POSIX BRE mode. the simplest fix is to simply remove the unneeded nonstandard functionality. as a plus, this shaves off a small amount of bloat.
2012-04-13	fix broken regerror (typo) and missing message	Rich Felker	-2/+2

2012-03-20	upgrade to latest upstream TRE regex code (0.8.0)	Rich Felker	-1168/+1037
	the main practical results of this change are 1. the regex code is no longer subject to LGPL; it's now 2-clause BSD 2. most (all?) popular nonstandard regex extensions are supported I hesitate to call this a "sync" since both the old and new code are heavily modified. in one sense, the old code was "more severely" modified, in that it was actively hostile to non-strictly-conforming expressions. on the other hand, the new code has eliminated the useless translation of the entire regex string to wchar_t prior to compiling, and now only converts multibyte character literals as needed. in the future i may use this modified TRE as a basis for writing the long-planned new regex engine that will avoid multibyte-to-wide character conversion entirely by compiling multibyte bracket expressions specific to UTF-8.
2012-01-23	make glob mark symlinks-to-directories with the GLOB_MARK flag	Rich Felker	-1/+1
	POSIX is unclear on whether it should, but all historical implementations seem to behave this way, and it seems more useful to applications.
2012-01-22	support GLOB_PERIOD flag (GNU extension) to glob function	Rich Felker	-1/+2
	patch by sh4rm4
2011-06-16	duplicate re_nsub in LSB/glibc ABI compatible location	Rich Felker	-1/+1

2011-06-06	fix handling of d_name in struct dirent	Rich Felker	-3/+2
	basically there are 3 choices for how to implement this variable-size string member: 1. C99 flexible array member: breaks using dirent.h with pre-C99 compiler. 2. old way: length-1 string: generates array bounds warnings in caller. 3. new way: length-NAME_MAX string. no problems, simplifies all code. of course the usable part in the pointer returned by readdir might be shorter than NAME_MAX+1 bytes, but that is allowed by the standard and doesn't hurt anything.
2011-06-05	safety fix for glob's vla usage: disallow patterns longer than PATH_MAX	Rich Felker	-0/+2
	this actually inadvertently disallows some valid patterns with redundant / or * characters, but it's better than allowing unbounded vla allocation. eventually i'll write code to move the pattern to the stack and eliminate redundancy to ensure that it fits in PATH_MAX at the beginning of glob. this would also allow it to be modified in place for passing to fnmatch rather than copied at each level of recursion.
2011-06-05	eliminate (harmless in this case) vla usage in fnmatch.c	Rich Felker	-1/+1

2011-04-07	fix bug in TRE found by clang (typo && instead of &)	Rich Felker	-1/+1

2011-02-12	initial check-in, version 0.5.0v0.5.0	Rich Felker	-0/+5364