musl/src/string, branch v0.9.8

simplify logic in stpcpy; avoid copying first aligned byte twice

2012-10-22T19:17:09+00:00

gcc seems to be generating identical or near-identical code for both
versions, but the newer code is more expressive of what it's doing.

add memmem function (gnu extension)

2012-10-16T03:02:57+00:00

based on strstr. passes gnulib tests and a few quick checks of my own.

optimize strchrnul/strcspn not to scan string twice on no-match

2012-09-27T21:19:09+00:00

when strchr fails, and important piece of information already
computed, the string length, is thrown away. have strchrnul (with
namespace protection) be the underlying function so this information
can be kept, and let strchr be a wrapper for it. this also allows
strcspn to be considerably faster in the case where the match set has
a single element that's not matched.

slightly cleaner strlen, also seems to compile to better code

2012-09-27T20:56:33+00:00

testing with gcc 4.6.3 on x86, -Os, the old version does a duplicate
null byte check after the first loop. this is purely the compiler
being stupid, but the old code was also stupid and unintuitive in how
it expressed the check.

asm for memmove on i386 and x86_64

2012-09-10T23:04:24+00:00

for the sake of simplicity, I've only used rep movsb rather than
breaking up the copy for using rep movsd/q. on all modern cpus, this
seems to be fine, but if there are performance problems, there might
be a need to go back and add support for rep movsd/q.

reenable word-at-at-time copying in memmove

2012-09-10T22:16:11+00:00

before restrict was added, memove called memcpy for forward copies and
used a byte-at-a-time loop for reverse copies. this was changed to
avoid invoking UB now that memcpy has an undefined copying order,
making memmove considerably slower.

performance is still rather bad, so I'll be adding asm soon.

use restrict everywhere it's required by c99 and/or posix 2008

2012-09-07T02:44:55+00:00

to deal with the fact that the public headers may be used with pre-c99
compilers, __restrict is used in place of restrict, and defined
appropriately for any supported compiler. we also avoid the form
[restrict] since older versions of gcc rejected it due to a bug in the
original c99 standard, and instead use the form *restrict.

remove dependency of wmemmove on wmemcpy direction

2012-09-07T00:28:42+00:00

unlike the memmove commit, this one should be fine to leave in place.
wmemmove is not performance-critical, and even if it were, it's
already copying whole 32-bit words at a time instead of bytes.

remove dependency of memmove on memcpy direction

2012-09-07T00:25:48+00:00

this commit introduces a performance regression in many uses of
memmove, which will need to be addressed before the next release. i'm
making it as a temporary measure so that the restrict patch can be
committed without invoking undefined behavior when memmove calls
memcpy with overlapping regions.

memcpy asm for i386 and x86_64

2012-08-12T01:33:13+00:00