Add commented-out CopySubset shortcut
Add code to speed up CopySubset in the trivial case where a plain memcpy will suffice.
The code is commented out because it hasn't been shown to give a measurable speedup and hasn't been sufficiently tested. It will probably be analyzed during the next round of performance work.
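
For illustration only, a minimal sketch of the kind of shortcut described above. The real CopySubset signature and data layout are not shown in this message, so the function name CopySubsetSketch, its parameters, and the row-major byte layout are all assumptions. The "trivial case" is taken here to mean that every copied row fills its full stride in both buffers, so the whole block is one contiguous run.

```cpp
#include <cstddef>
#include <cstring>

// Hypothetical stand-in for CopySubset (not the actual function): copies a
// rows x cols block of bytes between two row-major buffers whose row
// strides, in bytes, may differ.
void CopySubsetSketch(const unsigned char* src, std::size_t srcStride,
                      unsigned char* dst, std::size_t dstStride,
                      std::size_t rows, std::size_t cols)
{
    // Trivial case (assumed definition): each row fills its stride in both
    // buffers, so the block is contiguous and one memcpy covers all of it.
    if (cols == srcStride && cols == dstStride) {
        std::memcpy(dst, src, rows * cols);
        return;
    }

    // General case: copy one row at a time, honoring each buffer's stride.
    for (std::size_t r = 0; r < rows; ++r) {
        std::memcpy(dst + r * dstStride, src + r * srcStride, cols);
    }
}
```

Whether the single-memcpy path is measurably faster than the row loop depends on row count and row size, which is the question left open for the performance work.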