For all i386 string assembly functions that don't overlap use END() sothat symbol size information is available.
Add support for size optimised versions of the assembler functions inthe small subdirectory. Provide more compact versions of the functionsused by bootxx in x86.