Enabling a MIPS instruction combination for memset which was previously not used, and making it and and strlen instruction combination work with 64-bit mode too. Also describing why not both lb and lbu need to be handled in the strlen_lb_addiu_bne_nop fallback case. (Not much performance improvement in practice, but still.)