copyi.asm

From "a very popular packet of cryptography to" · Assembly code · 86 lines

dnl  Pentium-4 mpn_copyi -- copy limb vector, incrementing.
dnl

dnl  Copyright 1999, 2000, 2001 Free Software Foundation, Inc.
dnl 
dnl  This file is part of the GNU MP Library.
dnl 
dnl  The GNU MP Library is free software; you can redistribute it and/or
dnl  modify it under the terms of the GNU Lesser General Public License as
dnl  published by the Free Software Foundation; either version 2.1 of the
dnl  License, or (at your option) any later version.
dnl 
dnl  The GNU MP Library is distributed in the hope that it will be useful,
dnl  but WITHOUT ANY WARRANTY; without even the implied warranty of
dnl  MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
dnl  Lesser General Public License for more details.
dnl 
dnl  You should have received a copy of the GNU Lesser General Public
dnl  License along with the GNU MP Library; see the file COPYING.LIB.  If
dnl  not, write to the Free Software Foundation, Inc., 59 Temple Place -
dnl  Suite 330, Boston, MA 02111-1307, USA.


dnl  The rep/movsl is very slow for small blocks on pentium4.  Its startup
dnl  time seems to be about 110 cycles.  It then copies at a rate of one
dnl  limb per cycle.  We therefore fall back to an open-coded 2 c/l copying
dnl  loop for smaller sizes.

dnl  Ultimately, we may want to use 64-bit movd or 128-bit movdqu in some
dnl  nifty unrolled arrangement.  Clearly, that could reach much higher
dnl  speeds, at least for large blocks.

include(`../config.m4')


defframe(PARAM_SIZE, 12)
defframe(PARAM_SRC, 8)
defframe(PARAM_DST,  4)

	TEXT
	ALIGN(8)

PROLOGUE(mpn_copyi)
deflit(`FRAME',0)

	movl	PARAM_SIZE, %ecx
	cmpl	$150, %ecx
	jg	L(replmovs)

	movl	PARAM_SRC, %eax
	movl	PARAM_DST, %edx
	movl	%ebx, PARAM_SIZE
	testl	%ecx, %ecx
	jz	L(end)

L(loop):
	movl	(%eax), %ebx
	leal	4(%eax), %eax
	addl	$-1, %ecx
	movl	%ebx, (%edx)
	leal	4(%edx), %edx

	jnz	L(loop)

L(end):
	movl	PARAM_SIZE, %ebx
	ret

L(replmovs):
	cld	C better safe than sorry, see mpn/x86/README

	movl	%esi, %eax
	movl	PARAM_SRC, %esi
	movl	%edi, %edx
	movl	PARAM_DST, %edi

	rep
	movsl

	movl	%eax, %esi
	movl	%edx, %edi

	ret

EPILOGUE()
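
For readers less familiar with x86 assembly, the size-based dispatch in the file above can be sketched roughly in C. This is an illustrative sketch only, not part of GMP: the names `copyi_sketch`, `limb_t`, `REP_MOVS_THRESHOLD`, `cost_open_loop` and `cost_rep_movs` are hypothetical, limbs are assumed to be 32 bits wide, `memmove` merely stands in for the `rep movsl` path, and the two cost functions do nothing more than restate the approximate cycle figures given in the file's comment.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

typedef uint32_t limb_t;          /* assumption: 32-bit limbs, as on x86-32 */

#define REP_MOVS_THRESHOLD 150    /* the constant the assembly compares against */

/* Rough cost estimates taken from the file's comment (approximate figures). */
static long cost_open_loop(long n) { return 2 * n; }    /* about 2 cycles/limb */
static long cost_rep_movs(long n)  { return 110 + n; }  /* startup + 1 cycle/limb */

/* Copy n limbs from src to dst, walking upward through memory. */
static void copyi_sketch(limb_t *dst, const limb_t *src, long n)
{
    assert(n >= 0);               /* assumption: callers pass a non-negative size */

    if (n > REP_MOVS_THRESHOLD) {
        /* Large block: stand-in for the rep movsl path. */
        memmove(dst, src, (size_t)n * sizeof(limb_t));
        return;
    }

    /* Small block: one limb per iteration, like the open-coded loop. */
    for (; n != 0; n--)
        *dst++ = *src++;
}
```

Under these estimates the two paths break even at 110 limbs (2n = 110 + n). The assembly switches at 150 limbs; the file does not state why the cutoff sits above the break-even point.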
