⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 howto_sse2.lyx

📁 软件无线电的平台
💻 LYX
📖 第 1 页 / 共 5 页
字号:
\layout StandardCopies words from the high quadword of the source operand (second operand) and inserts them in the high quadword of the destination operand (first operand) at word locations selected with the order operand (third operand). For the PSHUFHW instruction, each 2-bit field in the order operand selects the contents of one word location in the high quadword of the destination operand. The binary encodings of the order operand fields select words (0, 1, 2, or 3 4) from the high quadword of the source operand to be copied to the destination operand. The low quadword of the source operand is copied to the low quadword of the destination operand.\layout ItemizePSHUFLW xmm1, xmm2/m128, imm8 \layout StandardSource: an XMM register or a 128-bit memory location. \layout StandardDestination: an XMM register. \layout StandardOrder: an 8-bit immediate.\layout StandardLatency: 2 Throughput: 2\layout StandardPurpose:\layout StandardCopies words from the low quadword of the source operand (second operand) and inserts them in the low quadword of the destination operand (first operand) at word locations selected with the order operand (third operand). For the PSHUFLW instruction, each 2-bit field in the order operand selects the contents of one word location in the low quadword of the destination operand. The binary encodings of the order operand fields select words (0, 1, 2, or 3) from the low quadword of the source operand to be copied to the destination operand. The high quadword of the source operand is copied to the high quadword of the destination operand.\layout StandardNote that this instruction permits a word in the low quadword of the source operand to be copied to more than one word location in the low quadword of the destination operand.\layout Standard\begin_inset Graphics	filename images_sse2/NewPSHUFLW.gif	lyxscale 55	scale 55	keepAspectRatio\end_inset \layout ItemizePSHUFD xmm1, xmm2/m128, imm8 \layout StandardSource: an XMM register or a 128-bit memory location. \layout StandardDestination: an XMM register. \layout StandardOrder: an 8-bit immediate.\layout StandardLatency: 4 Throughput: 2\layout StandardPurpose: Copies doublewords from source operand (second operand) and inserts them in the destination operand (first operand) at locations selected with the order operand (third operand). Each 2-bit field in the order operand selects the contents of one doubleword location in the destination operand. For example, bits 0 and 1 of the order operand selects the contents of doubleword 0 of the destination operand. The encoding of bits 0 and 1 of the order operand determines which doubleword from the source operand will be copied to doubleword 0 of the destination operand.\layout Standard\begin_inset Graphics	filename images_sse2/NewPSHUFD.gif	lyxscale 55	scale 55	keepAspectRatio\end_inset \layout SubsectionLogical Instructions\layout StandardMMX-\layout StandardUsage: instruction destination, source \layout StandardDestination: MMX register. \layout StandardSource: MMX register or 64-bit memory operand.\layout ItemizePAND Bitwise AND\layout StandardLatency : 2 Throughput : 1\layout StandardPurpose:\layout StandardPAND performs a bitwise logical AND on 64 bits of the source and destination operands, and returns the result to destination. Each bit of the result of the PAND instruction is set to 1 if the corresponding bits of the operands are 1. Otherwise, it is set to 0.\layout Standard\begin_inset Graphics	filename images_mmx/NewPAND.gif	lyxscale 65	scale 55	keepAspectRatio\end_inset \layout ItemizePANDN Bitwise AND NOT\layout StandardLatency : 2 Throughput : 1\layout StandardPurpose:\layout StandardPANDN performs a bitwise logical NOT on the 64 bits of the destination operand. The NOT inverts each of the 64 bits of the destination register so that every 1 becomes a 0, and visa versa. The instruction then performs a bitwise logical AND on the inverted 64 bits of the destination operand and on the source operand. Each bit of the result of the AND instruction is set to 1 if the corresponding bits are 1. Otherwise, it is set to 0. The result is written to the destination register. \layout ItemizePOR Bitwise OR\layout StandardLatency : 2 Throughput : 1\layout StandardPurpose:\layout StandardPOR performs a bitwise logical OR on 64 bits of the destination and source operands, and writes the result to the destination register. Each bit of the result is set to 0 if the corresponding bits of the two operands are 0. Otherwise, the bit is 1.\layout ItemizePXOR Bitwise XOR\layout StandardLatency : 2 Throughput : 1\layout StandardPurpose:\layout StandardThe PXOR instruction performs a bitwise logical XOR on the 64 bits of the destination with the source operands and writes the result to destination register. Each bit of the result is 1 if the corresponding bits of the two operands are different. Each bit is 0 if the corresponding bits of the operands are the same.\layout StandardSSE-\layout ItemizeANDNPS xmm1,xmm2/mem128 \layout StandardLatency : 4 Throughput : 2\layout StandardPurpose:\layout StandardANDNPS inverts the bits of the four single-precision floating-point values in the destination register, and then performs a logical AND between the four single-precision floating-point values in the source operand and the temporary inverted result, storing the result in the destination register.\layout StandardThe destination is an XMM register. The source operand can be either an XMM register or a 128-bit memory location. B.5.4 ANDPS: Bitwise Logical AND For Single FP\layout ItemizeANDPS xmm1,xmm2/mem128 \layout StandardLatency : 4 Throughput : 2\layout StandardPurpose:\layout StandardANDPS performs a bitwise logical AND of the four single-precision floating point values in the source and destination operand, and stores the result in the destination register.\layout StandardThe destination is an XMM register. The source operand can be either an XMM register or a 128-bit memory location. \layout ItemizeORPS xmm1,xmm2/m128 \layout StandardLatency : 4 Throughput : 2\layout StandardPurpose:\layout StandardORPS return a bit-wise logical OR between xmm1 and xmm2/mem, and stores the result in xmm1. If the source operand is a memory location, it must be aligned to a 16-byte boundary\layout ItemizeXORPS xmm1,xmm2/m128\layout StandardLatency : 4 Throughput : 2\layout StandardPurpose:\layout StandardXORPS returns a bit-wise logical XOR between the source and destination operands, storing the result in the destination operand. \layout StandardSSE2-\layout ItemizeANDPD xmm, xmm \layout StandardLatency: 4 Throughput: 2\layout StandardPurpose:\layout StandardParallel and operation on all 128 bits\layout ItemizeANDNPD xmm, xmm \layout StandardLatency: 4 Throughput: 2\layout StandardPurpose:\layout StandardParallel and not operation on all 128 bits\layout ItemizeORPD xmm, xmm \layout StandardLatency: 4 Throughput: 2\layout StandardPurpose:\layout ItemizeXORPD xmm, xmm \layout StandardLatency: 4 Throughput: 2\layout StandardPurpose:\layout StandardParallel xor operation on all 128 bits\layout SubsectionShift Instructions\layout StandardMMX-\layout StandardUsage: instruction destination, source \layout StandardDestination: MMX register. \layout StandardSource(count operand): Either MMX register, a 64-bit memory operand, or an immediate 8-bit operand.\layout ItemizePSLLD Packed Shift Left Logical, Doubleword\layout StandardLatency : 2 Throughput : 1\layout StandardPurpose:\layout StandardPSLLD shifts the bits of the first operand to the left by the amount of bits specified in the count operand and returns the result to the destination register. The empty low-order bits are cleared (set to zero). If the value specified by the count operand is greater than 31 (1Fh) the destination is set to all zeros.\layout ItemizePSLLQ Packed Shift Left Logical, Quadword\layout StandardLatency : 2 Throughput : 1\layout StandardPurpose:\layout StandardPSLLQ shifts the bits of the first operand to the left by the amount of bits specified in the count operand and returns the result to the destination register. The empty low-order bits are cleared (set to zero). If the value specified by the second operand is greater than 63 (3Fh) the destination is set to all zeros.\layout ItemizePSLLW Packed Shift Left Logical, Word\layout StandardLatency : 2 Throughput : 1\layout StandardPurpose:\layout StandardPSLLW shifts the bits of the first operand to the left by the amount of bits specified in the count operand and returns the result to the destination register. The empty low-order bits are cleared (set to zero). If the value specified by the second operand is greater than 15 (0Fh) the destination is set to all zeros. \layout Standard\begin_inset Graphics	filename images_mmx/NewPSLL.gif	lyxscale 65	scale 55	keepAspectRatio\end_inset \layout ItemizePSRLD Packed Shift Right Logical, Doubleword\layout StandardLatency : 2 Throughput : 1\layout StandardPurpose:\layout StandardPSRLD performs a shift-logical-right operation on each of the two doublewords in the destination operand. The count operand determines how many bits to right-shift. The new high-order bits are cleared (set to zero). If the value specified by the count operand is greater than 31 (1Fh) the destination is set to all zeros.\layout ItemizePSRLQ Packed Shift Right Logical, Quadword\layout StandardLatency : 2 Throughput : 1\layout StandardPurpose:\layout StandardPSRLQ performs a shift-logical-right operation on the quardword destination operand. The count operand determines how many bits to right-shift. The new high-order bits are cleared (set to zero). If the value specified by the count operand is greater than 63 (3Fh) the destination is set to all zeros.\layout ItemizePSRLW Packed Shift Right Logical, Word\layout StandardLatency : 2 Throughput : 1\layout StandardPurpose:\layout StandardPSRLW performs a shift-logical-right operation on each of the four words in the destination operand. The count operand determines how many bits to right-shift. The new high-order bits are cleared (set to zero). If the value specified by the count operand is greater than 15 (0Fh) the destination is set to all zeros. \layout Standard\begin_inset Graphics	filename images_mmx/NewPSRL.gif	lyxscale 65	scale 55	keepAspectRatio\end_inset \layout ItemizePSRAD Packed Shift Right Arithmetic, Doubleword\layout StandardLatency : 2 Throughput : 1\layout StandardPurpose:\layout StandardPSRAD performs a shift-arithmetic-right (SAR) operation on each of the two doublewords in the destination operand. The count operand determines the number of bits to right-shift. The new high-order bits of each doubleword are filled with the initial value of the sign bit of the doubleword. If the value specified by count is greater than 31 (1Fh), each doubleword is filled with the initial value of its sign bit.\layout ItemizePSRAW Packed Shift Right Arithmetic, Word\layout StandardLatency : 2 Throughput : 1\layout StandardPurpose:\layout StandardPSRAW performs a shift-arithmetic-right (SAR) operation on each of the four words in the destination operand. The count operand determines the number of bits to right-shift. The new high-order bits of each word are filled with the initial value of the sign bit of the word. If the value specified by count is greater than 15 (0Fh), each word is filled with the initial value of its sign bit.\layout Standard\begin_inset Graphics	filename images_mmx/NewPSRA.gif	lyxscale 65	scale 55	keepAspectRatio\end_inset \layout StandardSSE2-\layout ItemizePSLLDQ xmm1, imm8 \layout StandardDestination: an XMM register.\layout StandardCount: an 8-bit immediate.\layout StandardLatency: 4 Throughput: 2\layout StandardPurpose:\layout StandardShifts the destination operand (first operand) to the left by the number of bytes specified in the count operand (second operand). The empty low-order bytes are cleared (set to all 0s). If the value specified by the count operand is greater than 15, the destination operand is set to all 0s. \layout Standard\begin_inset Graphics	filename images_sse2/NewPSLLDQ.gif	lyxscale 55	scale 55	keepAspectRatio\end_inset \layout ItemizePSRLDQ xmm1, imm8 \layout StandardDestination: an XMM register.\layout StandardCount: an 8-bit immediate.\layout StandardLatency: 4 Throughput: 2\layout StandardPurpose:\layout StandardShifts the destination operand (first operand) to the right by the number of bytes specified in the count operand (second operand). The empty

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -