From 1f6223975563c2eff6ad5bbed42b0bc8cf9a80cf Mon Sep 17 00:00:00 2001 From: Sun Yimin Date: Thu, 23 Dec 2021 21:25:30 +0800 Subject: [PATCH] Updated MFMM (markdown) --- MFMM.md | 19 ++++++++----------- 1 file changed, 8 insertions(+), 11 deletions(-) diff --git a/MFMM.md b/MFMM.md index 73f7caf..9c7f38d 100644 --- a/MFMM.md +++ b/MFMM.md @@ -153,33 +153,30 @@ acc0, acc1, acc2, acc3, acc4, acc5是64位寄存器 考虑以下算法(主要就是一轮加法,一轮减法): acc4, acc3, acc2, acc1 - + acc0, 0, 0, (acc0 - L(acc0*2^32)) - - H(acc0*2^32) L(acc0*2^32) H(acc0*2^32) + + acc0, 0, 0, acc0 + - H(acc0*2^32) L(acc0*2^32) H(acc0*2^32) L(acc0*2^32) =》继续优化 acc4, acc3, acc2, acc1 - + (acc0 - H(acc0*2^32)), 0, 0, (acc0 - L(acc0*2^32)) - - L(acc0*2^32) H(acc0*2^32) + + (acc0 - H(acc0*2^32)), 0, 0, acc0 + - L(acc0*2^32) H(acc0*2^32) L(acc0*2^32) -acc0 - H(acc0 * 2^32) >= 0, acc0 - L(acc0 * 2^32) >= 0 显然。 +acc0 - H(acc0 * 2^32) >= 0显然。 MOVQ acc0, AX MOVQ acc0, DX SHLQ $32, AX SHRQ $32, DX - MOVQ acc0, t0 - SUBQ AX, t0 - SUBQ DX, acc0 - ADDQ t0, acc1 + ADDQ acc0, acc1 ADCQ $0, acc2 ADCQ $0, acc3 ADCQ acc0, acc4 ADCQ $0, acc5 - + SUBQ AX, acc1 SUBQ DX, acc2 SBBQ AX, acc3 - SBBQ $0, acc4 + SBBQ DX, acc4 SBBQ $0, acc5 ### 第三步,计算 X * Y1,并且和tmp相加