Efficient Optimization Method for Polynomial Selection

Kim, Suhri;Kwon, Heetaek;Lee, Yongseong;Chang, Nam Su;Yoon, Kisoon;Kim, Chang Han;Park, Young-Ho;Hong, Seokhie;

doi:10.13089/JKIISC.2016.26.3.631

Journal of the Korea Institute of Information Security & Cryptology (정보보호학회논문지)

Volume 26 Issue 3
/
Pages.631-643
/
2016
/
1598-3986(pISSN)
/
2288-2715(eISSN)

Korea Institute of Information Security and Cryptology (한국정보보호학회)

DOI QR Code

Efficient Optimization Method for Polynomial Selection

다항식 선택을 위한 효율적인 최적화 기법

Kim, Suhri (Center for Information Security Technologies(CIST), Korea University) ;
Kwon, Heetaek (Center for Information Security Technologies(CIST), Korea University) ;
Lee, Yongseong (Center for Information Security Technologies(CIST), Korea University) ;
Chang, Nam Su (Sejong Cyber University) ;
Yoon, Kisoon (NSHC) ;
Kim, Chang Han (Semyung University) ;
Park, Young-Ho (Sejong Cyber University) ;
Hong, Seokhie (Center for Information Security Technologies(CIST), Korea University)

김수리 (고려대학교 정보보호연구원) ;
권희택 (고려대학교 정보보호연구원) ;
이용성 (고려대학교 정보보호연구원) ;
장남수 (세종사이버대학교) ;
윤기순 (엔에스에이치씨) ;
김창한 (세명대학교) ;
박영호 (세종사이버대학교) ;
홍석희 (고려대학교 정보보호연구원)

Received : 2016.05.02
Accepted : 2016.06.02
Published : 2016.06.30

https://doi.org/10.13089/JKIISC.2016.26.3.631 Citation PDF KSCI HTML

Download PDF

⟨ Previous Next ⟩

Abstract

Currently, General Number Field Sieve(GNFS) is known as the most efficient way for factoring large numbers. CADO-NFS is an open software based on GNFS, that was used to factor RSA-704. Polynomial selection in CADO-NFS can be divided into two stages - polynomial selection, and optimization of selected polynomial. However, optimization of selected polynomial in CADO-NFS is an immense procedure which takes 90% of time in total polynomial selection. In this paper, we introduce modification of optimization stage in CADO-NFS. We implemented precomputation table and modified optimization algorithm to reduce redundant calculation for faster optimization. As a result, we select same polynomial as CADO-NFS, with approximately 40% decrease in time.

현재까지 알려진 가장 효율적인 인수분해 방법은 General Number Field Sieve (GNFS)를 이용하는 방법이다. CADO-NFS는 GNFS를 기반으로 구현된 공개된 소프트웨어로 RSA-704의 인수분해에 사용된 도구이다. CADO-NFS에서 다항식 선택은 크게 다항식을 생성하는 과정과 이를 최적화하는 과정으로 나누어져 있다. 그러나 CADO-NFS에서 다항식의 최적화 과정은 전체 다항식 선택 소요 시간 중 약 90%를 차지할 정도로 큰 부하를 주고 있다. 본 논문에서는 사전 연산 테이블을 이용하여 다항식 최적화 과정의 부하를 줄이는 방안을 제안한다. 제안하는 방법은 기존 CADO-NFS의 다항식과 같은 다항식을 선택하지만, 다항식 선택에 걸리는 시간은 약 40% 감소한다.

Keywords

I. Introduction

RSA cryptosystem is one of the most widely used public key cryptosystem for providing privacy and ensuring authenticity of digital data. The security of RSA cryptosystem is based on hardness of factoring large numbers. In RSA cryptosystem, public modulus N is chosen as product of two distinct primes p, q of same size[1]. Let e, d be two integers satisfying ed = 1 modΦ(N) where Φ(N) = (p – 1)(q – 1). The pair (N, e) is called public key and d is called private key. Note that by factoring N = pq to obtain p,q, it is easy to find private key. Hence integer factorization is one of main topics for research.

General Number Field Sieve (GNFS) is currently best known algorithm for factoring large numbers over 110 digits[9]. Factoring based on GNFS is recently performed by Kleinjung et al. in 2009 for RSA-768[6], and also by Bai et al. in 2012 for RSA-704[5].

Implementation of GNFS includes GGNFS, Msieve, and CADO-NFS. GGNFS is best optimized for factoring up to 160 digit integers and Msieve is best optimized for factoring up to 130 digit integers. Meanwhile CADO-NFS was used to factor RSA-704 (212 digit) so that it is reasonable to consider CADO-NFS for factoring integers over 300 digits[9].

CADO-NFS selects polynomial using Kleinjung’s second algorithm and optimizes selected polynomial in two perspective for better performance in sieving stage[2]. Since optimization of selected polynomial takes roughly 90% of total polynomial selection stage, modification is needed for faster polynomial selection. We conclude that redundant calculation of values delays optimization time even more. Hence we implemented precomputation table and modified root optimization procedure for faster selection. In this way, we can generate polynomial with same Murphy E value as CADO-NFS in shorter time.

This paper is organized as follows : In Section 2, we cover the concept of GNFS and describe properties that good polynomial should have. In Section 3, we briefly describe optimization of polynomials and focus on its implementation in CADO-NFS. In section 4, we present our modification of root optimization process. Following experimental result for our modification is presented in Section 5. Finally, we conclude our result in Section 6.

II. Background

In this Section, we briefly describe modern factoring algorithm, GNFS. Next, we introduce classical method for polynomial selection in GNFS. Lastly, two measurement that quantifies polynomial’s qualities are presented.

2.1 General Number Field Sieve

Most of modern factoring algorithms are based on ‘Difference of squares.’ For n = pq where p, q are two distinct primes, ‘Difference of squares’ focuses on finding two random integers x, y such that x ≠ ymodn and x² ≡ y²modn. Then by computing gcd(x – y, n) and gcd(x + y, n), we can obtain non-trivial factor of n with high probability. This idea is extended to Quadratic Sieve(QS) and later developed into GNFS.

Instead of searching for random integers as in ‘Difference of squares,’ GNFS searches irreducible monic polynomial f(x) of degree greater than 1 and monic linear polynomial g(x) such that f(x) and g(x) have common root modulo n[7,8].

Let α and β be roots of f(x), g(x) not in Z_n, respectively, and consider rings Z[α], Z[β]. Goal in GNFS is to find (a, b) pairs such that a – bα and a – bβ are smooth over chosen basis of primes. We say that an element is smooth if all of its factors are member of our chosen basis of primes.

We collect (a, b) pairs where \(\prod(a-b \alpha)=X^{2}\) for X ∈ Z[α] and \(\prod(a-b \beta)=Y^{2}\) for Y ∈ Z[β]. Consider homomorphisms from ring Z[α] and Z[β] to Z_n that maps α and β to m. Then there exist x,y ∈ Z_n such that X,Y are mapped to x, y respectively. Hence x² ≡ y²modn is again obtained and non-trivial factors of n can be found with high probability.

Generally, GNFS is divided into four stages – polynomial selection, sieving, linear algebra, square roots – but we focus on polynomial selection stage of GNFS. Namely, polynomial selection where we select f(x) and g(x). This is because that sieving takes over 90%, 80% of total time for factoring 512 bit, 768 bit numbers, respectively, and choice of polynomial dramatically affect time to complete sieving. In next Section, we briefly describe method for polynomial selection.

2.2 Classical Polynomial Selection

Classic way to generate polynomial is using base-m method[11]. The base-m method expresses number n to be factored as n = m^d + c_d–1m^d–¹ + … + c₀, where |c_i| ≤ \(\frac{m}{2}\)for each i, and generates two monic polynomials f(x) of degree d and g(x) = x - m of degree 1 with mmodn as common root between f(x) and g(x). To reduce size of coefficient of f(x), this method is modified to select non-monic polynomial \(f(x)=\sum_{i=0}^{d} c_{i} x^{i}\) and g(x) = m₂x – m₁ such that \(n=\sum_{i=0}^{d} c_{d-i} m_{1}^{d-i} m_{2}^{i}\) [3]. We choose m to be close to (n/c_d)^1/d[4]. If c_d–₂ is not small enough, try another c_d. Otherwise we optimize the generated polynomial pair.

2.3 Quantifying Quality of Polynomials

In Section 2.1, we highlighted the importance of selecting good polynomial. Since selection of polynomial greatly affect the number of relations to be found, we want to select good polynomial in order to expect good performance in sieving. Hence quantifying quality of found polynomials is needed. In this subsection, we present three measurements of quality, namely, lognorm, α-value, and Murphy E. Lognorm and α-value relates to size and root property respectively, and Murphy E score is combination of size and root properties.

Recall that aim of sieving stage is to collect many relations (a,b) such that a -bα is smooth over chosen basis of primes where α is root of f(x). Generally, chosen basis consist of small primes and hence small value of norm of f(x) is more likely to be factored by such basis. Thus, we calculate ‘lognorm’ of polynomial. Lognorm is logarithmic average of polynomial values across sieving region, and it is computed as below[4].

\(\frac{1}{2} \log \left(s^{-d} \int_{0}^{2 \pi} \int_{0}^{1} F^{2}(s \cos \theta, \sin \theta) r^{2 d+1} d r d \theta\right)\).

In above equation, s refers to skewness of sieving region, calculated by ratio of a, b [4]. Hence small lognorm means size of polynomial is small so that it is more likely to be smooth over our chosen basis of primes. Thus we are searching for polynomial with smaller lognorm as possible. Since g(x) is linear, we may assume size of g(x) does not vary much across sieving region than f(x). Therefore, in practice we only focus on lognorm of f(x).

If a polynomial f(x) has many roots modulo small prime powers, then we can expect that polynomial values to behave more smoothly than random integers about the same size. We define expected p-valuation of x as v_p(x), where value of v_p(x) is exponent of the largest power of prime p dividing x in set of integers S, and v_p(0) = ∞. We use same notation for polynomials to let v_p(f) refer to expected p-valuation of f in set S.

Murphy defined α(F) function to compare cumulated expected p-valuation of polynomial values to random integers of similar size[4]. Hence, α(F) can be considered as logarithmic benefit of using polynomial values compared to using random integers. We call α(F) as α-value of polynomial and is defined as below.

\(\begin{gathered} \alpha(F)=\sum_{p \leq B}\left(\frac{1}{p-1}-\frac{n_{p} p}{p^{2}-1}\right) \log p \\ =\sum_{p \leq B}\left(1-\frac{n_{p} P}{p+1}\right) \frac{\log p}{p-1} \end{gathered}.\)

In above equation, n_p refers to number of simple or multiple root of f(x) modulo p^e for p^e ≤ B . Combining f(x) and g(x) we can approximate number of sieving reports as equation below[4,11].

\(\begin{aligned} \frac{6}{\pi^{2}} \iint_{\Omega} \rho\left(\frac{\log |F(x, y)|+\alpha(F)}{\log B_{1}}\right) \\ \rho\left(\frac{\log |G(x, y)|+\alpha(G)}{\log B_{2}}\right) d x d y \end{aligned}\) .

Above measurement is called ‘Murphy E’ of polynomials. Since collecting as many relation as possible is goal in sieving stage, larger Murphy E implies that it is more likely to have large number of sieving reports in sieving stage. Hence we focus on selecting f(x), g(x) with larger Murphy E value.

III. Polynomial Optimization

Through polynomial generation described in Section 2.2, we can obtain polynomial whose size of first three leading coefficients are small. We namely call this output f(x) of polynomial generation stage as ‘raw polynomial.’ However to have better performance in sieving stage, optimization of raw polynomial is necessary.

In this section, two optimization stages for better lognorm and Murphy E, are described. In Section 3.1, size optimization for smaller lognorm is described. In Section 3.2, root optimization for larger Murphy E is described. Polynomial optimization takes 90% of total time of polynomial generation stage, and root optimization for better Murphy E values takes 90% of total time in polynomial optimization.

3.1 Size Optimization

The goal of size optimization is to obtain polynomial f(x) with smaller lognorm. Note that for \(f(x)=\sum_{i=0}^{d} c_{i} x^{i}\). size of first three leading coefficient c_d, c_d–₁, and c_d–₂are controlled during polynomial generation. However other coefficients are left uncontrolled. Thus size optimization focuses on controlling c_d–3 through translation and other coefficients through rotation.

We call computing f(x + k) and g(x + k) as translation. To reduce size of c_d–₃ effectively, right choice of k is needed. Let \(f(x+k)=\sum_{i=0}^{d} c_{i}(x+k)^{i}=\sum_{i=0}^{d} a_{i} x^{i}\) . We want a_d–₃ to be small after translation. Note that \(a_{d-3}=\left(\begin{array}{l} d \\ 3 \end{array}\right) c_{d} k^{3}+\left(\begin{array}{l} d \\ 2 \end{array}\right) c_{d-1} k^{2}+d c_{d-2} k+c_{d-3}\)and consider a_d-3 as function of k. Then we may write a_d-3 as \(h(k)=a_{d-3}=\left(\begin{array}{l} d \\ 3 \end{array}\right) c_{d} k^{3}+\left(\begin{array}{l} d \\ 2 \end{array}\right) c_{d-1} k^{2}+d c_{d-2} k+c_{d-3}\). Let a be nearest integer of k where h(k) = 0. Then translating f(x) by a, we can expect to have small a_d–₃. Since degree of h(k) is 3, there are at most 3 values of a for translation. We calculate f(x + a) for each a and check its lognorm. If lognorm of f(x + a) is smaller than f(x), we replace f(x) by f(x + a). If not, f(x) is left unchanged and we continue to next optimization stage.

After translation, we rotate f(x) and obtain f(x) + (wx² + ux + v)g(x) to reduce size of last three coefficients. As stated above, we use quadratic rotation in order to preserve the size properties of translated polynomial. Hence after each rotation, lognorm is calculated to check polynomial’s size property. If lognorm of rotated polynomial is larger than original, f(x) remain unchanged and move to root optimization stage.

As a summary, total procedure of size optimization is described in Algorithm 1[4]. Note that L²(f) indicates lognorm of f(x).

Algorithm 1

Input : Polynomial pair \(f(x)=\sum_{i=0}^{d} c_{i} x^{i}\),

g(x) = m₂x – m₁

Integers U, V, B

Output : polynomial pair f(x), g(x) of smaller lognorm.

1. k = w = u = v = 1;

2. while minimum is found or loop limit is reached do

2.1 f'(x) = f(x ± k), g'(x) =g(x) ± km₂

2.2 if either L²(f') < L²(f) then

2.2.1 f = f', g = g', k = 2k

2.3 else

2.3.1 k = [ k/2 ]

2.4 f'(x) = f(x) ± wx²g(x)

2.5 if either L²(f') < L²(f) then

2.5.1 f = f', w = 2w

2.6 else

2.6.1 w = [w/2]

3. returen f(x), g(x)

3.2 Root Optimization

In order to have good root property in terms of α-values, root optimization is performed after adjusting polynomial size through size optimization. Good root property roughly requires polynomial to have many roots modulo small prime and prime powers. Total outline of root optimization procedure is as follows. We rotate f(x) and check root property of f(x) + h(x)g(x). That is, we record roots modulo small prime and prime powers of f(x) + h(x)g(x). After rotation has finished for all possible h(x) we replace f(x) to one of f(x) + h(x)g(x) having highest Murphy E values among all others. We consider quadratic rotation in order to preserve size property obtained by size optimization. Thus h(x) can be written as h(x) = wx² + ux + v, and we search for (w, u, v) such that rotated polynomial f(x) + (wv² + ux + v)g(x) has good root properties. Since linear rotation is faster than quadratic rotation, we first search for w, sets f(x) = f(x) + wx²g(x) and find (u, v) such that f_u,v = f(x) + (ux + v)g(x) has good root property by linear rotation. Intuitive approach for root optimization is to check α-value of f_u,v for all possible (u, v). However, since bound of u, v are huge, this is time consuming process. Therefore, Murphy uses sieve-like procedure to find polynomial with good root properties[11]. Idea is that when r is root of f_u,v(x) ≡ 0modp^e , f_{u+ip^e,v+jp^e} (x) ≡ f_u,v(x) modp^e so that r is also root of f_{u+ip^e,v+jp^e} (x) ≡ 0modp^e for i, j ∈ Z.

Let B be bound for small primes and U, V be bound for u and v respectively. Algorithm 2 describes Murphy’s root sieve which is used as base method for CADO-NFS’s root optimization[4].

Algorithm 2

Input : Polynominal pair f(x), g(x), Integers U, V, B

Output : array of approximated α-values of dimension U×V

1. For p ≤ B do

1.1 For e where p^e ≤ B do

1.1.1 For x ∈ [0, p^e-1] do

1.1.1.1 For u ∈ [0, p^e-1] do

1.1.1.1.1

compute v in f_u,v(x) ≡ 0modp^e

1.1.1.1.2

update v_p(f_{u+ip^e,v+jp^e}) by sieving

3.3 Application in CADO-NFS

In Section 3.2, Algorithm 2 works for small bounds U, V. However, for polynomial of degree greater than 6, permissible U, V bounds are large. Hence CADO-NFS uses modification of Algorithm 2 for faster root sieve[4]. The core idea is that if r is simple root of f(x) ≡ 0modp, then p-valuation can be easily estimated so that there is no need to count the lifted roots. We determine simplicity of roots by using Hensel’s lemma. We call root r is simple root modulo p if f′(r) ≠ 0modp and multiple root modulo p if f′(x) ≡ 0modp.

Recall that if r_e is a simple root of rotated polynomial f_u,v(x)modp^e for e ≥ 1 then r_e is also simple root for f_{u+ip^e,v+jp^e}modp^e. Since the contribution of the root r_e to v_p(F_u,v) is p/(p²–1), we can update the score for all rotated polynomials of the form f_u+ip,v+jp (x) in sieve.

If r_e is a multiple root of f(x)modp^e and f(r_e) ≡ 0modp^e+1 then r_e can lifted to p number of roots modulo p^e+1 such that r_e+1 = r_e + lp^e for all integer l ∈ [0, p). Additionally, the lifted roots r_e+1are also multiple since f'(r_e+1) ≡ 0modp. Whereas if f(r_e)≠ 0modp^e+1 , r_e cannot be lifted to a root modulo p^e+1. Note that in order for r to be multiple root modulo p for some rotated polynomial f_u,v(x)modp, it should satisfy equation f(r) + (ur + v)g(r) ≡ 0modp and f'(r) + ug(r) (ur + v)g'(r) ≡ 0modp. Since (ur + v) ≡ –f(r)/g(r) modp , we can obtain following equation.

ug²(r) ≡ f(r)g'(r) – f'(r)g(r)modp (1)

Hence if r is root of f(x) modulo p and if

ug²(r) – f(r)g'(r) + f'(r)g(r) (2)

is divisible by p, then r is multiple root of f(x) modulo p. If r is multiple root of f_u,v(x)modp, lifting is needed for counting the lifted roots.

CADO-NFS divides root optimization into two stages. In the first stage, it searches for pair (u₀, v₀) mod(\(p_1^{e_1}\)... \(p_m^{e_m}\)) such that \(f_{u_{0}, v_{0}}(x)\) has many roots modulo prime powers for first m smallest primes. In second stage, it applies root sieve to \(f_{u_{0}, v_{0}(x)}\) for larger prime powers up to some bound B.

Naive way for selecting (u₀, v₀) for given f(x) in first stage is to generate matrix of pairs of (u, v) with size \(\left(\prod_{i=1}^{m} p_{i}^{e_{i}}\right)^{2}\) and chose one (u, v) that best generates polynomial with good root property. This approach is possible only if size of matrix is small. In CADO-NFS, m individual polynomials \(f_{u_{i}, v_{i}, p_{i}}(x)\) (1 ≤ i ≤ m) is founded for each prime p_i that has many roots modulo small \(p_i^{e_i}\). Then by Chinese Reminder Theorem(CRT) we calculate (u₀, v₀) mod(\(p_1^{e_1}\)... \(p_m^{e_m}\)) to obtain combined polynominal \(f_{u_{0}, v_{0}}(x)\). Note that \(f_{u_{0}, v_{0}}(x) \bmod p_{i}^{e_{i}}\) has same number of roots as \(f_{u_{i}, v_{i}, p_{i}}(x) \bmod p_{i}^{e_{i}}\) for 1 ≤ i ≤ m. Since we selected (u_i, v_i) such that \(f_{u_{i}, v_{i}}(x) \bmod p_{i}^{e_{i}}\) to have many roots, combined polynomial \(f_{u_{0}, v_{0}}(x)\) is also likely to have many roots modulo small prime powers of \(p_1^{e_1}\)... \(p_m^{e_m}\).

Second stage of root sieve is analogous to first stage except for the fact that it uses larger primes. Let \(M=\prod_{i=1}^{m} p_{i}^{e_{i}}\) and (u₀, v₀) be fixed as in first stage. Recall that we used sub-lattice defined by (u, v) in first stage. In second stage, we do root sieve for larger prime powers on sub-lattice defined by (u₀ + γM,v₀+ βM).

As a summary, modification of Algorithm 2 for faster root sieve is described in Algorithm 3[4].

Algorithm 3

Input : Polynominal pair f(x), g(x), Integers U, V, B

Output : an array of murphy E of dimension U× V

1. For p ≤ B do

1.1 For x ∈ [0, p – 1] do

1.1.1 compute \(\tilde{u}\) such that

\(\tilde{u} g^{2}(x) \equiv f(x) g^{\prime}(x)-f^{\prime}(x) g(x) \bmod p\)

1.2. For u ∈ [0, p – 1] do

1.2.1 comute v such that

f(x) + uxg(x) + vg(x) ≡ 0modp

1.2.2. if \(u \neq \tilde{u}\)

1.2.2.1 then update v_p(f_u+ip,v+jp) in sieving

1.2.3 else

1.2.3. lift to count multiple roots of \(f_{\tilde{u}, \tilde{v}}(x) \bmod p^{e}\) such that

\((\tilde{u}, \tilde{v}) \equiv(u, v) \bmod p\) , \(\tilde{u}, \tilde{v} \leq p^{e}\),

p^e ≤ B and then sieve ;

IV. Proposed Method

In this Section, we proposed our modification for faster root sieve. We implemented precomputation table to reduce the amount of redundant calculation. We also modified searching procedure at beginning of second stage of root sieve and reduced number of sub-lattice for faster root sieve.

4.1 Table Implementation

Since root optimization in CADO-NFS has redundant calculation of values, one way to optimize CADO-NFS’s root optimization procedure is to reduce the number of such calculation. Note that for each prime p, and for each 0 ≤ u < p and for each 0 ≤ r < p , equation (2) is calculated in order to determine whether root is simple or not. Note that when p = 2, equation (2) is calculated for (u, r) = (1, 1). But when p = 3, equation (2) is again calculated for (u, r) = (1, 1). Hence f(r) for 0 ≤ r < p is calculated repeatedly for some value.

Thus instead of calculating equation (2) for each u and r, we precompute

{ug(r)² – f(r)g'(r) + f'(r)'g(r)}modp (3)

and record this value along with u, r, p. In this way, we can look up for value in table when needed instead of calculating every time. Note that since prime p is smaller than 199 in practice, we can make size of each entry of precomputation table to be 8 bits.

When generating precomputation table, order of computation for calculating ug(r)² is modified. In CADO-NFS, it fixes value u and calculates ug(r)² for 0 ≤ r < p . For example, when u = 2, it calculates g(0)² × 2, g(1)² × 2 , and so on. Instead, we fix value r and calculate ug(r)². For example, when r = 0, we calculate g(0)²× 2, g(0)²× 3, and so on. By modifying the order of calculation, multiplication is not needed – when calculating g(0)²× u we add g(0)² instead. Hence if we precalculate g(r)² for 0 ≤ r < 199 , we can generate ug(r)² only by addition. Since addition is faster than multiplication, we can expect shorter computation time.

Comparison of computation between CADO-NFS and table implementation is shown as Table 1.

Table 1. Comparison of calculation of ug(r)² – f(r)g'(r) + f'(r)'g(r)

Number of computation is examined based on RSA-768 number with leading coefficient of f(x) as 265482057982680[6]. Total number of calculation refers to the number of multiplication needed for calculating equation (2) during root optimization period. Since f(r)g'(r) and f'(r)g(r) are precalculated, we only need to consider 2 multiplications when computing equation (2). Specifically, we consider 2 multiplications needed for ug(r)² – one for computing g(r) × g(r) and one for computing u × g(r)². Total number of division refers to the number of division for checking existence of g(r)^–1modp and computation of equation (3) that checks whether computed root is simple root or multiple root.

In summary, total 2 multiplications and 2 divisions occurs for one round of root sieve in root optimization. Since there exist average 10,000 rounds of root sieve per one pair (p₁, p₂) of primes, this table implementation can lead to significant decrease in number of computation.

4.2 Modification of Root Sieve Process

In first stage of root sieve in CADO-NFS, it searches for top 3 w values and records (w, u, v) along with α-values in array such that

f(x) + (wx² + ux + v)g(x)

has good root property for first 10 smallest primes. In beginning of second stage of root sieve, it extracts (w, u, v) pairs recorded in array and rotate f(x) to generate f(x) + (wx² + ux + v)g(x). Next it tests root property of rotated polynomial for first 45 primes and reorder (w, u, v) by α-value. After this step, it focuses on searching good (u, v) for larger prime that has good root property.

However, at beginning of second stage, testing root properties with larger primes does not change any (w, u, v) of recorded pairs. In other words, elements in input array is same as output array except for recorded order. Fig. 1 and Fig. 2 depicts the recorded (u, v) pairs in end of first stage and after first step of second stage, respectively.

Fig. 1. Recorded (u, v) pair after first stage

Fig. 2. Recorded (u, v) pair after first step of second stage

In Fig. 1 and Fig. 2, x-axis indicates value of u and y-axis indicates value of v. As shown in Fig. 1 and Fig. 2, no extra (u, v) was recorded to array nor extra (u, v) was deleted in array. This result is natural since expected p-valuation

decreases as prime increases. Hence effect on x-value of polynomial due to inclusion of more primes can be negligible. Therefore, it is possible to delete beginning of second stage and immediately move to step where deep search on (u, v) for larger prime is performed.

As a summary, total procedure of modification of root optimization algorithm is described in Algorithm 4.

Algorithm 4

Input : Polynominal pair f(x), g(x), Integers U, V, B

Array of dimension :

Output : an array of murphy E of dimension U×V

1. Precomputation

1.1 For p ≤ B do

1.1.1 For x ∈ [0, p – 1] do

1.1.1.1 For u ∈ [0, p – 1] do

1.1.1.1.1 Comute

H = ug²(x) – f(x)g'(x) + f'(x)g(x)modp

1.1.1.1.1 Save H in array A[x][u][p]

2. For p ≤ B do

2.1 For x ∈ [0, p – 1] do

2.1.2. For u ∈ [0, p – 1] do

2.1.2.1. comute v such that

f(x) + uxg(x) + vg(x) ≡ 0modp

2.1.2.2. if A[x][u][p] = 0

2.1.2.2.1 update v_p(fu+ip, v+jp) in sieving

2.1.2.3 else

2.1.2.3.1 lift to count multiple roots of \(f_{\tilde{u}, \tilde{v}}(x) \bmod p^{e}\) such that

\((\tilde{u}, \tilde{v}) \equiv(u, v) \bmod p\) , \(\tilde{u}, \tilde{v} \leq p^{e}\).

p^e ≤ B and then sieve ;

V. Experimental Result

Experiments were performed using gcc version 4.9.2 with Intel Core i5-4690K processor at 3.5GHz with 8GB RAM. We used CADO-NFS 2.1.1 version for size optimization.

Table 2 is the experimental result for average time took for 6 number of root optimizations. Experiment was done with RSA-768 number with 265482057982680 as leading coefficient of degree 6 polynomial f(x)[6]. We used P=1000000 so that p₁, p₂ ∈ [P,2P], where p₁, p₂ are prime factor of leading coefficient of g(x).

Table 2. Comparison of Average Time on Root Optimization.

In Table 2, total time for polynomial selection refers to time it takes to search polynomial for one leading coefficient of f(x). This includes polynomial generation along with size and root optimization. With our selection of input parameters, CADO-NFS found 6 collision pairs (p₁, p₂) so that 6 number of root optimization occurred. Hence average time in Table 2 refers to average time for one polynomial selection. As stated in Table 1, due to our modification, we select same polynomial as in CADO-NFS with 41.5% decrease in total time for polynomial selection.

Table 3 denotes the experiment results for average time took for 24 number of root optimizations. Experiment was done with RSA-768 number with f(x) of degree 6. In Table 3, ad range refers to the range of leading coefficient of f(x). We used P = 10000.

Table 3. Comparison of Average Time on Root Optimization.

As in Table 3, our modified version selects same polynomial as in CADO-NFS. Furthermore, it optimizes polynomial in 55.94% decrease in optimization. This decrease in total time for polynomial selection.

VI. Conclusion

In this paper, we proposed table implementation of root optimization in CADO-NFS. When generating precomputation table, we modified the order of computation for faster calculation. As a result, we calculated values using addition whereas CADO-NFS used multiplication to obtain same value.

Additionally, we reduce size of sub-lattice for faster root sieve. By experiment, reducing number of sub-lattice does not effect the quality of output polynomial. In this way, we can generate polynomial with Murphy E value same as CADO-NFS but in shorter time.

Note that since Murphy E value greatly relates to number of expected relation in sieving stage, simply reducing time in polynomial selection without considering output polynomial’s Murphy E value is useless. However, as in Section 5, our modified version selects same polynomial as in CADO-NFS but in shorter time. Thus we expect to search more polynomials than CADO-NFS in same amount of time and hence have higher probability of selecting better polynomial.

* 이 논문은 2014년도 정부(미래창조과학부)의 재원으로 한국연구재단-차세대 정보x컴퓨팅기술개발사업의 지원을 받아 수 행된 연구입니다(No. NRF-2014M3C4A7030649)

References

R. Rivest, A. Shamir, L. Adleman, "A Method for Obtaining Digital Signature and Public-Key Cryptosystems," ACM, vol.21(2), pp.120-126, 1978. https://doi.org/10.1145/359340.359342
T. Kleinjung. "Polynomial selection," In CADO workshop on integer factorization, INRIA Nancy, http://cado.gforge.inria.fr/workshop/slides/kleinjung.pdf. 2008.
T. Kleinjung. "On polynomial selection for the general number field sieve," Mathematics of Computation, pp. 2037-2047, 2006.
S. Bai "Polynomial Selection for the Number Field Sieve," Ph.D. Thesis ,The Australian National University, 2011.
S. Bai, E. Thom'e, P. Zimmermann. Factorisation of RSA-704 with CADO-NFS. Report, http://eprint.iacr .org/2012/369.pdf.
T. Kleinjung, K. Aoki, J. Franke, A. K. Lenstra, E. Thom'e, J. W. Bos, P. Gaudry, A. Kruppa,P. L. Montgomery, D. A. Osvik, H. J. J. te Riele, A. Timofeev, and P. Zimmermann. "Factorization of a 768-bit RSA modulus," CRYPTO '10, vol.6223 LNCS, pp 333-350, 2010.
A. K. Lenstra and H. W. Lenstra, Jr., editors. "The Development of the Number Field Sieve," vol. 1554 of Lecture Notes in Mathematics. Springer, 1993.
Matthew E. Briggs "An Introduction to the General Number Field Sieve," Master Thesis. Virginia Polytechnic Institute and State University. April, 1998.
P. Gaudry, A. Kruppa, et al. "CADO-NFS," http://cado-nfs.gforge.inria.fr
B. A. Murphy, R. P. Brent, "On Quadratic Polynomials for the Number Field Sieve," CATS'98, pp 199-231, 1998.
B. A. Murphy, "Polynomial Selection for the Number Field Sieve Integer Factorization Algorithm," Ph.D. Thesis, The Australian National University, 1999.

Journal of the Korea Institute of Information Security & Cryptology (정보보호학회논문지)

Efficient Optimization Method for Polynomial Selection

다항식 선택을 위한 효율적인 최적화 기법

Abstract

Keywords

I. Introduction

II. Background

2.1 General Number Field Sieve

2.2 Classical Polynomial Selection

2.3 Quantifying Quality of Polynomials

III. Polynomial Optimization

3.1 Size Optimization

3.2 Root Optimization

3.3 Application in CADO-NFS

IV. Proposed Method

4.1 Table Implementation

4.2 Modification of Root Sieve Process

V. Experimental Result

VI. Conclusion

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)