Decidability of extensions of Presburger arithmetic by generalised polynomials


Abstract

We show that the extension of Presburger arithmetic by a quadratic generalised polynomial of a specific form is undecidable.

1 Introduction↩︎

1.1 Background↩︎

Presburger arithmetic, that is, the first-order theory of natural numbers with addition \(\mathrm{Th}(\mathbb{N};+)\), is well-known to be decidable. This is in contrast with Peano arithmetic, which also includes multiplication and is well-known to be undecidable. It is not hard to see that if we extend Presburger arithmetic by adding the square function, then we obtain Peano arithmetic; indeed, it is enough to notice that \(a \times b = c\) if and only if \((a + b)^2 - a^2 - b^2 = 2c\) (for a survey of relations between different extensions of Presburger arithmetic, see e.g.[1]). On the other hand, the extension of Presburger arithmetic by an exponential function is decidable [2], [3]. We are thus led to ask: Which functions can we add to Presburger arithmetic and still obtain a decidable theory? For the sake of the discussion below, we point out that the first-order theory of integers with addition and order \(\mathrm{Th}(\mathbb{Z};<,+)\) is definitionally equivalent with Presburger arithmetic; we will consider extensions of either \(\mathrm{Th}(\mathbb{N};+)\) or \(\mathrm{Th}(\mathbb{Z};<,+)\) depending on which is more natural in the given context.

Results concerning decidability of extensions of Presburger arithmetic connected with the multiplicative structure are surveyed in [4]. A wide class of decidable extensions of Presburger arithmetic was investigated by Semenov [2], [5]. He showed that \(\mathrm{Th}(\mathbb{N};+,f)\) is decidable for each \(f \colon \mathbb{N}\to \mathbb{N}\) which fulfils the condition of effective compatibility with addition, which asserts, roughly speaking, that \(f\) increases rapidly enough and is periodic modulo any integer. An alternative proof of this result was obtained by Cherlin and Point [3]. In particular, letting \(E_k\) denote the exponential function \(E_k(n) = k^n\), this result implies that \(\mathrm{Th}(\mathbb{N};+,E_k)\) is decidable. Similarly, letting \(F\) denote the factorial \(F(n) = n!\), we have that \(\mathrm{Th}(\mathbb{N};+,F)\) is decidable.

A \(k\)-automatic sequence is a sequence \(f \colon \mathbb{N}\to \Sigma\), taking values in a finite alphabet \(\Sigma\), such that \(f(n)\) can be computed by a finite automaton given the base-\(k\) expansion of \(n\) as input. For more background on automatic sequences, see [6]. It was shown by Büchi [7] (with later corrections, for further discussion see e.g.[8], [9]) that for each \(k \geq 2\) and each \(k\)-automatic sequence \(f \colon \mathbb{N}\to \Sigma\) the first-order theory \(\mathrm{Th}(\mathbb{N};+,f)\) is decidable1. In fact, it is known that \(\mathrm{Th}(\mathbb{N};+,V_k)\) is decidable and \(k\)-automatic sequences are precisely the sequences whose level sets are definable in \((\mathbb{N};+,V_k)\). Here, \(V_k \colon \mathbb{N}\to \mathbb{N}\) denotes the map which takes \(k^i n\) to \(k^i\), where \(i \in \mathbb{N}\), \(n \in \mathbb{N}\) and \(k \nmid n\). For further discussion of extensions of Presburger arithmetic related to digital expansions, we refer to the survey paper [8] (cf.[9]) and to more recent publications such as [10] and [11]. More exotic numeration systems are also studied for instance in [12][14] and [15].

A Sturmian sequence is a sequence \(s_{\alpha,\rho} \colon \mathbb{N}\to \{0,1\}\) given by \[\begin{align} \label{eq:intro:def-sturm} s_{\alpha,\rho}(n) &= \left\lfloor \alpha(n+1) + \rho \right\rfloor - \left\lfloor \alpha n + \rho \right\rfloor , & n \in \mathbb{N}, \end{align}\tag{1}\] where \(\alpha\in (0,1) \setminus \mathbb{Q}\) and \(\rho \in [0,1)\). The first-order theory \(\mathrm{Th}(\mathbb{N};+,s_{\alpha,0})\) is decidable if \(\alpha\) is a quadratic irrational and undecidable if the continued fraction expansion is not computable [16]. A more comprehensive treatment of Sturmian sequences was carried out in [17]. The authors prove, among other things, that the first-order theory \(\mathrm{Th}( \left\{ (\mathbb{N};+,s_{\alpha,\rho}) \ \middle| \ \alpha\in (0,1) \setminus \mathbb{Q},\;\rho \in [0,1) \right\} )\) is decidable2.

As already alluded to earlier, for each polynomial sequence \(f \colon \mathbb{Z}\to \mathbb{Z}\) of degree at least \(2\), multiplication is definable in the language of \((\mathbb{Z};+,f)\). It follows that \(\mathrm{Th}(\mathbb{Z};+,f)\) is undecidable, and so is \(\mathrm{Th}(\mathbb{Z};<,+,f)\).

1.2 New results↩︎

In this paper, we are interested in generalised polynomials, that is, expressions built up from the usual real polynomials with the use of the integer part function \(\left\lfloor x \right\rfloor\), addition and multiplication. For instance, the map \(f \colon \mathbb{Z}\to \mathbb{Z}\) given by \(f(n) = \left\lfloor \sqrt{2} n^2 \left\lfloor \sqrt{3}n \right\rfloor \right\rfloor + \left\lfloor \pi n \right\rfloor \left\lfloor n^3/10 \right\rfloor\) is a generalised polynomial. Generalised polynomials can also be thought of as a far-reaching generalisation of the notion of a Sturmian sequence. These sequences have been extensively studied, see e.g.[18][20] and references therein.

In analogy with polynomials, we expect that the extension of Presburger arithmetic by a given generalised polynomial with at least quadratic rate of growth should be undecidable.

Conjecture 1. Let \(g \colon \mathbb{Z}\to \mathbb{Z}\) be a generalised polynomial such that \[\begin{align} \label{eq:conj:g6262n942} \liminf_{n \to \infty} \left|g(n)\right|/n^2 > 0. \end{align}\tag{2}\] Then the first-order theory of \((\mathbb{Z};<,+,g)\) is undecidable.

As a first step in this direction, we verify this expectation for a specific generalised polynomial with quadratic growth. We believe that similar techniques should work with many other explicit examples, but a fully general result remains elusive. Below, for technical reasons, it will be more convenient to work with the nearest integer operation \(\left\lfloor \cdot \right\rceil\), given by \(\left\lfloor x \right\rceil = \left\lfloor x+1/2 \right\rfloor\), rather than with \(\left\lfloor \cdot \right\rfloor\).

Theorem 1. Let \(\alpha,\beta\in \mathbb{R}\), \(\beta\neq 0\), be such that \(1,\alpha,\alpha^2\) are linearly independent over \(\mathbb{Q}\) and let \(g \colon \mathbb{Z}\to \mathbb{Z}\) be the generalised polynomial given by \[\begin{align} \label{eq:def-g=n[an]} && g(n) &= \left\lfloor \beta n \left\lfloor \alpha n \right\rceil \right\rceil & (n \in \mathbb{Z}). \end{align}\qquad{(1)}\] Then the first-order theory of \((\mathbb{Z};<,+,g)\) is undecidable.

In fact, in the situation of Theorem 1, the relation \(>\) is definable in \((\mathbb{Z};<,+,g)\). The main ingredient needed here is a result essentially due to Vicky Neale [21] asserting that there exists \(s \geq 1\) such that each sufficiently large integer \(m\) can be represented as \(m = n_1 \left\lfloor \alpha n_1 \right\rceil + n_2 \left\lfloor \alpha n_2 \right\rceil + \dots + n_s \left\lfloor \alpha n_s \right\rceil\). Since \(\left| g(n) - \beta n \left\lfloor \alpha n \right\rceil \right| \leq 1/2\), it follows that each sufficiently large integer \(m\) can be represented as \(m = g(n_1) + g(n_2) + \dots + g(n_s) + r\) where \(r\) is bounded (in fact, we can roughly estimate \(0 \leq r < r_0 := \left|\beta\right| + s\)). Since \(g(n) \geq 0\) for all \(n \in \mathbb{Z}\), we see for all but finitely many \(m \in \mathbb{Z}\) we have \(m > 0\) if and only if there exist \(n_1,n_2,\dots,n_s \in \mathbb{Z}\) such that for some \(0 \leq r < r_0\) we have \(m = g(n_1) + g(n_2) + \dots + g(n_s) + r\). Thus, Theorem 1 has the following immediate corollary.

Corollary 1. Under the same assumption as in Theorem 1, the first-order theory of \((\mathbb{Z};+,1,g)\) is undecidable.

One can also construct non-trivial examples of bounded generalised polynomials. For instance, is not hard to verify that the sequence \(g \colon \mathbb{Z}\to \{0,1\}\) given by \[\begin{align} \label{eq:97:def-g} g(n) = \begin{cases} 1 & \text{if } \left\lVert n^2 \alpha \right\rVert_{\mathbb{R}/\mathbb{Z}} < \rho,\\ 0 & \text{otherwise,} \end{cases} & n \in \mathbb{Z}, \end{align}\tag{3}\] is a generalised polynomial for any \(\alpha,\rho > 0\). The sequence \(g\) given by 3 can be thought of as a quadratic analogue of a Sturmian sequence. For sequences of this form, we also have an undecidability result.

Theorem 2. Let \(\alpha,\rho \in \mathbb{R}\), be such that \(0 < \rho < 1/4\) and \(\alpha\) is not a linear combination of \(1\) and \(\rho\) with rational coefficients, and let \(g \colon \mathbb{Z}\to \mathbb{Z}\) be the generalised polynomial given by 3 . Then the first-order theory of \((\mathbb{Z};<,+,g)\) is undecidable.

1.3 Proof outline↩︎

A key idea behind the proof of Theorem 1 is that the generalised polynomial in question (or indeed any generalised polynomial) coincides with a polynomial on suitably constructed arbitrarily long arithmetic progressions. Indeed, if we fix an integer \(L\) and take \(m \in \mathbb{N}\) such that \(\left\lVert \alpha m \right\rVert_{\mathbb{R}/\mathbb{Z}},\left\lVert \beta m \right\rVert_{\mathbb{R}/\mathbb{Z}}\) are sufficiently small as a function of \(L\) then we can ensure that for \(0 \leq l < L\) we have \[g(l m) = l^2 g(m),\] meaning in particular that the restriction of \(g\) to the arithmetic progression \[P =\{0,m,2m,\dots,(L-1)m\}\] is a quadratic polynomial. Applying the standard ideas used to define multiplication in terms of the square function, we can then describe triples \(a,b,c \in P\) such that \(a = km\), \(b = l m\) and \(c = kl m\). The operation \((m,km,lm) \mapsto klm\) can be thought of as a weak substitute of multiplication. In Section 2 we show that the first-order theory of integers equipped with addition and such “weak multiplication” is already undecidable. In Section 3 we flesh out the details of the argument, which in particular involves defining a suitably large family of arithmetic progressions in the first-order language of \((\mathbb{Z};<,+,g)\).

The proof of Theorem 2 is shorter and conceptually simpler. The key idea is that, using equidistribution results for (generalised) polynomial sequences, we are able to define properties which are, roughly speaking, equivalent to the statements that fractional parts of various expressions are sufficiently small. This allows us to define divisibility, which implies undecidability thanks to a classical result of Robinson [22].

1.4 Future directions↩︎

Our main result, Theorem 1, can no doubt be generalised, and is intended largely as a proof of concept. The assumption that \(1,\alpha,\alpha^2\) are linearly independent is necessary in order to obtain a suitable equidistribution result in Lemma 5 and is most likely not necessary for the result to be true. Below we discuss several other potential future directions.

1.4.1 Slower growth↩︎

One could consider a stronger variant of Conjecture 1 where \(\liminf\) is replaced with \(\limsup\) in 2 . The following example shows that this conjecture is false.

Proposition 1. Let \(g \colon \mathbb{Z}\to \mathbb{Z}\) be given by \[\begin{align} g(n) = \begin{cases} n^2 &\text{if } n = k^{k^m} \text{ for some } m \in \mathbb{N};\\ 0 &\text{otherwise}. \end{cases} \end{align}\] Then \(g\) is a generalised polynomial and \(\mathrm{Th}(\mathbb{Z};<,+,g)\) is decidable.

Proof. It follows from [23] that \(g\) is a generalised polynomial. Letting \(E_k^+\) denote the “exponential” map given by \(E_k^+(m) := k^{\left|m\right|}\), we see that \(g(n) = \ell\) if and only if there exists \(m \in \mathbb{Z}\) such that \(n = E_k^+(E_k^+(m))\) and \(\ell = E_k^+(2E_k^+(m))\). Thus, \(g\) is definable in \((\mathbb{Z};+,E_k^+)\). It remains to note that \(\mathrm{Th}(\mathbb{Z};<,+,E_k^+)\) is decidable because it is definitionally equivalent with \(\mathrm{Th}(\mathbb{N};+,E_k)\), which we have already noted is decidable. ◻

Despite examples such as the one above, it seems plausible that the condition 2 can be weakened to a requirement that \(g\) grows at least quadratically on a significant portion of the integers. For instance, one could require that there exists \(\varepsilon> 0\) such that for each sufficiently large \(N\) there are at least \(\varepsilon N\) integers \(n\) with \(\left|n\right| < N\) and \(\left|g(n)\right| > \varepsilon n^2\).

1.4.2 Hardy field sequences↩︎

Instead of generalised polynomials, one can consider other classes of integer-valued sequences whose behaviour resembles polynomials in some useful sense. A natural class of examples is provided by Hardy field sequences, whose relevant properties are discussed e.g.in [24] or [25]. The definition of a Hardy field function is somewhat technical, but for now suffice it to say that these functions include all logarithmic-exponential functions, i.e., functions defined on \(\mathbb{R}_+\) which can be expressed using the operations \(+,\times,\exp,\log\) and real constants. Thus, for instance, \(x \log x\), \(x^{3/2}\) and \(x^x\) are Hardy field functions of \(x\).

If \(f \colon \mathbb{R}_+ \to \mathbb{R}\) is a Hardy field function such that for some \(d \in \mathbb{N}\) we have \[\begin{align} \label{eq:intro:Hardy-growth} \lim_{x \to \infty} \frac{f(x)}{x^d} & > 0, & \lim_{x \to \infty} \frac{f(x)}{x^{d+1}} & = 0, \end{align}\tag{4}\] (including the case where the first limit is \(\infty\)), then on long intervals \(f\) can be closely approximated by a degree-\(d\) polynomial. Combining this fact with the ideas used in the proof of Theorem 1, it seems plausible that one could show that for a Hardy field sequence \(f \colon \mathbb{R}_+ \to \mathbb{R}\) satisfying 4 with \(d \geq 2\), the first-order theory of \((\mathbb{Z};<,+,\left\lfloor f \right\rceil)\) is undecidable. (Here, \(\left\lfloor f \right\rceil\) denotes the sequence given by \(\left\lfloor f \right\rceil(n) = \left\lfloor f(n) \right\rceil\).) A potentially more interesting question concerns the case where \(d = 1\):

Question 1. Let \(f \colon \mathbb{R}_+ \to \mathbb{R}\) be a Hardy field sequence satisfying \[\begin{align} \label{eq:intro:Hardy-growth-d611} \lim_{x \to \infty} \frac{f(x)}{x} & = \infty, & \lim_{x \to \infty} \frac{f(x)}{x^{2}} & = 0. \end{align}\tag{5}\] Is the first-order theory of \((\mathbb{Z};<,+,\left\lfloor f \right\rceil)\) decidable?

1.4.3 Sparse sequences↩︎

Going a step further than Theorem 2, we can find \(\{0,1\}\)-valued generalised polynomials \(g\) that are \(0\) almost everywhere but take the value \(1\) infinitely often (i.e., \(\frac{1}{N} \sum_{n=1}^N g(n) \to 0\) and \(\sum_{n=1}^N g(n) \to \infty\) as \(N \to \infty\)). In [23] we studied the properties of such generalised polynomials, which we dub sparse. We also constructed a number of examples. For instance, the indicator function of the Fibonacci numbers \(1_{\mathrm{Fib}}\) is a generalised polynomial (see [26] for the state of the art on sparse generalised polynomials arising from linear recurrence sequences). In this particular case, it is known that the first-order theory of \((\mathbb{N},+,1_{\mathrm{Fib}})\) is decidable (e.g.by [5]). It would be interesting to determine the extent to which this result extends to other sparse \(\{0,1\}\)-valued generalised polynomial sequences.

1.4.4 Ranges of sequences↩︎

Let \(f \colon \mathbb{Z}\to \mathbb{Z}\) be a polynomial of degree at least \(2\). We have already pointed out that multiplication is definable in the language of \((\mathbb{Z};<,+,f)\). In fact, it is a standard observation that somewhat more is true: Multiplication is definable in the language of \((\mathbb{Z};<,+,f(\mathbb{Z}))\), i.e., in Presburger arithmetic extended by the unary predicate corresponding to the image of \(f\). As a representative example, we note the for \(f(n) = n^3\) we have \(f(n+2)-2f(n+1)+f(n) = 6n+6\) so for sufficiently large \(n,m\) we have \(m = f(n)\) if and only if there exist \(m',m''\) such that \(m,m',m''\) are consecutive elements of \(f(\mathbb{Z})\) and \(m'' - 2m' + m = 6n+6\). Thus, \(f\) is definable in \((\mathbb{Z};<,+,f(\mathbb{Z}))\), which brings us back to the result mentioned before.

One is naturally lead to ask if analogous generalisations apply to our results concerning generalised polynomials. This seems highly plausible (in the case of generalised polynomials with at least quadratic growth). Indeed, for \(g\) given by ?? , applying a similar discrete differentiation argument as mentioned in the example above, one can define in \((\mathbb{Z};<,+,g(\mathbb{Z}))\) a map \(g'\) that is reminiscent of \(g\), and it seems likely that \(\mathrm{Th}(\mathbb{Z};<,+,g')\) should be undecidable. Unfortunately, thus obtained \(g'\) would be rather less elegant than \(g\), leading to some rather unwieldy computations which we do not pursue at this time.

Acknowledgements↩︎

The author is grateful to James Worrell, George Kenison, Andrew Scoones, Mahsa Shirmohammadi, and Bertrand Teguia for many helpful conversations. The author is supported by UKRI Fellowship EP/X033813/1. For the purpose of open access, the authors have applied a Creative Commons Attribution (CC BY) licence to any Author Accepted Manuscript version arising.

Notation↩︎

We let \(\mathbb{N}= \{1,2,\dots\}\) denote the set of positive integers and put \(\mathbb{N}= \mathbb{N}\cup \{0\}\). We also let \(\mathbb{Z}^* = \mathbb{Z}\setminus \{0\}\) denote the set of non-zero integers.

For \(x \in \mathbb{R}\), we let \(\left\lfloor x \right\rfloor = \max\left\{ n \in \mathbb{Z} \ \middle| \ n \leq x \right\}\) denote the integer part of \(x\) (also known as the floor of \(x\)), \(\left\lceil x \right\rceil = - \left\lceil -x \right\rceil\) the ceiling of \(x\), and \(\left\lfloor x \right\rceil = \left\lfloor x+1/2 \right\rfloor\) the best integer approximation. Similarly, we let \(\{x\} = x - \left\lfloor x \right\rfloor \in [0,1)\) and \(\left< x \right> = x - \left\lfloor x \right\rceil \in [-1/2,1/2)\). Finally, we let \(\left\lVert x \right\rVert_{\mathbb{R}/\mathbb{Z}} = \left|\left< x \right> \right| = \min\left\{ \left|x-n\right| \ \middle| \ n \in \mathbb{Z} \right\} \in [0,1/2]\) denote the circle norm of \(x\). We will primarily use the operations \(\left\lfloor \cdot \right\rceil\) and \(\left< \cdot \right>\) in order to avoid the practical difficulties resulting from the fact that the more commonly used operations \(\left\lfloor \cdot \right\rfloor\) and \(\{\cdot\}\) are discontinuous at \(0\).

2 Multiplication↩︎

In this section we show that the first-order theory of \((\mathbb{Z};+,1)\) extended by “poor man’s multiplication” is undecidable. Consider a set \(\mathcal{Q}\subseteq\mathbb{Z}^4\) satisfying the following properties:

  1. If \(\vec{x} \in \mathcal{Q}\), then there exist \(k,l,m \in \mathbb{Z}\) such that \(\vec{x} = (m,km,lm,klm)\).

  2. For each finite set \(F \subseteq\mathbb{Z}^2\) there exists \(m \in \mathbb{Z}^*\) such that for all \((k,l) \in F\) we have \((m,km,lm,klm) \in \mathcal{Q}\).

Proposition 1. For any set \(\mathcal{Q}\subseteq\mathbb{Z}^4\) satisfying [it:Q:A] and [it:Q:B], the first-order theory of \((\mathbb{Z};+,1,\mathcal{Q})\) is undecidable.

We can use \(\mathcal{Q}\) to define a family of partial binary operators \((\times_m)_{m \in \mathbb{Z}^*}\) which are specified by the rule \(a \times_m b = c\) if and only if \((m,a,b,c) \in \mathcal{Q}\). (Formally, we simply mean that we define a partial function from \(\mathbb{Z}^3\) to \(\mathbb{Z}\), but we find the operator notation more evocative.) Note that property [it:Q:A] ensures that for each \((m,a,b) \in \mathbb{Z}^* \times \mathbb{Z}^2\) there exists at most one \(c \in \mathbb{Z}\) such that \((m,a,b,c) \in \mathcal{Q}\), so this definition is well-posed. In fact, \(c\) is necessarily given by \(c = ab/m\), so \(a \times_m b = ab/m\) if \(ab/m \in \mathbb{Z}\) and \((m,a,b,ab/m) \in \mathcal{Q}\), and \(a \times_m b\) is undefined otherwise. We let \(\operatorname{dom}(\times_m)\) denote the domain of \(\times_m\), that is, the set of pairs \((a,b) \in \mathbb{Z}^*\) such that there exists \(c \in \mathbb{Z}^*\) with \((m,a,b,c) \in \mathcal{Q}\). An equivalent way of expressing property [it:Q:B] is that for each finite subset \(F\) of \(\mathbb{Z}^2\), the dilated copy \(m F\) of \(F\) is contained in the domain of \(\times_m\) for at least one \(m \in \mathbb{Z}^*\). We point out that the operation \(\times_m\) is commutative, associative and distributive in the sense that we have \[\begin{align} a \times_m b &= b \times_m a && (= ab/m),\\ (a \times_m b) \times_m c &= a \times_m (b \times_m c) && (=abc/m^2),\\ (a+b) \times_m c &= a \times_m c + b \times_m c && (=(a+b)c/m), \end{align}\] assuming that all terms are defined. However, we cannot rule out the possibility that e.g.\(a \times_m b\) is defined while \(b \times_m a\) is not.

For a term \(t\) in the language of \((\mathbb{Z}; 1,+,-,\times)\) with variables \(x_1,x_2,\dots,x_s\) and for \(m \in \mathbb{Z}^*\) we define \(t_m\) to be the expression obtained from \(t\) by replacing each instance of \(\times\) with \(\times_m\). For instance, if \[\begin{align} \label{eq:mult:34:01} t = (x_1+x_2) \times (x_2 + x_3 + x_3) + (x_1 \times x_1) \times x_3 + 2, \end{align}\tag{6}\] (using the shorthands \(x + y + z = (x+y)+z\) and \(2 = 1+1\)), then \(t_m\) is given by \[\begin{align} t_m = (x_1+x_2) \times_m (x_2 + x_3 + x_3) + (x_1 \times_m x_1) \times_m x_3 + 2. \end{align}\] For each polynomial \(p \in \mathbb{Z}[x_1,x_2,\dots,x_s]\), pick once and for all a term \(t^{(p)}\) which represents it. For instance, the term \(t\) given by 6 represents the polynomial \(p\) given by \[\begin{align} p(x_1,x_2,x_3) &= (x_1+x_2)(x_2+2x_3) + x_1^2 x_3 + 2 \\ &= x_1 x_2 + x_2^2 + 2 x_1 x_3 + 2 x_2 x_3 + x_1^2 x_3 + 2. \end{align}\] We then let \(p_m\) denote the partial function on \(\mathbb{Z}^s\) represented by \(t^{(p)}_m\).

Lemma 1. Let \(p \in \mathbb{Z}[x_1,x_2,\dots,x_s]\). Then there exists a finite family \(\mathcal{F}(p)\) of pairs of polynomials in \(\mathbb{Z}[x_1,x_2,\dots,x_s]\) such that for \(m \in \mathbb{Z}^*\) and \(n_1,n_2,\dots,n_s \in \mathbb{Z}\), the value \(p_m(mn_1,mn_2,\dots,mn_s)\) is defined provided that \(m q(n_1,n_2,\dots,n_s) \in \operatorname{dom}(\times_m)\) for all \(q \in \mathcal{F}(p)\), and is equal to \(m p(n_1,n_2,\dots,n_s)\) if defined.

Proof. We proceed by structural induction. If \(p(x_1,x_2,\dots,x_s) = x_i\) for some \(1 \leq i \leq s\), then the claim holds with \(\mathcal{F}(p) = \emptyset\), and likewise if \(p(x_1,x_2,\dots,x_s) = 1\). Thus we may assume that \(p\) is either the sum \(p^{(1)} + p^{(2)}\) or the product \(p^{(1)} \times p^{(2)}\) of simpler polynomials. If \(p = p^{(1)} + p^{(2)}\), we take \(\mathcal{F}(p) = \mathcal{F}(p^{(1)}) \cup \mathcal{F}(p^{(2)})\) and note that by the inductive assumption for \(p^{(1)}\) and \(p^{(2)}\) we have \[\begin{align} p_m(mn_1,m n_2,\dots, m n_s) &= p^{(1)}_m(mn_1,m n_2,\dots, m n_s) + p^{(2)}_m(mn_1,m n_2,\dots, m n_s) \\ &= m p^{(1)}(n_1,n_2,\dots, n_s) + m p^{(2)}(n_1,n_2,\dots,n_s) \\ &= m p(n_1,n_2,\dots, n_s), \end{align}\] where defined. Similarly, if \(p = p^{(1)} \times p^{(2)}\) we take \(\mathcal{F}(p) = \mathcal{F}(p^{(1)}) \cup \mathcal{F}(p^{(2)}) \cup \{( p^{(1)}, p^{(2)})\}\) and compute that \[\begin{align} p_m(mn_1,m n_2,\dots, m n_s) &= p^{(1)}_m(mn_1,m n_2,\dots, m n_s) \times_m p^{(2)}_m(mn_1,m n_2,\dots, m n_s) \\ &= m p^{(1)}(n_1,n_2,\dots, n_s) \times_m m p^{(2)}(n_1,n_2,\dots,n_s) \\ &= m p(n_1,n_2,\dots, n_s), \end{align}\] where defined. ◻

Proof of Prop.1. Let \(p \in \mathbb{Z}[x_1,x_2,\dots,x_s]\) be a polynomial. It follows from Lemma 1 that for \(n_1,n_2,\dots,n_s \in \mathbb{Z}\) and \(m \in \mathbb{Z}^*\) such that \(m q(n_1,n_2,\dots,n_s) \in \operatorname{dom}(\times_m)\) for all \(q \in \mathcal{F}(p)\), we have the equivalence \[\begin{align} p(n_1,n_2,\dots,n_s) &= 0 & \text{ if and only if} && p_m(mn_1,mn_2,\dots,mn_s) &= 0. \end{align}\] Since property [it:Q:B] guarantees the existence of admissible \(m \in \mathbb{Z}^*\), we conclude that \(p(n_1,n_2,\dots,n_s) = 0\) if and only if there exists \(m \in \mathbb{Z}^*\) such that \(p_m(n_1,n_2,\dots,n_s) = 0\). The last condition is expressible in the first-order language of \((\mathbb{Z};+,1,\mathcal{Q})\). Thus, if the first-order theory of \((\mathbb{Z};+,1,\mathcal{Q})\) was decidable, then it would also be decidable if a given polynomial equation is solvable, which is well-known to be false (see e.g.[27]). ◻

3 Proof of Theorem 1↩︎

3.1 Setup↩︎

Throughout this section, we let \(g\colon \mathbb{Z}\to \mathbb{Z}\) be the generalised polynomial given by \[\begin{align} \label{eq:27:def-g} && g(n) &= \left\lfloor \beta n \left\lfloor \alpha n \right\rceil \right\rceil,& n \in \mathbb{Z}, \end{align}\tag{7}\] where \(\alpha,\beta\in \mathbb{R}\), \(\beta \neq 0\) and \(1,\alpha,\alpha^2\) are linearly independent over \(\mathbb{Q}\). For concreteness, we assume that \(\alpha,\beta> 0\); the argument in the other cases is entirely analogous. Replacing \(g(n)\) with \(g(an)\) for suitably chosen \(a \in \mathbb{N}\), we may freely assume that \(\beta\in \mathbb{Z}\) or \(\beta\in \mathbb{R}\setminus \mathbb{Q}\).

3.2 Differentiation↩︎

A key feature of generalised polynomials which we will make use of is that their iterated discrete derivatives frequently vanish. For a map \(f \colon \mathbb{Z}\to \mathbb{Z}\) and \(m \in \mathbb{Z}\) we define the discrete derivative \(\operatorname{\mathrlap{\;\phantom{\circ}}\operatorname{\Delta}}_m f \colon \mathbb{Z}\to \mathbb{Z}\) by \[\begin{align} \label{eq:DeltaN-f} \operatorname{\mathrlap{\;\phantom{\circ}}\operatorname{\Delta}}_m f(n) &= f(n+m) - f(n), \end{align}\tag{8}\] and the symmetric discrete derivative \(\operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}_m f \colon \mathbb{Z}\to \mathbb{Z}\) by \[\begin{align} \label{eq:Delta-f} \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}_m f(n) &= \operatorname{\mathrlap{\;\phantom{\circ}}\operatorname{\Delta}}_m \operatorname{\mathrlap{\;\phantom{\circ}}\operatorname{\Delta}}_n f(0) = f(n+m) - f(n)-f(m) + f(0). \end{align}\tag{9}\] As an illustrative example, we point out that if \(f\) is a polynomial sequence of degree \(d \geq 1\) and \(m \neq 0\), then \(\operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}_m f\) is a polynomial of degree \(d-1\) (and if \(f\) is constant, then \(\operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}_m f\) is identically zero). In order to avoid the excessive use of subscripts, and in order to emphasise the symmetry \(\operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}_m f(n) = \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}_n f(m)\), for each \(r \geq 1\) and \(n_0,n_1,\dots,n_r \in \mathbb{Z}\) we let \[\begin{align} \label{eq:Delta94r-f} \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}^r f(n_0,n_1,\dots,n_r) &= \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}_{n_r} \dots \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}_{n_1} f(n_0). \end{align}\tag{10}\]

Lemma 2. There exists a constant \(C \geq 1\), dependent only on \(\alpha\) and \(\beta\), such that the following is true. Let \(n_0,n_1,n_2 \in \mathbb{N}\) satisfy \(n_0,n_1/n_0,n_2/n_1 \geq C\). Then \(\operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}^2 g(n_0,n_1,n_2) = 0\) if and only if

  1. for all \(I \subseteq\{0,1,2\}\) we have \(\left\lfloor \sum_{i \in I} \left< \alpha n_i \right> \right\rceil = 0\);

  2. letting \(\gamma_{I} := \sum_{i,j \in I} \left< \beta n_i \left\lfloor \alpha n_j \right\rceil \right>\) for \(I \subseteq\{0,1,2\}\), we have \[\left\lfloor \gamma_{\{0,1,2\}} \right\rceil = \left\lfloor \gamma_{\{0,1\}} \right\rceil + \left\lfloor \gamma_{\{1,2\}} \right\rceil + \left\lfloor \gamma_{\{2,0\}} \right\rceil.\]

Henceforth, we let \(C\) denote a constant that is admissible in Lemma 2 above.

Proof. We can explicitly compute that for \(n,m \in \mathbb{N}\) we have \[\begin{align} \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}g(n,m) &= \left\lfloor \beta(n+m) \big( \left\lfloor \alpha n \right\rceil + \left\lfloor \alpha m \right\rceil + e(n,m) \big) \right\rceil - \left\lfloor \beta n \left\lfloor \alpha n \right\rceil \right\rceil - \left\lfloor \beta m \left\lfloor \alpha m \right\rceil \right\rceil \\ &= \left\lfloor \beta n \left\lfloor \alpha m \right\rceil \right\rceil + \left\lfloor \beta m \left\lfloor \alpha n \right\rceil \right\rceil + \left\lfloor \beta(n+m) \right\rceil e(n,m) + f(n,m), \end{align}\] where \(e(n,m) \in \{-1,0,1\}\) and \(f(n,m) \in \{-4,-3,\dots,3,4\}\) are given by \[\begin{align} e(n,m) &= \left\lfloor \alpha n + \alpha m \right\rceil - \left\lfloor \alpha n \right\rceil - \left\lfloor \alpha m \right\rceil, \\ f(n,m) &= \left\lfloor \beta(n+m) \big( \left\lfloor \alpha n \right\rceil + \left\lfloor \alpha m \right\rceil + e(n,m) \big) \right\rceil - \left\lfloor \beta n \left\lfloor \alpha n \right\rceil \right\rceil - \left\lfloor \beta m \left\lfloor \alpha m \right\rceil \right\rceil \\& \phantom{=} - \left\lfloor \beta n \left\lfloor \alpha m \right\rceil \right\rceil - \left\lfloor \beta m \left\lfloor \alpha n \right\rceil \right\rceil - \left\lfloor \beta(n+m) \right\rceil e(n,m). \end{align}\] Similarly, we can compute that for \(n_0,n_1,n_2 \in \mathbb{N}\) we have \[\begin{align} g(n_0+n_1+n_2) &= g(n_0) + g(n_1) + g(n_2) + \textstyle{\sum_{i \neq j}} \left\lfloor \beta n_i \left\lfloor \alpha n_j \right\rceil \right\rceil \\& \phantom{=} + \left\lfloor \beta(n_0+n_1+n_2)e(n_0,n_1,n_2) \right\rceil + f(n_0,n_1,n_2), \end{align}\] where \(e(n_0,n_1,n_2) \in \{-1,0,1\}\) is given by \[\begin{align} e(n_0,n_1,n_2) &= \left\lfloor \alpha n_0 + \alpha n_1 + \alpha n_2 \right\rceil - \left\lfloor \alpha n_0 \right\rceil - \left\lfloor \alpha n_1 \right\rceil - \left\lfloor \alpha n_2 \right\rceil, \\ &= \left\lfloor \left< \alpha n_0 \right> + \left< \alpha n_1 \right> + \left< \alpha n_2 \right> \right\rceil \end{align}\] and \(f(n_0,n_1,n_2)\) is given by a similar formula. We stress that \(\left|e(n_0,n_1,n_2)\right| \leq 1\), which follows from the fact that \(-3/2 < \left< \alpha n_0 \right> + \left< \alpha n_1 \right> + \left< \alpha n_2 \right> < 3/2\).

Suppose that \(\operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}^2 g(n_0,n_1,n_2) = 0\). Combining the formulas above, we see that \[\begin{align} 0 = \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}^2 g(n_0,n_1,n_2) &= \beta (n_0+n_1+n_2)e(n_0,n_1,n_2) - \beta(n_0+n_1)e(n_0,n_1) \\& \phantom{=} - \beta(n_1+n_2)e(n_1,n_2) - \beta(n_2+n_0)e(n_2,n_0) + O(1). \end{align}\] Rearranging and dividing by \(\beta\) we obtain \[\begin{align} 0 &= n_2 \left( e(n_0,n_1,n_2) - e(n_1,n_2) - e(n_2,n_0) \right) \\& \phantom{=} + n_1 \left( e(n_0,n_1,n_2) - e(n_0,n_1) - e(n_1,n_2) \right) \\& \phantom{=} + n_0 \left( e(n_0,n_1,n_2) - e(n_0,n_1) - e(n_2,n_0) \right) + O(1). \end{align}\] Provided that the constant \(C\) was chosen large enough, it follows that the contributions from \(n_2\), \(n_1\) and \(n_0\) above must all be zero, meaning that \[\begin{align} e(n_0,n_1,n_2) - e(n_1,n_2) - e(n_2,n_0) &= 0,\\ e(n_0,n_1,n_2) - e(n_0,n_1) - e(n_1,n_2) &= 0,\\ e(n_0,n_1,n_2) - e(n_0,n_1) - e(n_2,n_0) &= 0. \end{align}\] Comparing the three equations above, we infer that for some \(e \in \{-1,0,1\}\) we have \[\begin{align} e(n_0,n_1) = e(n_1,n_2) = e(n_2,n_0) &= e, & e(n_0,n_1,n_2) = 2e. \end{align}\] Since \(e(n_0,n_1,n_2) \in \{-1,0,1\}\), it follows that \(e = 0\). This implies condition [cond:lem:D942610:I]. In order to obtain [cond:lem:D942610:II], we expand the equality \(\operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}^2 g(n_0,n_1,n_2) = 0\) (using the simplifications resulting from [cond:lem:D942610:I]) and replace each instance of \(\beta n_i \left\lfloor \alpha n_j \right\rceil\) with \(\left< \beta n_i \left\lfloor \alpha n_j \right\rceil \right> + \left\lfloor \beta n_i \left\lfloor \alpha n_j \right\rceil \right\rceil\).

The proof of the converse direction follows along similar lines. Indeed, as alluded to above, assuming condition [cond:lem:D942610:I], an elementary computation shows that the condition \(\operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}^2 g(n_0,n_1,n_2) = 0\) is equivalent to condition [cond:lem:D942610:II]. ◻

3.3 Fractional parts↩︎

Our next goal is to express in the first-order logic of the theory under consideration the condition that the circle norm \(\left\lVert \alpha n \right\rVert_{\mathbb{R}/\mathbb{Z}}\) is small for some \(n \in \mathbb{N}\). The following lemma will be helpful.

Lemma 3. Let \(n_0,n_1 \in \mathbb{N}\) satisfy \(n_0,n_1/n_0 \geq C\). Then the following conditions are equivalent:

  1. there exists \(n_2 \in \mathbb{N}\) with \(n_2 \geq C n_1\) such that \(\operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}^2 g(n_0,n_1,n_2) = 0\);

  2. \(\left| \left< \alpha n_0 \right> + \left< \alpha n_1 \right> \right| < 1/2\);

Proof. The implication [cond:lem:def-mu:I] \(\Rightarrow\) [cond:lem:def-mu:II] is already contained in Lemma 2, so we only need to verify the converse implication [cond:lem:def-mu:II] \(\Rightarrow\) [cond:lem:def-mu:I]. It follows e.g.from the characterisation of distributions of generalised polynomials in [19] that there exists a sequence \((n_{2}^{(k)})_{k=1}^\infty\) (with \(n_2^{(k)} \to \infty\) as \(k \to \infty\)) such that all fractional parts appearing in the conditions in Lemma 2 and involving \(n_2^{(k)}\) tend to \(0\) as \(k \to \infty\), i.e., \[\begin{align} \left< \alpha n_2^{(k)} \right> &\to 0, & \left< \beta n_2^{(k)} \left\lfloor \alpha n_2^{(k)} \right\rceil \right> &\to 0, \\ \left< \beta n_0 \left\lfloor \alpha n_2^{(k)} \right\rceil \right> &\to 0, & \left< \beta n_1 \left\lfloor \alpha n_2^{(k)} \right\rceil \right> &\to 0, \\ \left< \beta n_2^{(k)} \left\lfloor \alpha n_0 \right\rceil \right> &\to 0, & \left< \beta n_2^{(k)} \left\lfloor \alpha n_1 \right\rceil \right> &\to 0. \end{align}\] As a consequence, for all sufficiently large \(k\) we have \[\begin{align} \left\lfloor \left< \alpha n_0 \right> + \left< \alpha n_1 \right> + \langle \alpha n_2^{(k)} \rangle \right\rceil &= \left\lfloor \left< \alpha n_0 \right> + \left< \alpha n_1 \right> \right\rceil = 0, \end{align}\] and by the same token \[\begin{align} \left\lfloor \left< \alpha n_0 \right> + \langle \alpha n_2^{(k)} \rangle \right\rceil = \left\lfloor \left< \alpha n_1 \right> + \langle \alpha n_2^{(k)} \rangle \right\rceil = 0 \end{align}\] Similarly, with \(\gamma_{I}^{(k)}\) defined as in Lemma 2[cond:lem:D942610:II] with \(n_2^{(k)}\) in place of \(n_2\), for sufficiently large \(k\) we have \[\begin{align} \left\lfloor \gamma_{\{0,1,2\}}^{(k)} \right\rceil & = \left\lfloor \gamma_{\{0,1\}}^{(k)} \right\rceil,& \left\lfloor \gamma_{\{0,2\}}^{(k)} \right\rceil & = \left\lfloor \gamma_{\{0\}}^{(k)} \right\rceil = 0, & \left\lfloor \gamma_{\{1,2\}}^{(k)} \right\rceil & = \left\lfloor \gamma_{\{1\}}^{(k)} \right\rceil = 0. \end{align}\] In particular, it follows that we have \[\begin{align} \left\lfloor \gamma_{\{0,1,2\}}^{(k)} \right\rceil & = \left\lfloor \gamma_{\{0,1\}}^{(k)} \right\rceil + \left\lfloor \gamma_{\{1,2\}}^{(k)} \right\rceil + \left\lfloor \gamma_{\{2,0\}}^{(k)} \right\rceil. \end{align}\] Thus, Lemma 2 implies that \(\operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}^2 g(n_0,n_1,n_2^{(k)}) = 0\) for sufficiently large \(k\). ◻

Motivated by Lemma 3, we introduce the following properties \[\begin{align} \tag{11} \mu(n_0,n_1): && (\exists \;n_2)& \;(n_2 \geq C n_1) \wedge (\operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}^2g(n_0,n_1,n_2) = 0) && (n_0,n_1 \in \mathbb{N});\\ \tag{12} \psi(m,N): && (\forall \;n )& \;(n \leq N) \Rightarrow \mu(n,m) && (m,N \in \mathbb{N}). \end{align}\]

Lemma 4. For each \(\varepsilon> 0\) there exists \(N \in \mathbb{N}\) such that for all \(m \in \mathbb{N}\), if \(\psi(m,N)\) holds, then \(\left\lVert \alpha m \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\). Conversely, for each \(N \in \mathbb{N}\) there exists \(\varepsilon> 0\) such that if \(\left\lVert \alpha m \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\), then \(\psi(m,N)\) holds.

Proof. It follows from Lemma 3 that \(\psi(m,N)\) holds if and only if \[\begin{align} \label{eq:88:01} -\frac{1}{2}-\min_{n \leq N} \left< \alpha n \right> < \left< \alpha m \right> < \frac{1}{2}-\max_{n \leq N} \left< \alpha n \right> . \end{align}\tag{13}\] It remains to notice that the expression on the left side of 13 tends to \(0\) from below as \(N \to \infty\), and the analogous statement applies to the expression on the right side. ◻

3.4 Divisibility↩︎

We will next construct a first-order formula that expresses a condition closely related to divisibility of integers. We will need the following technical lemma, which is the only place where we make use of the assumption that \(\alpha^2\) is linearly independent of \(1,\alpha\) over \(\mathbb{Q}\).

Lemma 5. Let \(\alpha\in \mathbb{R}\) be such that \(1,\alpha,\alpha^2\) are linearly independent over \(\mathbb{Q}\). Let \(\theta = \frac{a + \alpha b}{c + \alpha d}\) with \(a,b,c,d \in \mathbb{Z}\), \(d \neq 0\) and \(\theta \not \in \mathbb{Q}\). Then the sequence \(\left( \left< \alpha n \right> , \left< \alpha\left\lfloor \theta n \right\rceil \right> \right)_{n \in \mathbb{Z}}\) is equidistributed with respect to the measure on \([-1/2,1/2)\) which is the push-forward of the normalised Lebesgue measure on \(\{0,1,\dots,d-1\} \times [-1/2,1/2) \times [-1/2,1/2)\) through the map \[\begin{align} \label{eq:transfer} (r,x,y) \mapsto \left( \left< dx + \alpha r \right> , \left< b x - c y + \alpha\theta r - \alpha\left< d y + \theta r \right> \right> \right). \end{align}\tag{14}\] In particular, the point \((0,0)\) belongs to the interior of the closure of the orbit \[\left\{ \big( \left< \alpha n \right> , \left< \alpha\left\lfloor \theta n \right\rceil \right> \big) \ \middle| \ n \in \mathbb{Z} \right\} .\]

Proof. Let \(n \in \mathbb{Z}\) and write \(n = d n' + r(n)\) where \(n' \in \mathbb{Z}\) and \(0 \leq r(n) < d\). Put also \(x(n) = \left< \alpha n' \right>\) and \(y(n) = \left< \theta n' \right>\). Since \(1,\alpha,\theta\) are linearly independent over \(\mathbb{Q}\), the sequence \(\left( r(n),x(n),y(n) \right)_{n \in \mathbb{Z}}\) is equidistributed in \(\{0,1,\dots,d-1\} \times [-1/2,1/2) \times [-1/2,1/2)\). Note that \(\alpha\theta d = a + \alpha b - \theta c\). We can compute that \[\begin{align} \left< \alpha n \right> &= \left< d x(n) + \alpha r(n) \right> ,\\ \left< \alpha\left\lfloor \theta n \right\rceil \right> &= \left< \alpha\theta n - \alpha\left< \theta n \right> \right> \\&= \left< (a + \alpha b - \theta c)n' + (\alpha\theta r(n)) - \alpha\left< \theta dn' + \theta r(n) \right> \right> \\&= \left< b x(n) - c y(n) + \alpha\theta r(n) - \alpha\left< d y(n) + \theta r(n) \right> \right> \end{align}\] This proves the first part of the statement. For the second part, we may take \(r = 0\) in 14 ; it will suffice to show that \((0,0)\) belongs to the interior of the image of \([-1/2,1/2) \times [-1/2,1/2)\) through the map given by \[\begin{align} \label{eq:transfer-2} (x,y) \mapsto \left( \left< dx \right> , \left< b x - c y - \alpha\left< d y \right> \right> \right). \end{align}\tag{15}\] This can be accomplished, for instance, by noting that the point \((0,0)\) is non-singular and is mapped to \((0,0)\). ◻

Lemma 6. Let \(n,n' \in \mathbb{N}\) and let \(h \geq 1\). Suppose that for each \(\varepsilon' > 0\) there exists \(\varepsilon> 0\) such that for each \(m \in \mathbb{N}\) with \(\left\lVert \alpha m \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\) there exists \(m' \in \mathbb{N}\) with \(\left\lVert \alpha m' \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon'\) such that \(\left| \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}g(n,m') - \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}g(n',m)\right| \leq h\). Then \(\left\lfloor \alpha n' \right\rceil/\left\lfloor \alpha n \right\rceil = n'/n \in \mathbb{N}\).

Proof. Our first goal is to show that \[\begin{align} \theta &:= \frac{\alpha n' + \left\lfloor \alpha n' \right\rceil}{\alpha n + \left\lfloor \alpha n \right\rceil} \end{align}\] is rational. For the sake of contradiction, suppose otherwise. It follows from Lemma 5 that for any \(x \in \mathbb{R}\) with sufficiently small absolute value we can find a sequence \((m_i)_{i=1}^\infty\) such that \(\left< \alpha m_i \right> \to 0\) and \(\left< \alpha\left\lfloor \theta m_i \right\rceil \right> \to x\) as \(i \to \infty\). It follows from the assumptions that we can find a sequence \((m_i')_{i=1}^\infty\) with \(\left< \alpha m_i' \right> \to 0\) such that for all \(i \in \mathbb{N}\) we have \[\begin{align} \label{eq:82:01} \left| \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}g(n,m'_i) - \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}g(n',m_i)\right| \leq h. \end{align}\tag{16}\] Restricting our attention to \(i\) such that \(\left< \alpha m_i \right>\) and \(\left< \alpha m_i' \right>\) are sufficiently small with respect to \(n\) and \(n'\), we conclude from 16 that \[\begin{align} \label{eq:82:02} m_i' \left\lfloor \alpha n \right\rceil + n \left\lfloor \alpha m_i' \right\rceil = m_i \left\lfloor \alpha n' \right\rceil + n' \left\lfloor \alpha m_i \right\rceil + O(h). \end{align}\tag{17}\] Computing \(m_i'\) from 17 we obtain \[\begin{align} \label{eq:82:03} m_i' = \theta m_i + O(h+n+n'). \end{align}\tag{18}\] Thus, there is a bounded sequence \((c_i)_{i=1}^\infty\) (with \(\left|c_i\right| = O(h+n+n')\)) such that \(m_i' = \left\lfloor \theta m_i \right\rceil + c_i\) for all \(i \in \mathbb{N}\). In particular, we have \[\begin{align} \label{eq:82:04} \alpha m_i' = \alpha\left\lfloor \theta m_i \right\rceil + \alpha c_i. \end{align}\tag{19}\] Passing to a subsequence where \(c_i = c\) is constant, taking 19 modulo \(1\) and letting \(i \to \infty\), we conclude that \[\begin{align} \label{eq:82:05} 0 \equiv x + \alpha c \bmod{1}. \end{align}\tag{20}\] Recall that \(x\) was arbitrary, subject only to the constraint that \(\left|x\right|\) is sufficiently small. Picking any \(x \in \mathbb{R}\setminus (\mathbb{Z}+ \alpha\mathbb{Z})\) we reach a contradiction, proving that \(\theta \in \mathbb{Q}\).

Next, we observe that \(\theta\) is an integer. Indeed, if we had \(\theta = {a}/{b} \in \mathbb{Q}\setminus \mathbb{Z}\) with \(\gcd(a,b) = 1\), then we could run the same argument as above with \(x = 1/b\), again leading to a contradiction. Since the argument is fully analogous, we omit the details. Finally, since \(\alpha\) is irrational, \(\theta \in \mathbb{N}\) implies that \({\left\lfloor \alpha n' \right\rceil}/{\left\lfloor \alpha n \right\rceil}= {n'}/{n} = \theta.\) ◻

It is not hard to see that the converse to Lemma 6 also holds. Indeed, if \(n,n' \in \mathbb{N}\) are such that \(\left\lfloor \alpha n' \right\rceil/\left\lfloor \alpha n \right\rceil = n'/n =: t\), then for each sufficiently small \(\varepsilon> 0\) and each \(m \in \mathbb{N}\) and \(m' := tm\) with \(\left\lVert \alpha m \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\) we have \[\begin{align} &&\beta m' \left\lfloor \alpha n \right\rceil = \beta m \left\lfloor \alpha n' \right\rceil &= t \beta m \left\lfloor \alpha n \right\rceil,&&\\ && \beta n \left\lfloor a m' \right\rceil = \beta n' \left\lfloor \alpha m \right\rceil &= t \beta n \left\lfloor a m \right\rceil,&&\\ \text{ and hence } && \left| \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}g(n,m') - \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}g(n',m) \right| &\leq 2.&& \end{align}\] (Specifically, we need \(\varepsilon< 1/(2t)\) and \(\varepsilon+ t\left\lVert \alpha n \right\rVert_{\mathbb{R}/\mathbb{Z}} < 1/2\) and \(t \varepsilon+ \left\lVert \alpha n \right\rVert_{\mathbb{R}/\mathbb{Z}} < 1/2\).)

Motivated by Lemma 6 and the discussion above, we define \[\begin{align} \label{eq:def-delta} \delta(n,n'): &&& (\exists\;H)\;(\forall\;M')\;(\exists\;M)\;(\forall\;m)\; \psi(m,M) \Rightarrow \\ \nonumber &&& (\exists\;m')\;\psi(m',M') \wedge { \left| \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}g(n,m') - \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}g(n',m)\right| \leq H } && (n,n' \in \mathbb{N}). \end{align}\tag{21}\]

Lemma 7. Let \(n,n' \in \mathbb{N}\). Then \(\delta(n,n')\) holds if and only if \(\left\lfloor \alpha n' \right\rceil/\left\lfloor \alpha n \right\rceil = n'/n \in \mathbb{N}\).

Proof. Follows by combining Lemma 6 with Lemma 4. ◻

For later reference, we point out that for \(n' = tn\) the condition \(\left\lfloor \alpha n' \right\rceil/\left\lfloor \alpha n \right\rceil = n'/n\) is equivalent to \(t \left\lVert \alpha n \right\rVert_{\mathbb{R}/\mathbb{Z}} < 1/2\).

3.5 Arithmetic progressions↩︎

It follows from Lemma 6 that for each \(k \in \mathbb{N}\), the set of \(n \in \mathbb{N}\) for which \(\delta(k,n)\) holds is an arithmetic progression with step \(k\), starting at \(k\). For \(k \in \mathbb{N}\), let \(\ell(k)\) be the endpoint of the aforementioned arithmetic progression; more explicitly, \(\ell(k) = k \lfloor {1}/( 2\left\lVert \alpha k \right\rVert_{\mathbb{R}/\mathbb{Z}} ) \rfloor .\) Note that \(\ell(k)\) is the unique integer \(n \in \mathbb{N}\) such that \(\delta(k,n)\) and \(\neg \delta(k,n+k)\), and as a consequence the map \(\ell \colon \mathbb{N}\to \mathbb{N}\) is defined by a first-order formula.

The discussion above allows us to talk about arithmetic progressions in the first-order language under consideration. Indeed, for \(k,h \in \mathbb{N}\) with \(h \leq \ell(k)\) we can define the arithmetic progression \[\begin{align} \label{eq:def-P} P_{m,h} = \left\{ t m \ \middle| \1 \leq t \leq h/m \right\} = \left\{ n \in \mathbb{N} \ \middle| \\left( m \leq n \leq h \right) \wedge \delta(m, n) \right\} . \end{align}\tag{22}\] We are specifically interested in arithmetic progressions \(P\) such that the restriction of \(g\) to \(P\) coincides with a classical (as opposed to generalised) polynomial. This property can be expressed easily enough.

Lemma 8. Let \(m,h \in \mathbb{N}\) be such that \(3m \leq h \leq \ell(m)\). The following conditions are equivalent

  1. there exists a degree-\(2\) polynomial \(p\) such that \(g(n) = p(n)\) for all \(n \in P_{m,h}\);

  2. there exists \(a \in \mathbb{Z}^*\) such that \(\operatorname{\mathrlap{\;\phantom{\circ}}\operatorname{\Delta}}_m^2 g(n) = a\) for all \(n \in P_{m,h-2m}\).

Proof. The implication [cond:lem:char-good-prog:I] \(\Rightarrow\) [cond:lem:char-good-prog:II] holds because the application of \(\operatorname{\mathrlap{\;\phantom{\circ}}\operatorname{\Delta}}_m\) decreases the degree of a polynomial by \(1\). Conversely, if [cond:lem:char-good-prog:II] holds and \(p\) is the (unique) polynomial of degree at most \(2\) with \(p(tm) = g(tm)\) for \(t \in \{1,2,3\}\), then the leading coefficient of \(p\) is \(a/(2m^2) \neq 0\) and a simple inductive argument shows that \(g(n) = p(n)\) for all \(n \in P_{m,h}\). ◻

We let \(\pi(m,h)\) denote the assertion that the progression \(P_{m,h}\) is admissible: \[\begin{align} \label{eq:def-pi} \pi(m,h): &&& \big( 3m \leq h \leq \ell(m) \big) \wedge (\exists\;a \in \mathbb{Z}^*) \\ &&& \nonumber (\forall\;n \in P_{m,h-2m} )\;\operatorname{\mathrlap{\;\phantom{\circ}}\operatorname{\Delta}}_m^2 g(n) = a && (m,h \in \mathbb{N}). \end{align}\tag{23}\]

We will need to know that admissible arithmetic progressions can be arbitrarily long. The following lemma gives a sufficient condition.

Lemma 9. Let \(m, r \in \mathbb{N}\), \(r \geq 2\). Then \(\pi(m,rm)\) holds provided that \[\begin{align} \label{eq:64:01} \left\lVert \alpha m \right\rVert_{\mathbb{R}/\mathbb{Z}} &< \frac{1}{2r} & \left\lVert \beta m \left\lfloor \alpha m \right\rceil \right\rVert_{\mathbb{R}/\mathbb{Z}} < \frac{1}{2 r^2}. \end{align}\tag{24}\]

Proof. Recall from earlier discussion that the first condition in 24 implies that \(\ell(m) \geq rm\). For \(t \leq r\) we now have \[\begin{align} g(t m) = \left\lfloor \beta tm \left\lfloor \alpha tm \right\rceil \right\rceil = \left\lfloor \beta t^2 m \left\lfloor \alpha m \right\rceil \right\rceil = t^2 \left\lfloor \beta m \left\lfloor \alpha m \right\rceil \right\rceil. \end{align}\] Thus, for \(n \in P_{m,h}\) we have \(g(n) = (n/m)^2 g(m)\), which in particular is a polynomial in \(n\), as needed. ◻

Note that for each \(r\), there exist \(m \in \mathbb{N}\) such that 24 holds. This can be inferred for instance from another reference to [19].

3.6 Multiplicative quadruples↩︎

With a view towards applying the results of Section 2, we are now ready to define a set of multiplicative quadruples: \[\begin{align} \label{eq:def-Q} \mathcal{Q}= \left\{ (m,a,b,c) \in \mathbb{N}^4 \ \middle| \\begin{array}{ll} (\exists\;h \in \mathbb{N})\;\pi(m,h) \wedge a,b,a+b,m+c \in P_{m,h} \\ \wedge \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}g(m,c) = \operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}g(a,b) \end{array} \right\} \end{align}\tag{25}\] Since for a degree-\(2\) polynomial \(p(n) = a_2 n^2 + a_1 n_1 + a_0\) we have \(\operatorname{\mathrlap{\;\circ}\operatorname{\Delta}}p(n,m) = 2a_2 n m\), it follows directly from the definition of \(\mathcal{Q}\) that for all \((m,a,b,c) \in \mathcal{Q}\) we have \(mc = ab\), and consequently \(a = k m\), \(b = l m\) and \(c = klm\) for some \(k,l \in \mathbb{N}\). In fact, by the same argument we see that \((m,a,b,c) \in \mathcal{Q}\) if and only if \(a = km\), \(b = l m\) and \(c = klm\) for some \(k,l \in \mathbb{N}\) such that \(\pi(m,(kl+1)m)\) holds. It follows from Lemma 9 that, given \(k,l \in \mathbb{N}\), one can find \(m \in \mathbb{N}\) such that \(\pi(m,(kl+1)m)\) holds, and hence the set \(\mathcal{Q}\) given by 25 satisfies condition [it:Q:B] restricted to \(F \subseteq\mathbb{N}^2\). Let \[\mathcal{Q}_{\pm} = \left\{ (m,\sigma a, \tau b, \sigma\tau c) \ \middle| \(m,a,b,c) \in \mathcal{Q},\;\sigma,\tau = \pm 1 \right\} .\] It follows from discussion above that the set \(\mathcal{Q}_{\pm}\) satisfies conditions [it:Q:A] and [it:Q:B]. Hence, we are in position to apply Proposition 1, which completes the proof of Theorem 1.

4 Proof of Theorem 2↩︎

4.1 Setup↩︎

Throughout this section, we let \(g \colon \mathbb{Z}\to \{0,1\}\) be the generalised polynomial given by \[\begin{align} \label{eq:bohr:def-g} g(n) &= \begin{cases} 1 &\text{if } \left\lVert \alpha n^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} < \rho,\\ 0 &\text{otherwise,} \end{cases} & n \in \mathbb{Z}, \end{align}\tag{26}\] where \(\alpha\in \mathbb{R}\), \(\rho \in (0,1/4)\) and \(\alpha\) is not a linear combination of \(1\) and \(\rho\) with rational coefficients.

4.2 Small fractional parts↩︎

Our first step is to express the property that \(\left\lVert 2 \alpha n \right\rVert_{\mathbb{R}/\mathbb{Z}}\) and \(\left\lVert \alpha n^2 \right\rVert_{\mathbb{R}/\mathbb{Z}}\) are both small. Consider the following property: \[\begin{align} \label{eq:bohr:def-mu} \mu(m,N): && (\forall\;n \leq N)\;g(n+m) &= g(n) & (m,N \in \mathbb{N}). \end{align}\tag{27}\]

Lemma 10. For each \(\varepsilon> 0\) there exists \(N \in \mathbb{N}\) such that for all \(m \in \mathbb{N}\), if \(\mu(m,N)\) holds, then \(\left\lVert 2 \alpha m \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\) and \(\left\lVert \alpha m^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\). Conversely, for each \(N \in \mathbb{N}\) there exists \(\varepsilon> 0\) such that if \(\left\lVert 2\alpha m \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\) and \(\left\lVert \alpha m^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\), then \(\mu(m,N)\) holds.

Proof. Let \(\varepsilon> 0\) and let \(N\) be a large integer, to be specified in the course of the argument. If \(\mu(m,N)\) holds, then for each \(n \leq N\) we have the equivalence \[\begin{align} \label{eq:bohr:64:01} \left\lVert \alpha n^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} &< \rho & \Longleftrightarrow && \left\lVert \alpha n^2 + \left< 2\alpha m \right> n + \left< \alpha m^2 \right> \right\rVert_{\mathbb{R}/\mathbb{Z}} < \rho. \end{align}\tag{28}\] Put \(\beta:= \left< 2\alpha m \right>\) and \(\gamma:= \left< \alpha m^2 \right>\). Condition 28 implies in particular that there is no \(n \leq N\) such that \(\left< \alpha n^2 \right> \in (\frac{9}{10}\rho,\rho)\) and \(\left< \beta n + \gamma \right> \in (\frac{1}{10}\rho,\frac{2}{10}\rho)\). It follows that for some \(\rho'\) dependent only on \(\rho\), using the terminology of [28], the sequence \(\left( \left( \alpha n^2, \beta n + \gamma \right) \right)_{n=1}^N\) fails to be \(\rho'\)-equidistributed. Assuming, as we may, that \(N\) is sufficiently large, it follows (e.g.as a very special case of the quantitative Leibman theorem from [28]) that there exists \(k \in \mathbb{N}\), bounded by a constant \(K\) dependent only on \(\rho\), such that \(\left\lVert k \beta \right\rVert_{\mathbb{R}/\mathbb{Z}} \ll_\rho 1/N\). Thus (possibly after replacing \(k\) by a factor), we may write \(\beta = b/k + x/N\) where \(0 \leq b < k\) and \(\gcd(b,k) = 1\) and \(\left|x\right| \ll_\rho 1\).

We claim that \(\left|\gamma\right| \leq \varepsilon\). Suppose conversely that this was not the case, and for the sake of concreteness assume that \(\gamma\geq \varepsilon\). Then, assuming that \(N\) is sufficiently large, we use quantitative equidistribution results for polynomial sequences (e.g.[28]) to find \(n \in \mathbb{N}\) such that \[\begin{align} \label{eq:bohr:64:02} \left< \alpha n^2 \right> &\in (\rho - \varepsilon,\rho),& \left< \beta n \right> &\in (-\varepsilon/2,\varepsilon/2), & n \left|x\right| < (\varepsilon/2) N. \end{align}\tag{29}\] This implies \(\left\lVert \alpha n^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} < \rho\) and \(\left\lVert \alpha n^2 + \beta n + \gamma \right\rVert_{\mathbb{R}/\mathbb{Z}} > \rho\), contradicting 28 .

Next, we claim that \(b = 0\), or equivalently, \(k = 1\). Suppose conversely. Arguing like above, we find \(n \in \mathbb{N}\) such that \[\begin{align} \label{eq:bohr:64:03} \left< \alpha n^2 \right> &\in (\rho - \varepsilon,\rho),& nb &\equiv 1 \bmod k,& & n \left|x\right| < (\varepsilon/2) N. \end{align}\tag{30}\] It follows that \(\left\lVert \alpha n^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} < \rho\) and, assuming, as we may, that \(\varepsilon< 1/2K\), we have \[\begin{align} \left\lVert \alpha n^2 + \beta n + \gamma \right\rVert_{\mathbb{R}/\mathbb{Z}} \geq \rho + \frac{1}{k} - 2\varepsilon> \rho, \end{align}\] which again contradicts 28 , and completes the proof of the first statement.

We proceed to the proof of the second part of the lemma. Let \(N \in \mathbb{N}\) and put \[\begin{align} \label{eq:bohr:64:def-delta} \delta = \min \left\{ \left|\left\lVert \alpha n^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} - \rho\right| \ \middle| \n \leq N \right\} > 0. \end{align}\tag{31}\] Suppose now that \(m\) is such that \[\begin{align} \label{eq:bohr:64:def-eps} \left\lVert 2 \alpha m \right\rVert_{\mathbb{R}/\mathbb{Z}} &< \frac{\delta}{10N} & \text{and} && \left\lVert \alpha m^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} &< \frac{\delta}{10}. \end{align}\tag{32}\] For \(n \leq N\) we have \[\begin{align} \left| \left\lVert \alpha(n+m)^2 - \left\lVert \alpha n^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} \right\rVert_{\mathbb{R}/\mathbb{Z}}\right|& \leq \left\lVert 2 \alpha m n + \alpha m^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} \\ &\leq N \left\lVert 2\alpha m \right\rVert_{\mathbb{R}/\mathbb{Z}} + \left\lVert \alpha m^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} \leq \delta/5 < \delta, \end{align}\] and consequently \(g(n+m) = g(n)\). It follows that we may take \(\varepsilon= \delta/10N\). ◻

Our next step is to express the property that \(\left\lVert \alpha m \right\rVert_{\mathbb{R}/\mathbb{Z}}\) is small, with no restrictions on \(\left\lVert \alpha m^2 \right\rVert_{\mathbb{R}/\mathbb{Z}}\). This is easily achieved with the help of \(\mu(m,N)\). Define \[\begin{align} \label{eq:bohr:def-lambda} \lambda(m,N): && (\exists\;n)\;\mu(n,N) \wedge \mu(n+m,N) & & (m,N \in \mathbb{N}). \end{align}\tag{33}\]

Lemma 11. For each \(\varepsilon> 0\) there exists \(N \in \mathbb{N}\) such that for all \(m \in \mathbb{N}\), if \(\lambda(m,N)\) holds, then \(\left\lVert 2\alpha m \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\). Conversely, for each \(N \in \mathbb{N}\) there exists \(\varepsilon> 0\) such that if \(\left\lVert 2\alpha m \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\), then \(\lambda(m,N)\) holds.

Proof. Let \(\varepsilon> 0\) and pick \(N\) sufficiently large that \(\mu(m,N)\) implies \(\left\lVert 2\alpha n \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon/2\). Then \(\lambda(m,N)\) implies \(\left\lVert 2\alpha m \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\), as needed.

Conversely, let \(N\) be an integer. Pick \(\delta > 0\) sufficiently small that if for some \(n\) we have \(\left\lVert 2\alpha n \right\rVert_{\mathbb{R}/\mathbb{Z}} < \delta\) and \(\left\lVert \alpha n^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} < \delta\), then \(\lambda(n,N)\) holds. Let \(m\) be an integer such that \(\left\lVert 2\alpha m \right\rVert_{\mathbb{R}/\mathbb{Z}} < \delta/2\) and \(m > 2/\delta\). By Weyl’s equidistribution theorem, there exists \(n\) such that \[\begin{align} \label{eq:bohr:22:1} \left\lVert \alpha n^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} &< \frac{\delta}{2} & \text{and} && \left|\left< 2\alpha n \right> + \left< \alpha m^2 \right> /m\right| < \frac{\delta}{2m}. \end{align}\tag{34}\] Then we have \[\begin{align} \label{eq:bohr:22:2} \left\lVert 2\alpha n \right\rVert_{\mathbb{R}/\mathbb{Z}} & \leq \frac{1}{m} < \delta & \text{and} && \left\lVert 2\alpha(n+m) \right\rVert_{\mathbb{R}/\mathbb{Z}} & \leq \frac{1}{m} + \frac{\delta}{2} < \delta. \end{align}\tag{35}\] Of course, \(\left\lVert \alpha n^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} < \delta\). It remains to estimate \(\left\lVert \alpha(n+m)^2 \right\rVert_{\mathbb{R}/\mathbb{Z}}\). We have \[\begin{align} \left\lVert \alpha(n+m)^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} & = \left\lVert \alpha n^2 + m \left( \left< 2\alpha n \right> + \left< \alpha m^2 \right> /m \right) \right\rVert_{\mathbb{R}/\mathbb{Z}} < \frac{\delta}{2} + m \frac{\delta}{2m} = \delta. \end{align}\] Thus, we have \(\mu(n,N)\) and \(\mu(n+m,N)\), as needed. It follows that we can take any \(\varepsilon> 0\) with \(\varepsilon< \delta/2\) and \(\varepsilon< \left\lVert \alpha n \right\rVert_{\mathbb{R}/\mathbb{Z}}\) for all \(n \leq 2/\delta\). ◻

Next, we express the property that \(\left\lVert \alpha n^2 \right\rVert_{\mathbb{R}/\mathbb{Z}}\) is small, which turns out to require more effort. Towards this goal, we introduce the following property.

\[\begin{align} \kappa(m,N): &&& (\exists\;M)\;(\forall\;h)\;\left( g(h) = 1 \right) \wedge \lambda(h,M) \Rightarrow \\ &&& (\forall\;L)\;(\exists\;n)\;\lambda(n,L) \wedge \mu(n,N) \wedge \left( g(h+m+n)=1 \right) &&(m,N \in \mathbb{N}). \end{align}\]

Lemma 12. For each \(\varepsilon> 0\) there exists \(N \in \mathbb{N}\) such that for all \(m \in \mathbb{N}\), if \(\kappa(m,N)\) holds, then \(\left\lVert \alpha m^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\). Conversely, for each \(N \in \mathbb{N}\) there exists \(\varepsilon> 0\) such that if \(\left\lVert \alpha m^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\), then \(\kappa(m,N)\) holds.

Proof. Let \(\varepsilon> 0\), and let \(N\) be such that \(\mu(n,N)\) implies \(\left\lVert \alpha n^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon/2\) (which exists by Lemma 10). Pick \(m \in \mathbb{N}\) such that \(\kappa(m,N)\) holds; we aim to show that \(\left\lVert \alpha m^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\). For the sake of contradiction, suppose that \(\left< \alpha m^2 \right> \geq \varepsilon\) (the case where \(\left< \alpha m^2 \right> \leq -\varepsilon\) is analogous). Let \(M\) be any integer admissible for \(\kappa(m,N)\), and let \(h\) be an integer such that \[\begin{align} \label{eq:bohr:58:1} \left< \alpha h^2 \right> &\in \left( \rho-\frac{\varepsilon}{8},\rho \right), & \left< 2\alpha h \right> & \in \left( 0,\frac{\varepsilon}{8m} \right) && \lambda(n,M), \end{align}\tag{36}\] which exists by Weyl’s equidistribution theorem. (Note that the second and the third condition in 36 can be satisfied by taking \(h\) such that \(\left< \alpha h \right>\) is sufficiently small and positive.) Bearing in mind Lemma 11, we infer from \(\kappa(m,N)\) that there exists a sequence of integers \((n_i)_{i=1}^\infty\) such that \[\begin{align} \tag{37}\lim_{i \to \infty} \left\lVert \alpha n_i \right\rVert_{\mathbb{R}/\mathbb{Z}} &= 0, \\ \tag{38}\left\lVert \alpha n_i^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} &< \varepsilon/2 & \text{for all } i \in \mathbb{N},\\ \tag{39}\left\lVert \alpha(h+m+n_i)^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} &< \rho & \text{for all } i \in \mathbb{N}. \end{align}\] Passing to a subsequence, we may assume that \(\left< \alpha n_i^2 \right>\) converges to some limit \(\beta \in [-\varepsilon/2,\varepsilon/2]\) as \(i \to \infty\). Letting \(i \to \infty\) in 39 we arrive at a contradiction: \[\begin{align} \rho &\geq \lim_{i \to \infty} \left\lVert \alpha(h+m+n_i)^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} = \left\lVert \alpha(h+m)^2 + \beta \right\rVert_{\mathbb{R}/\mathbb{Z}} \\& = \left\lVert \alpha h^2 + \left< 2\alpha h \right> m + \alpha m^2 + \beta \right\rVert_{\mathbb{R}/\mathbb{Z}} \geq \rho - \frac{\varepsilon}{8} - m \frac{\varepsilon}{8m} + \varepsilon- \frac{\varepsilon}{2} = \rho + \frac{\varepsilon}{8}. \end{align}\]

We now proceed to the proof of the converse implication. Let \(N\) be an integer and let \(\varepsilon> 0\) be sufficiently small that \(\left\lVert \alpha n^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} < 2\varepsilon\) and \(\left\lVert 2\alpha n \right\rVert_{\mathbb{R}/\mathbb{Z}} < 2\varepsilon\) imply \(\mu(n,N)\). Pick \(M\) sufficiently large that \(\lambda(h,M)\) implies that \(\left\lVert 2\alpha h \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon/m\), and let \(h\) satisfy \(g(h) = 1\) and \(\lambda(h,M)\) (and hence in particular \(\left\lVert 2\alpha h \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon/m\)). We need to show that there exists a sequence of integers \((n_i)_{i=1}^\infty\) such that \[\begin{align} \tag{40}\lim_{i \to \infty} \left\lVert \alpha n_i \right\rVert_{\mathbb{R}/\mathbb{Z}} &= 0, \\ \tag{41}\mu(n_i,N) &\text{ is true}& \text{for all sufficiently large } i \in \mathbb{N},\\ \tag{42}\left\lVert \alpha(h+m+n_i)^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} &< \rho & \text{for all sufficiently large } i \in \mathbb{N}. \end{align}\] (Note that in 41 and 42 it is enough to consider sufficiently large \(i\) since we can always pass to a subsequence and achieve the same condition for all \(i\).) By Weyl’s equidistribution theorem, we can find a sequence \((n_i)_{i=1}^\infty\) such that \[\begin{align} \tag{43}\lim_{i \to \infty} \left< \alpha n_i \right> &= 0, \\ \tag{44}\lim_{i \to \infty} \left< \alpha n_i^2 \right> &= - \left< 2 \alpha h m + \alpha m^2 \right> . \end{align}\] By 44 can estimate \[\begin{align} \lim_{i \to \infty} \left\lVert \alpha n_i^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} & < \frac{\varepsilon}{m} m + \varepsilon= 2\varepsilon, \end{align}\] and hence 41 follows. It remains to deal with 42 . We can compute \[\begin{align} \lim_{i \to \infty} \left\lVert \alpha(h+m+n_i)^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} = \left\lVert \alpha(h+m)^2+ \lim_{i\to\infty} \left< \alpha n_i^2 \right> \right\rVert_{\mathbb{R}/\mathbb{Z}} = \left\lVert \alpha h^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} < \rho, \end{align}\] as needed. ◻

The next property that we would like to express is that \(\alpha n^2\) and \(\alpha m^2\) are close modulo \(1\). With this goal in mind, we define \[\begin{align} \label{eq:bohr:def-nu} \nu(m,\tilde{m},N): &&& (\exists\;L)\;(\forall\;n)\;\lambda(n,L) \wedge \kappa(m+n,L) \Rightarrow \\ &&& \kappa(\tilde{m}+n,N) & && (m,\tilde{m},N \in \mathbb{N}). \end{align}\tag{45}\]

Lemma 13. For each \(\varepsilon> 0\) there exists \(N \in \mathbb{N}\) such that for all \(m,\tilde{m} \in \mathbb{N}\), if \(\nu(m,\tilde{m},N)\) holds, then \(\left\lVert \alpha(m^2 - \tilde{m}^2) \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\). Conversely, for each \(N \in \mathbb{N}\) there exists \(\varepsilon> 0\) such that if \(\left\lVert \alpha(m^2 - \tilde{m}^2) \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\), then \(\nu(m,\tilde{m},N)\) holds.

Proof. By Lemma 11, condition \(\nu(m,\tilde{m},N)\) is equivalent to the statement that for each sequence \((n_i)_{i=1}^\infty\) such that \(\left\lVert \alpha n_i \right\rVert_{\mathbb{R}/\mathbb{Z}} \to 0\) and \(\left\lVert \alpha(m+n_i)^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} \to 0\) as \(i \to \infty\) we have \(\kappa(\tilde{m} + n_i,N)\) for all sufficiently large \(i\). Consider any sequence \((n_i)_{i=1}^\infty\) such that \(\left\lVert \alpha n_i \right\rVert_{\mathbb{R}/\mathbb{Z}} \to 0\). Then \[\begin{align} \left< \alpha(m+n_i)^2 - \alpha m^2 - \alpha n_i^2 \right> & \to 0 & \text{as } i \to \infty,\\ \left< \alpha(\tilde{m}+n_i)^2 - \alpha\tilde{m}^2 - \alpha n_i^2 \right> & \to 0 & \text{as } i \to \infty. \end{align}\] Thus, if \(\varepsilon> 0\) and \(N\) is sufficiently large that \(\kappa(m,N)\) implies \(\left\lVert \alpha m^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon/3\), then (taking any sequence \((n_i)_{i=1}^\infty\) as above) \(\nu(m,\tilde{m},N)\) implies \[\begin{align} \left\lVert \alpha(m^2-\tilde{m}^2) \right\rVert_{\mathbb{R}/\mathbb{Z}} & \leq \lim_{i \to \infty} \left\lVert \alpha(m+n_i)^2 - \alpha m^2 - \alpha n_i^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} \\ & + \left\lVert \alpha(\tilde{m}+n_i)^2 - \alpha\tilde{m}^2 - \alpha n_i^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} \\ &+ \left\lVert \alpha(m+n_i)^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} + \left\lVert \alpha(\tilde{m}+h_i)^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} \leq \varepsilon/3. \end{align}\] Conversely, if \(N \in \mathbb{N}\) and \(\varepsilon> 0\) is sufficiently small that \(\left\lVert \alpha m^2 \right\rVert_{\mathbb{R}/\mathbb{Z}}< 3\varepsilon\) implies \(\kappa(m,N)\), then for any \(m,\tilde{m}\) with \(\left\lVert \alpha(m^2 - \tilde{m}^2) \right\rVert_{\mathbb{R}/\mathbb{Z}} < \varepsilon\) and any sequence \((n_i)_{i=1}^\infty\) as above we have \[\begin{align} \left\lVert \alpha(\tilde{m}+n_i)^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} & \leq \lim_{i \to \infty} \left\lVert \alpha(m+n_i)^2 - \alpha m^2 - \alpha n_i^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} \\ & + \left\lVert \alpha(\tilde{m}+n_i)^2 - \alpha\tilde{m}^2 - \alpha n_i^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} \\ &+ \left\lVert \alpha(m+n_i)^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} + \left\lVert \alpha(m^2-\tilde{m}^2) \right\rVert_{\mathbb{R}/\mathbb{Z}} \leq \varepsilon, \end{align}\] and hence \(\kappa(\tilde{m}, N)\) holds. ◻

4.3 Divisibility↩︎

Finally, we are able to define divisibility. Consider the relation \[\begin{align} \delta(m,\tilde{m}): &&& (\forall\;N)\;(\exists\;L)\;(\forall\;n)\;\nu(m+n,m,L) \wedge \kappa(n,L) \Rightarrow \\ &&& \nu(\tilde{m}+n,\tilde{m},N) & && (m,\tilde{m}\in \mathbb{N}). \end{align}\]

Lemma 14. For each \(m,\tilde{m} \in \mathbb{N}\) we have \(\delta(m,\tilde{m})\) if and only if \(m \mid \tilde{m}\).

Proof. Condition \(\delta(m,\tilde{m})\) is equivalent to the statement that for each \(\varepsilon> 0\), for each sequence \((n_i)_{i=1}^\infty\), if \(\left\lVert \alpha((m+n_i)^2 - m^2) \right\rVert_{\mathbb{R}/\mathbb{Z}} \to 0\) and \(\left\lVert \alpha n_i^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} \to 0\) as \(i \to \infty\), then \(\left\lVert \alpha((\tilde{m} + n_i)^2 - \tilde{m}^2) \right\rVert_{\mathbb{R}/\mathbb{Z}} \leq \varepsilon\) for all sufficiently large \(i\). Note that the condition that \(\left\lVert \alpha((m+n_i)^2 - m^2) \right\rVert_{\mathbb{R}/\mathbb{Z}} \to 0\) and \(\left\lVert \alpha n_i^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} \to 0\) is equivalent to \(\left\lVert 2 \alpha m n_i \right\rVert_{\mathbb{R}/\mathbb{Z}} \to 0\) and \(\left\lVert \alpha n_i^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} \to 0\) as \(i \to \infty\). If \(\tilde{m} = k m\) is a multiple of \(m\), then for any sequence \((n_i)_{i=1}^\infty\) like above we have \(\left\lVert 2 \alpha\tilde{m} n_i \right\rVert_{\mathbb{R}/\mathbb{Z}} \leq k \left\lVert 2 \alpha m n_i \right\rVert_{\mathbb{R}/\mathbb{Z}} \to 0\) as \(i \to \infty\). Conversely, if \(\tilde{m}/m = a/b\) with \(\gcd(a,b) = 1\), then we can find a sequence \((n_i)_{i=1}^\infty\) with \(\left\lVert 2 \alpha m n_i \right\rVert_{\mathbb{R}/\mathbb{Z}} \to 0\), \(\left\lVert \alpha n_i^2 \right\rVert_{\mathbb{R}/\mathbb{Z}} \to 0\), and \(\left\lVert 2 \alpha\tilde{m} n_i \right\rVert_{\mathbb{R}/\mathbb{Z}} \to 1/b\) as \(i \to \infty\). ◻

Lemma 14 implies that divisibility is definable in the first-order theory under consideration. It is a classical [22] result that multiplication can be defined in \((\mathbb{Z};<,+,\mid)\), meaning in particular that the corresponding first-order theory is undecidable. This completes the proof of Theorem 2.

References↩︎

[1]
Ivan Korec. A list of arithmetical structures complete with respect to the first-order definability. volume 257, pages 115–151. 2001. Weak arithmetics. https://doi.org/10.1016/S0304-3975(00)00113-4.
[2]
A L Semënov. Logical theories of one-place functions on the set of natural numbers. Mathematics of the USSR-Izvestiya, 22(3):587, jun 1984. URL: https://dx.doi.org/10.1070/IM1984v022n03ABEH001456, https://doi.org/10.1070/IM1984v022n03ABEH001456.
[3]
Gregory Cherlin and Françoise Point. On extensions of Presburger arithmetic. In Proceedings of the fourth Easter conference on model theory (Gross Köris, 1986), volume 86 of Seminarberichte, pages 17–34. Humboldt Univ., Berlin, 1986.
[4]
Alexis Bès. A survey of arithmetical definability. Number suppl., pages 1–54. 2001. A tribute to Maurice Boffa.
[5]
A L Semenov. On certain extensions of the arithmetic of addition of natural numbers. Mathematics of the USSR-Izvestiya, 15(2):401, apr 1980. URL: https://dx.doi.org/10.1070/IM1980v015n02ABEH001252, https://doi.org/10.1070/IM1980v015n02ABEH001252.
[6]
Jean-Paul Allouche and Jeffrey Shallit. Automatic sequences. Cambridge University Press, Cambridge, 2003. Theory, applications, generalizations. https://doi.org/10.1017/CBO9780511546563.
[7]
J. Richard Büchi. On a decision method in restricted second order arithmetic. In Logic, Methodology and Philosophy of Science (Proc. 1960 Internat. Congr.), pages 1–11. Stanford Univ. Press, Stanford, CA, 1962.
[8]
Véronique Bruyère, Georges Hansel, Christian Michaux, and Roger Villemaire. Logic and \(p\)-recognizable sets of integers. volume 1, pages 191–238. 1994. Journées Montoises (Mons, 1992). URL: http://projecteuclid.org/euclid.bbms/1103408547.
[9]
Véronique Bruyère, Georges Hansel, Christian Michaux, and Roger Villemaire. Correction to: Logic and \(p\)-recognizable sets of integers.” Bull. Belg. Math. Soc. Simon Stevin, 1(4):577, 1994.
[10]
Philipp Hieronymi and Christian Schulz. A strong version of Cobham’s theorem. In STOC ’22—Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing, pages 1172–1179. ACM, New York, [2022]©2022.
[11]
Christian Schulz. Undefinability of multiplication in Presburger arithmetic with sets of powers. The Journal of Symbolic Logic, page 1–24, 2023. https://doi.org/10.1017/jsl.2023.71.
[12]
Hamoon Mousavi, Luke Schaeffer, and Jeffrey Shallit. Decision algorithms for Fibonacci-automatic words, I: Basic results. RAIRO Theor. Inform. Appl., 50(1):39–66, 2016. https://doi.org/10.1051/ita/2016010.
[13]
Chen Fei Du, Hamoon Mousavi, Luke Schaeffer, and Jeffrey Shallit. Decision algorithms for Fibonacci-automatic words, III: Enumeration and abelian properties. Internat. J. Found. Comput. Sci., 27(8):943–963, 2016. https://doi.org/10.1142/S0129054116500386.
[14]
Chen Fei Du, Hamoon Mousavi, Eric Rowland, Luke Schaeffer, and Jeffrey Shallit. Decision algorithms for Fibonacci-automatic words, II: Related sequences and avoidability. Theoret. Comput. Sci., 657(part B):146–162, 2017. https://doi.org/10.1016/j.tcs.2016.10.005.
[15]
Aseem R. Baranwal and Jeffrey Shallit. Critical exponent of infinite balanced words via the Pell number system. In Combinatorics on words, volume 11682 of Lecture Notes in Comput. Sci., pages 80–92. Springer, Cham, 2019. https://doi.org/10.1007/978-3-030-28796-2.
[16]
Philipp Hieronymi and Alonza Terry, Jr. Ostrowski numeration systems, addition, and finite automata. Notre Dame J. Form. Log., 59(2):215–232, 2018. https://doi.org/10.1215/00294527-2017-0027.
[17]
Philipp Hieronymi, Dun Ma, Reed Oei, Luke Schaeffer, Christian Schulz, and Jeffrey Shallit. Decidability for Sturmian words. In 30th EACSL Annual Conference on Computer Science Logic, volume 216 of LIPIcs. Leibniz Int. Proc. Inform., pages Art. No. 24, 23. Schloss Dagstuhl. Leibniz-Zent. Inform., Wadern, 2022.
[18]
Vitaly Bergelson and Alexander Leibman. Distribution of values of bounded generalized polynomials. Acta Math., 198(2):155–230, 2007. https://doi.org/10.1007/s11511-007-0015-y.
[19]
A. Leibman. A canonical form and the distribution of values of generalized polynomials. Israel J. Math., 188:131–176, 2012. https://doi.org/10.1007/s11856-011-0097-2.
[20]
Boris Adamczewski and Jakub Konieczny. Bracket words: a generalisation of Sturmian words arising from generalised polynomials. Trans. Amer. Math. Soc., 376(7):4979–5044, 2023.
[21]
Victoria Ruth Neale. Bracket quadratics as asymptotic bases for the natural numbers. PhD thesis, Trinity College, University of Cambridge, 2011.
[22]
Julia Robinson. Definability and decision problems in arithmetic. J. Symbolic Logic, 14:98–114, 1949. https://doi.org/10.2307/2266510.
[23]
Jakub Byszewski and Jakub Konieczny. Sparse generalised polynomials. Trans. Amer. Math. Soc., 370(11):8081–8109, 2018.
[24]
Michael D. Boshernitzan. Uniform distribution and Hardy fields. J. Anal. Math., 62:225–240, 1994. https://doi.org/10.1007/BF02835955.
[25]
Nikos Frantzikinakis. Equidistribution of sparse sequences on nilmanifolds. J. Anal. Math., 109:353–395, 2009. https://doi.org/10.1007/s11854-009-0035-y.
[26]
Jakub Byszewski and Jakub Konieczny. Pisot numbers, Salem numbers, and generalised polynomials. Preprint, https://arxiv.org/abs/2302.06242.
[27]
Yuri V. Matiyasevich. Hilbert’s tenth problem. Foundations of Computing Series. MIT Press, Cambridge, MA, 1993. Translated from the 1993 Russian original by the author, With a foreword by Martin Davis.
[28]
Ben Green and Terence Tao. The quantitative behaviour of polynomial orbits on nilmanifolds. Ann. of Math. (2), 175(2):465–540, 2012. https://doi.org/10.4007/annals.2012.175.2.2.

  1. Formally, we may identify \(f\) with \(\left|\Sigma\right|\) unary predicates corresponding to the level sets \(f^{-1}(x)\) for \(x \in \Sigma\).↩︎

  2. This first-order theory consists, by definition, of all first-order sentences in the language of \((\mathbb{N};+,s)\) that are true, for all \(\alpha\in (0,1) \setminus \mathbb{Q}\) and \(\rho \in [0,1)\), when \(s\) is interpreted as \(s_{\alpha,\rho}\); see [17] for details.↩︎