Global complexities for infinite sequences

1 year ago

Mira Anisiu

(original), paper

Abstract

Authors

Mira-Cristiana Anisiu
Tiberiu Popoviciu Institue of Numerical Analysis, Romanian Academy, Romania

Keywords

?

Paper coordinates

M.-C. Anisiu, Global complexities for infinite sequences, in Analysis, Functional Equations, Approximation and Convexity, Proceeding of the Conference held in Honor of Professor Elena Popoviciu, Eds. L. Lupşa and M. Ivan, Carpatica, Cluj-Napoca, 1999, 7-11 (pdf file here)

PDF

https://ictp.acad.ro/anisiu/papers/1999-Anisiu-GlobComp.pdf

About this paper

Journal

Publisher Name

DOI

Print ISSN

Online ISSN

google scholar link

[1] J.-P. Allouche, Sur la complexite des suites infnies, Bull. Belg. Math. Soc. 1(1994), 133-143
[2] P. Arnoux, Ch. Mauduit, I. Shiokawa, Jun-ichi Tamura, Complexity of sequences defined by billiards in the cube, Bull. Soc. Math. France 122(1)(1994), 1-12
[3] S. Ferenczi, Complexity of sequences and dynamical systems, to appear in Discrete Math. 206(1999)
[4] S. Ferenczi, Z. K·sa, Complexities for finite factors of infinite sequences, to appear in Theor. Comput. Sci. 218(1)(1999), 177-195
[5] A. Ivanyi, On the d-complexity of words, Ann. Univ. Sci. Budapest Sect. Comput. 8(1987), 69-90
[6] J. Shallit, On the maximum number of distinct factors in a binary string, Graphs and Combinatorics 9(1993), 197-200

1999-Anisiu-GlobComp

Global complexities for infinite sequences

Mira-Cristiana Anisiu*"T. Popoviciu" Institute of Numerical AnalysisP. O. Box 68, 3400 Cluj-Napoca

1 Introduction

The language complexity of a finite word or infinite sequence is aimed to give a measure of the number of different factors in the given word or sequence. In fact, the definitions in the finite or infinite case coincide.

Let

A

be a finite nonvoid alphabet and

U = u_{0} u_{1} \dots

an infinite sequence with

u_{i} \in A, i \in N

. The complexity of the sequence will be a function

p_{U} : N^{*} \to N

given by

\begin{matrix} (1) & p_{U} (n) = ♯ L_{n} (U), n \in N^{*} \end{matrix}

♯

denoting the cardinal of the set

L_{n} (U)

of the factors of length

n

in

U

.
For the finite word

w

of length

q \in N^{*}

, the complexity

p_{w}

may be conceptually defined in the same way. But, because

♯ L_{n} (w) = 0

for

n > q

, we can consider only the restriction on the set

{1, \dots, q}

, so

p_{w} : {1, \dots, q} \to N

,

\begin{matrix} (2) & p_{w} (n) = ♯ L_{n} (w), n \in {1, \dots q} . \end{matrix}

The complexity functions

p_{U}

, respectively

p_{w}

, estimate the richness in factors of length

n

of a sequence or of a word, for every

n \in N^{*}

or

n \in {0, \dots, q}

. It would be of great help to have a global indicator of the complexity.

In the case of finite words there are two known definitions, the first one of total complexity given independently by Iványi [5] and Shallit [6], the second of maximal complexity, suggested by Rauzy.

Definition 1 For the word

w

of length

q \in N^{*}

, the total complexity is given by

\begin{matrix} (3) & K_{w} = \sum_{j = 1}^{q} p_{w} (j), \end{matrix}

and the maximal complexity is

\begin{matrix} (4) & C_{w} = {max}_{j = 1}^{q} p_{w} (j) \end{matrix}

The aim of this paper is to define similar global complexities (not depending on the specific length

n

of the factors) for infinite sequences.

2 The significance of total and maximal complexity for words

Given a word of length

q

, the complexity function

p_{w}

may be considered as a vector in the space

R^{q}

. On this space we have the well-known Minkowski norm

‖ x ‖_{1} = \sum_{j = 1}^{q} | x_{j} |

, Chebyshev norm

‖ x ‖_{\infty} = {max}_{j = 1}^{q} | x_{j} |

, or Euclid norm

‖ x ‖ = {(\sum_{j = 1}^{q} x_{j}^{2})}^{1 / 2}

, which are of course equivalent. It is obvious that if we denote

p_{w} = (p_{w} (1), \dots, p_{w} (q))

we shall have for the total complexity

K_{w} = {‖ p_{w} ‖}_{1}

and for the maximal complexity

C_{w} = {‖ p_{w} ‖}_{\infty}

We can also consider a Euclidian complexity given by

‖ p_{w} ‖

, which has the disadvantage that its values are not in general integer numbers.

The total and maximal complexities were used by Ferenczi and Kasa [4] to define upper and lower complexities for infinite sequences, in the following way:

the upper and lower total finite-word complexity function by

\begin{aligned} K_{U}^{+} (n) = max_{i} K (u_{i} u_{i + 1} \dots u_{i + n - 1}), \\ (5) & K_{U}^{-} (n) = min_{i} K (u_{i} u_{i + 1} \dots u_{i + n - 1}); \end{aligned}

the upper and lower maximal finite-word complexity function by

\begin{aligned} C_{U}^{+} (n) = max_{i} C (u_{i} u_{i + 1} \dots u_{i + n - 1}), \\ (6) & C_{U}^{-} (n) = min_{i} C (u_{i} u_{i + 1} \dots u_{i + n - 1}) . \end{aligned}

These two notions, having many interesting properties, are extensively studied in [4]. Being functions of

n \in N^{*}

, the upper and lower total (maximal) finite-word complexity functions are similar to the initial complexity function

p_{U}

. What we intend is to define global complexities for the case of infinite sequences too.

3 Global complexities for infinite sequences

There already exist some notions to estimate the global complexity for infinite sequences, as for example the topological entropy [1]

\begin{matrix} (7) & h_{U} = lim_{n \to \infty} \frac{\log p_{U} (n)}{n \log ♯ A}, \end{matrix}

which satisfies

0 \leq h_{U} \leq 1

for every sequence

U

.
The definitions we give extend those for finite words to infinite sequences; to this aim it is necessary to "normalize" the values of the complexity function

p_{U} (n)

by dividing them with

(♯ A)^{n}

.

Definition 2 For the infinite sequence

U

, the total complexity is given by

\begin{matrix} (8) & K_{U} = \sum_{n = 1}^{\infty} \frac{p_{U} (n)}{(♯ A)^{n}}, \end{matrix}

and the maximal complexity by

\begin{matrix} (9) & C_{U} = {sup}_{n = 1}^{\infty} \frac{p_{U} (n)}{(♯ A)^{n}} . \end{matrix}

Remark 1 The total complexity in definition 2 is not necessarily finite; for example, for the Champernowne word containing successively all the binary written numbers

011011100101111 \dots

we have

p_{U} (n) = 2^{n}

and

K_{U} = \infty

, while

C_{U} = 1

.

For this reason, instead of definition 2 we propose one for normal complexity, which will have values less than 1 .

Definition 3 For the infinite sequence

U

, the normal complexity is given by

\begin{matrix} (10) & K_{U} = \sum_{n = 1}^{\infty} \frac{1}{(♯ A)^{n}} \frac{p_{U} (n)}{1 + p_{U} (n)} \end{matrix}

Remark 2 In the definitions above, there appears the sequence

c_{n} = \frac{p_{U} (n)}{(# A)^{n}}, n \in N^{*}

Because of the inequality

p_{U} (n + 1) \leq (# A) p_{U} (n), n \in N^{*}

holding for each complexity function, the sequence

c_{n}

is non-increasing, so

C_{U} = c_{1} = 1

. For the Champernowne sequence we have

c_{n} = 1, n \in N^{*}

.

4 Global complexities for sequences with known complexity functions

In the following we consider sequences for which the complexity function is known and try to determine (at least approximately) their global complexities. The alphabet has three symbols in example 4.1,

a + b

in 4.2 and two symbols for Sturmian sequences and those in 4.3 and 4.4. The topological entropy is

h_{U} = 0

, excepting example 4.4.

4.1 Sequences defined by billiards in the cube

A sequence generated by the structure of billiard trajectories in the cube, associating to a trajectory starting with totally irrational direction the sequence with values in

{1, 2, 3}

given by coding 1 (respectively 2,3 ) any time
the particle rebounds on a frontal (lateral, horizontal) side of the cube, was shown in [2] to have the complexity

p_{U} (n) = n^{2} + n + 1

.

Proposition 1 For a sequence defined by the cubic billiard with totally irrational direction, the total, maximal and normal complexity will be

K_{U} = 2.75, C_{U} = 1 and K_{U} \approx 0.3994006256 .

4.2 Sequences with $p_{U} (n) = a n + b$

For an important class of sequences the complexity function is linear.
Proposition 2 Let

U

be a sequence having the complexity function given by

p_{U} (n) = a n + b, n \in N^{*} (a \in N^{*}, b \in N, a + b \geq 2)

. Its total, maximal and normal complexity will be

\begin{aligned} K_{U} =_{2} F_{1} (1, \frac{2 a + b}{a}; \frac{a + b}{a}; \frac{1}{a + b}), C_{U} = 1 \\ K_{U} = \frac{1}{a + b + 1}_{3} F_{2} (1, \frac{2 a + b}{a}, \frac{a + b + 1}{a}; \frac{a + b}{a}, \frac{2 a + b + 1}{a}; \frac{1}{a + b}) \end{aligned}

where

_{α} F_{β}

denotes the hypergeometric function.
Remark 3 Well-known sequences of this type are Sturmian sequences (which are not ultimately periodic, but are recurrent), for which

a = b = 1

; they have the global complexities given by

K_{U} = 3, C_{U} = 1 and K_{U} = 7 / 2 - 4 \ln 2 \approx 0.727411278 .

4.3 The power sequence

Let us consider the words

v_{i} = 0^{i}, w_{i} = 1^{i}

and

u_{i} = v_{i} w_{i}, i \in N^{*}, u_{i}

being obtained by concatenation of

v_{i}

and

w_{i}

. The power sequence

U

is given by

\begin{matrix} (11) & U = u_{1} u_{2} u_{3} \dots = 010011000111 \dots \end{matrix}

and has the complexity

p_{U} = n (n + 1) / 2 + 1

.
Proposition 3 For the power sequence (11), the global complexities are given by

K_{U} = 5, C_{U} = 1, K_{U} \approx 0.7595479501

4.4 Champernowne sequence

This sequence was mentioned in Section 3 and it is obtained by writing successively all the binary written numbers. Its complexity function is

p_{U} (n) = 2^{n}

and the topological entropy is

h_{U} = 1

.

Proposition 4 For the Champernowne sequence, the global complexities are given by

K_{U} = \infty, C_{U} = 1, K_{U} \approx 0.7644997803 .

Acknowledgements. The author thanks the Ministry of Science and Technology for supporting partially this work (3004GR/1997).

References

[1] J.-P. Allouche, Sur la complexité des suites infinies, Bull. Belg. Math. Soc. 1(1994), 133-143
[2] P. Arnoux, Ch. Mauduit, I. Shiokawa, Jun-ichi Tamura, Complexity of sequences defined by billiards in the cube, Bull. Soc. Math. France 122(1)(1994), 1-12
[3] S. Ferenczi, Complexity of sequences and dynamical systems, to appear in Discrete Math. 206(1999)
[4] S. Ferenczi, Z. Kása, Complexities for finite factors of infinite sequences, to appear in Theor. Comput. Sci. 218(1)(1999), 177-195
[5] A. Iványi, On the

d

-complexity of words, Ann. Univ. Sci. Budapest Sect. Comput. 8(1987), 69-90
[6] J. Shallit, On the maximum number of distinct factors in a binary string, Graphs and Combinatorics 9 (1993), 197-200

*E-mail: mira@math.ubbcluj.ro

1999

Global complexities for infinite sequences

Abstract

Authors

Keywords

Paper coordinates

PDF

About this paper

Journal

Publisher Name

DOI

Print ISSN

Online ISSN

References

Global complexities for infinite sequences

1 Introduction

2 The significance of total and maximal complexity for words

3 Global complexities for infinite sequences

4 Global complexities for sequences with known complexity functions

4.1 Sequences defined by billiards in the cube

4.2 Sequences with p U ( n ) = a n + b p U ( n ) = a n + b p_(U)(n)=an+bp_{U}(n)=a n+bpU(n)=an+b

4.3 The power sequence

4.4 Champernowne sequence

References

Related Posts

4.2 Sequences with $p_{U} (n) = a n + b$