ATM Transaction Simulation:
Combination of ACDs and Cox Process

Reza Habibi^∗

(Date: June 03, 2024; accepted: February 27, 2025; published online: June 30, 2025.)

Abstract.

Two main approaches for analyzing the ultra-frequency data such as ATM (auto teller machine) transaction are Cox process and autoregressive conditional durations (ACDs). This paper combines both models and gives its advantages. The functional data analysis proposes useful method for modeling the intensity of counting process. Two simulated cases results are verified. A real data set is analyzed and conclusions are also given.

Key words and phrases:

ATM transaction, ACD model, Cox process, functional data analysis, intensity function.

2005 Mathematics Subject Classification:

62M10.

^∗Iran Banking Institute, Central Bank of Iran, Tehran, Iran

1. Introduction

Irregularly spaced financial time series have received considerable attention in high-frequency data literature, see [6]. High frequency time series forecasting is a crucial field that tackles the analysis of data recorded at very short intervals, from seconds to fractions of a second. This discipline is fundamental in various sectors, from meteorology to finance, energy management, and quality control in manufacturing. Intraday transactions of ATM and POS [8], trading in stock markets [11], and volatility patterns in high frequency trading [9] are well-known examples of these time series. In the current paper, the ATM transactions are studied. To this end, suppose that $N(t)$ counts the number of intraday transactions of an ATM of a specific bank recorded up until time $0<t<1$ . Indeed, $N(t)$ is a counting process with intensity function $\mathit{\Lambda}(t)$ .

There are two main independent approaches for analyzing these types of time series including the Cox Poisson process (referred as approach a, in this paper) from [11] and ACD models (approach b) from [4]. However, in the current paper, the combined approach (called approach c) is proposed. There, it is assumed that both types of Cox and ACD models govern on the data, simultaneously. According to the best author’s knowledge, this model is not applied before in the literature and has many advantages which are discussed in Section 2. Although, approaches a and b are customized for problem of in hand of the current paper.

$Approach\ a$ : Cox process. Following [11], assume that $N(t)$ is modeled as Cox process. That is a Poisson process with a random intensity $\mathit{\Lambda}(t)$ where

\mathit{\Lambda}\left(t\right)=\int^{t}_{0}{\lambda\left(s\right)ds}.

The Cox process is a type of point process. For comprehensive review on Cox process and generally point processes, see [3].

Often, to find the functional forms of $\lambda$ and $\mathit{\Lambda}$ , the functional data analysis (FDA) method is proposed which gives an approximation for $\lambda\left(t\right)$ .

$Approach\ b$ : ACD models. The ACD model mainly uses the stopping times of $N\left(t\right)$ and stopped process properties, without assuming $N(t)$ being a Poisson process, as Cox process assumes. To describe more, let ${\tau}_{k}$ be the time of $k$ -th transaction (in a day)

{\tau}_{k}={\mathrm{inf}}_{t}\left\{N\left(t\right)=k\right\}.

The related duration be $L_{k}$ defined by

L_{k}={\tau}_{k}-{\tau}_{k-1}.

Let $L_{k}$ be modeled by ACD model (with intercept $c$ ) from [4]; i.e.,

L_{k}={c+\vartheta}_{k}e_{k},

at which errors $e_{k}$ ’s are independent, positive random variables with $E\left(e_{k}\right)=1$ and

{\vartheta}_{k}=\alpha+\sum^{p}_{j=1}{{\gamma}_{j}L_{k-j}+\sum^{q}_{j=1}{{% \beta}_{j}{\vartheta}_{k-j},}}

where $p,\ q$ are unknown dimensions of model which are optimized during solving the case study problem while parameters ${\gamma}_{j}$ and ${\beta}_{j}$ are unknown parameters which should be estimated. The authors from [4] proposed a close relationship between ACD and GARCH models. The $A C D m$ package of software $R$ estimates these parameters, accurately and quickly, see [1].

$Approach\ c$ : Combined model. Here, it is assumed that both models of Cox and ACD are hold, simultaneously. Some advantages of this approach are:

1.

Under this setting, the exact Monte Carlo simulation for dynamics of $\lambda\left(t\right)$ is derived.
2.

Often, FDA $\lambda\left(t\right)$ is a time-consuming task, because of choosing the length of linear combination of orthogonal basis function.
3.

Choosing basis functions and number of them are a little subjective which is critical in applied problems.
4.

By combined model, $N(t)$ is also simulated, directly, by Binomial distribution which approximates the Poisson distribution, see Section 2.3.

The rest of paper is organized as follows. In the next section, two methods are proposed to derive the dynamics of $\lambda\left(t\right)$ . Section 3 gives the results of simulations. A real data set is analyzed in Section 4. Concluding remarks are proposed in Section 5.

2. Dynamics of $\boldsymbol{\lambda}\left(\boldsymbol{t}\right)$

Here, dynamics of $\mathrm{\Lambda}\left(\mathrm{t}\right)$ and $\lambda\left(t\right)\$ are derived. To this end, first, the FDA method is proposed which gives an approximation for $\lambda\left(t\right)$ . Then, under the combined model setting, the derivation of $\lambda\left(t\right)$ is based on Monte Carlo simulation which uses the exact distribution of partial sums of $L_{k}$ ’s.

2.1. FDA $\boldsymbol{\lambda}\left(\boldsymbol{t}\right)$

In practice, FDA is used to model the intensity function $\lambda\left(t\right)$ of Poisson process as random element in Hilbert space, see [7]. To this end, considering $n$ days, let $N_{i}(t)$ the number of transactions throughout the $i$ -th day with intensity function ${\lambda}_{i}\left(t\right)$ . Following [8], to remove the periodically effects for different days, let ${\lambda}_{i}\left(t\right)$ be the intensity function of $i$ -th day and let

{\delta}_{i}\left(t\right)={\lambda}_{i}\left(t\right)-{\lambda}_{i-7}\left(t% \right),

and consider the following functional autoregressive model for ${\delta}_{i}\left(t\right)$ as follows

{\delta}_{i}\left(t\right)=\int^{1}_{0}{\rho(s,t){\delta}_{i-1}\left(s\right)}% ds+{\varepsilon}_{i}\left(t\right),

where kernel $\rho$ is estimated using the functional principal component and error terms

{\{\varepsilon}_{i}\left(t\right);t\in[0,1]\}

are supposed to be independent functions such that ${E\{\varepsilon}_{i}\left(t\right)\}=0$ for each $t\in[0,1]$ and

E\int^{1}_{0}{{\varepsilon}^{2}_{i}\left(t\right)dt={\sigma}^{2}<\infty}.

For comprehensive review on functional data analysis, see [10]. Package fda.usc of software $R$ is useful instrument for studying functional time series data, see [5]. Then, use the basis representation for $\lambda(t)$ such as

\lambda\left(t\right)=\sum^{m}_{k=1}{b_{k}{\phi}_{k}\left(t\right)\ },

(see [10]) for basis functions ${\phi}_{k}\left(t\right)$ for $k=1,\dots,m$ , say Fourier basis functions. Therefore,

\mathit{\Lambda}\left(t\right)=\int^{t}_{0}{\sum^{m}_{k=1}{b_{k}{\phi}_{k}% \left(s\right)ds=\sum^{m}_{k=1}{b_{k}\int^{t}_{0}{{\phi}_{k}\left(s\right)ds.}% \ }\ }}

In the literature, widely used selections for ${\phi}_{k}\left(t\right)$ are

{\phi}_{k}\left(t\right)=\left\{\begin{array}[]{cc}\frac{{\mathrm{sin}\left(0.% 5\left(k+1\right)t\right)\ }}{\sqrt{\mathrm{\pi}}}&k\ \mathrm{is\ odd\ integer% ,}\\ \frac{{\mathrm{cos}\left(0.5kt\right)\ }}{\sqrt{\mathrm{\pi}}}&k\ \mathrm{is\ % even\ integer.}\end{array}\right.

However, in practice, we find $\widehat{\mathit{\Lambda}}(t)$ which we have

\widehat{\mathit{\Lambda}}\left(t\right)=\mathit{\Lambda}\left(t\right)+\zeta% \left(t\right),

for some error terms $\zeta\left(t\right)$ . Therefore,

\widehat{\mathit{\Lambda}}\left(t\right)=\sum^{m}_{k=1}{b_{k}\int^{t}_{0}{{% \phi}_{k}\left(s\right)ds+}\zeta\left(t\right).}

Following [2], this is a type of functional regression analysis. To this end, variables $\widehat{\mathit{\Lambda}}\left(t_{j}\right)$ and $\int^{t_{j}}_{0}{{\phi}_{k}\left(s\right)ds}$ are computed for $t=t_{j};\ j=1$ ,…, $M$ , ( $m<M$ ) and parameters of a multiple regression model are estimated.

2.2. $\boldsymbol{\lambda}\left(\boldsymbol{t}\right)$ in combined model

Here, the exact functional form of $\lambda(t)$ is derived. To this end, let $L_{0}={\tau}_{0}=0$ and

{\tau}_{k}=\sum^{k}_{j=1}{L_{j}}.

Notice that

P\left({\tau}_{k}\leq t\right)=P(N\left(t\right)>k).

Hence,

P\left({\tau}_{k-1}\leq t\right)=P(N\left(t\right)>k-1).

Let $p_{k}\left(t\right):=P\left(N\left(t\right)=k\right)$ . One can see that

p_{k}\left(t\right)=P\left(N\left(t\right)>k-1\right)-P\left(N\left(t\right)>k% \right)=P\left({\tau}_{k-1}\leq t\right)-P\left({\tau}_{k}\leq t\right).

It is concluded that

p_{k}\left(t\right)=P\left(\sum^{k-1}_{j=1}{L_{j}}\leq t\right)-P\left(\sum^{k% }_{j=1}{L_{j}}\leq t\right).

This relation leads to the computation of the exact probabilities. Notice that

p_{k}\left(t\right)=P\left({\tau}_{k-1}\leq t<{\tau}_{k}\right)=P\left(\sum^{k% -1}_{j=1}{L_{j}}\leq t<\sum^{k}_{j=1}{L_{j}}\right).

Therefore, the Monte Carlo estimate of $p_{k}(t)$ is the number of times (in M repetitions of Monte Carlo simulations) that the random interval

\left[\sum^{k-1}_{j=1}{L_{j}};\sum^{k}_{j=1}{L_{j}}\right)

contains the constant number $t$ . In practice, after fitting and $ACD(p,q)$ to $L_{k}$ , the Monte Carlo simulation method approximates the exact distribution of $\sum^{k}_{j=1}{L_{j}}$ and $\sum^{k-1}_{j=1}{L_{j}}$ , hence $P\left(N\left(t\right)=k\right)$ is computed. Notice that

{k!p}_{k}\left(t\right)=\mathrm{exp}\mathrm{}(-\mathrm{\Lambda}\left(\mathrm{t% }\right)\mathrm{)}{\mathit{\Lambda}}^{k}\left(\mathrm{t}\right).

Therefore,

\mathrm{\Lambda}\left(\mathrm{t}\right)-k\operatorname{log}\left(\mathrm{% \Lambda}\left(\mathrm{t}\right)\right)+b_{k}\left(t\right)=0,

where

b_{k}\left(t\right)=\mathrm{log}\mathrm{}({k!p}_{k}\left(t\right)).

This is a non-linear equation and root-finding methods such as Newton-Raphson are applicable, as follows: For iteration $r$ -th, let

{\mathrm{\Lambda}}_{r}\left(t\right)={\mathrm{\Lambda}}_{r-1}\left(t\right)+% \frac{{\mathrm{\Lambda}}_{r-1}\left(t\right)-k\operatorname{log}\left({\mathrm% {\Lambda}}_{r-1}\left(t\right)\right)+b_{k}\left(t\right)}{1-k/{\mathrm{% \Lambda}}_{r-1}\left(t\right)}.

To find $\lambda\left(t\right),$ after finding $\mathit{\Lambda}(t)$ for discrete values of $t$ ’s, the numerical differentiation is applied to find discrete values of $\lambda(t)$ . Then, a smoothing method such as spline or smoothing polynomials is used to derive the functional form of $\lambda(t)$ .

2.3. Another combined $\boldsymbol{\lambda}\left(\boldsymbol{t}\right)$

In Sections 2.1 and 2.2, we proposed some methods for obtaining functional form of $\lambda(t)$ based on Cox process and ACD models. However, in practice, the series of numbers of events that each occurred in small fraction of the time are recorded and it is necessary to simulate $N(t)$ , itself, directly.

The idea behind this method is that the Binomial distribution approximates the Poisson distribution. To propose the method, suppose that throughout a crowded business day which is expected the possible numbers of $t$ ’s in $\left(t-h,t\right)$ where at them transaction occur i.e., $n$ (of Bernoulli distribution) is large. However, because of some political or social events which has happened in yesterday, the probability of transaction $p_{t}$ is too low such that

np_{t}\approx\int^{h}_{t-h}{\lambda\left(s\right)ds}.

Following [8] and noticing that

\int^{h}_{t-h}{\lambda\left(s\right)ds}\approx h\lambda\left(t\right),

it is seen that $\frac{1}{h}np_{t}$ is a good estimate of $\lambda\left(t\right)$ . Therefore, $N\left(t\right)$ is simulated by sampling from binomial distribution with parameter $\left(n,p_{t}\right)$ , directly and

\lambda\left(t\right)=\tfrac{1}{h}np_{t}.

Then, by collecting number of transactions and fitting an ACD model, the empirical estimate of $N(t)$ and consequently $p_{t}$ are estimated.

Suppose that, the dynamic of $p_{t}$ is proposed. For example, consider the dynamics of $p_{t}$ given by Ito stochastic differential equation, as follows:

dp_{t}=\alpha p_{t}dt+\delta p_{t}dB_{t},

where $B_{t}$ is standard Brownian motion on (0,1). To obtain parameters $\alpha,\ \delta$ , notice that they are mean and standard deviation of $\frac{dp_{t}}{p_{t}dt}$ which are estimated by their related samples values. Therefore, the dynamic of $\lambda\left(t\right)$ is proposed.

3. Simulations

In this section, some simulated cases are analyzed. Banks often refuse to provide tick-by-tick transaction data of ATMs and do not make this type of data available (or at least hard available) to the public due to network security issues and keeping customer’s secrets. However, a small part of database is usually given to researchers from the core system of databases of banks. This is why, in the current paper, we only survey the simulated situations which correspond to real data. For using simulated data instead of real one, we must be sure that the simulated data with the combined model are good approximations for the real data. However, since in both cases, we use the dataset in [11], we are sure that these considerations are checked. Case 1 studies the Cox process with known dynamic for $\lambda$ . Case 2 gives simulation results under the combined model setting.

Case 1: $\lambda(t)$ as OU process. In the Cox process, motivated by [11], suppose that $\lambda(t)$ is an Ornstein-Uhlenbeck (OU process) process defined by

d\lambda\left(t\right)=-\beta\lambda\left(t\right)dt+\beta tdz,\

where $d z$ is increment of Brownian motion, $t\in(0,1)$ and $0<\beta<\infty$ . Here, it is supposed that $\beta=0.2$ which corresponds to the empirical results from [11]. Consider $t_{i}=\frac{i}{100},\ i=0,\dots,99$ and let $\mathit{\Lambda}\left(t_{i}\right)=\frac{1}{100}\sum^{b}_{u=1}{\lambda(u)},$ where $b=[100t_{i}]$ . To simulate $N(t)$ at $t_{i}$ ’s, increments $N\left(t_{i}\right)-N(t_{i-1})$ are samples from Poisson distribution with rate $\frac{1}{100}\sum^{b}_{u=a}{\lambda(u)}$ where $a=\left[100t_{i-1}\right].$ In this way, the partial sums of increments generate paths of Poisson process. Therefore, $L_{k}$ ’s are computable. It is easy to see that $L_{k}$ is $ACD(1,1)$ process with intercept, as follows

L_{k}=0.0898+{\vartheta}_{k}e_{k},

where $e_{k}$ has exponential distribution with mean 1 and

{\vartheta}_{k}=0.0017+0.885e_{k-1}-1.58{\vartheta}_{k-1}.

Case 2: Combined $\lambda(t)$ . Here addition to Cox process assumption, assume that $L_{k}$ come from process $ACD(1,1)$ . For the weekly data from [11], the ACD model is defined by

L_{k}=0.01+{\vartheta}_{k}e_{k},

{\vartheta}_{k}=0.002+0.65e_{k-1}-2{\vartheta}_{k-1},

with $e_{k}$ being exponentially distributed random variable with rate 1. Hence, empirical distributions of $L_{k},\ k=1,2$ are computed. Next, using the Monte Carlo method proposed in subsection 2.1, $P(N\left(t\right)=2)$ is computed for various values of $t_{i}$ ’s. Then, using the Newton-Raphson and numerical differentiation, values of $\mathit{\Lambda}(t)$ and $\lambda(t)$ are computed, respectively. The following figure gives the plot of $\lambda(t)$ . Smoothing $\lambda(t)$ by basis Fourier function, it is seen that

\lambda\left(t\right)=\big{(}1+{\mathrm{sin}\left(t\right)}+{\mathrm{cos}\left% (t\right)}\big{)}/\sqrt{\pi}.

Refer to caption — Figure 1. Plot of $\lambda(t)$

Although it is difficult to obtain real tick-by-tick data in practice, nevertheless, the case 3 provides an alternative method for reconstructing $\ \lambda(t)$ . For the weekly data from [11], the following smoothing results are provided:

{\lambda}_{i}\left(t\right)=\Big{(}c_{0i}+c_{1i}{\mathrm{sin}\left(t\right)\ }% +c_{2i}\mathrm{cos}\mathrm{}(t)\Big{)}/\sqrt{\pi},

where $\pi=3.141592$ and $c_{ji},\ j=0,1,2$ are periodic functions with period 7 given as follows:

$i$	1	2	3	4	5	6	7
$c_{0i}$	0.2	0.2	0.5	0.5	0.1	0.25	0.25
$c_{1i}$	0.1	0.1	0.5	0.25	0.5	0.2	0.5
$c_{2i}$	0.2	0.2	0.1	0.1	0.1	0.25	0.25

Table 1. Values of

c_{ji}

for

j=0,1,2

and

i=1,\dots,7

4. Real data sets

Here, the method of Section 2.3 is applied to 3 real-time series.

Data set 1. The dataset includes 11520 observations which are 15-minute by 15-minute ATM transactions of a selected branch of an Iranian Bank ABC (which we are not naming for security reasons) from March 11, 2024 to October 11 2024 (30 days) during 7 AM to 7 PM. For a day, there are $n=48$ 15 minutes. Therefore, $h=1/48$ . Thus, $\lambda\left(t\right)=p_{t}$ . The time series plot of first 5000 observations is given as follows:

It is seen that $\mathrm{\alpha}=0.641,\ \delta=0.457$ . Here, we provide fittings of the model on many real high frequency data sets, then obtain the optimal parameters, plot on the same figure the real data and the optimal model and last analyze the corresponding residuals. The following plot gives the simultaneously, time series of actual $\lambda\left(t\right)$ (blue line) and its estimate (red line).

The following table gives the max, min, mean and standard deviation (sd) of 5000 residuals

\Big{|}\frac{\lambda\left(t\right)}{est(t)}-1\Big{|}.

The table shows that errors are negligible.

Max	Min	Mean	SD
0.0632	0.0024	0.0414	0.019

Table 2. Residuals properties: maximum, minimum, mean, and standard deviation.

Data set 2. In the second data set, the transactions of an ATM along one day for 6336 days are recorded. First the following ACD model is fitted to duration of transactions

L_{k}=0.045+{\vartheta}_{k}e_{k},

{\vartheta}_{k}=0.001+0.34e_{k-1}-1.68{\vartheta}_{k-1},

$k=1,2,\dots,6336$ . The following plot gives real $N(t)$ is plotted against its estimated process derived from above ACD model using simulation of a Poisson process based on simulating the $L_{k}$ ’s partial sums. To better presentation the first 3000 observations i.e., the actual $N\left(t\right)$ (blue line) and its estimate (red line) are presented. This figure shows the maximum closeness of both series.

Again, the summaries of errors are proposed in the following table.

Max	Min	Mean	SD
0.0475	0.0064	0.0325	0.032

Table 3. Differences between

N(t)

and its estimate: maximum, minimum, mean, and standard deviation.

Also, a Poisson process is fitted to data, based on functional estimate of $\lambda\left(t\right)$ , given by

\lambda\left(t\right)=(0.01+0.002{\mathrm{sin}\left(t\right)\ }+0.25\mathrm{% cos}\mathrm{}(t))/\sqrt{2\pi}.

The following table gives the related errors.

Max	Min	Mean	SD
0.0734	0.0055	0.0455	0.043

Table 4. Differences between actual

N(t)

and its Poisson simulation: maximum, minimum, mean, and standard deviation.

Data set 3. Here, the functional form of $\lambda\left(t\right)$ of previous data set are compared with its actual values. First the following table shows that the errors are negligible. Then, different scenarios are studied using 1 times standard deviations of errors as shocks to $\lambda\left(t\right)$ . The following figure shows the errors of actual $\lambda\left(t\right)$ and its functional estimates. Shocks are simulated using normal distributions with zero means and 1 times standard deviations.

As seen, the shocks increase as the number of time series increases. However, the shocks are negligible in the estimate of $\lambda\left(t\right)$ .

5. Concluding Remarks

This manuscript has many advantages and highlights as follows:

1.

The compatibility of the combination of two common models used in the analysis of data with high frequency was examined and it was seen in the simulation section that these two models can be recovered from each other.
2.

The use of functional data analysis was used as a practical solution for modeling the intensity function of the Poisson process, and the performance of this solution was seen alongside the previous two methods.
3.

Mathematical models were developed to be useful for simulation analysis.