1 Preface

Location: House 6, Lecture Hall - Campus Nord
Time: Tue., 16:00-18:00
Contact: Email
Script: under construction

It is still a great challenge to come up with quantitative assertions about complex biological systems. Aiming for a functional understanding at the cell, tissue or organismic level typically leads to quite involved models. Analytical progress can be made with simplifying assumptions and the restriction to limiting cases.

This course will discuss a potpourri of mathematical methods ranging from analytical techniques to numerical methods and inform the participant on how to apply them to answer biological questions, and to have enormous fun in doing so.

The mathematical techniques encompass stochastic systems, numerical bifurcation analysis, information theory, perturbation theory and Fourier analysis. The biological examples are chosen mostly from neurobiology and sensory ecology, but also from bacterial communication and evolution.

1.1 (Dis)claimer

This scriptum shall evolve. It will have flaws (didactic ones and outright errors), but you, the reader, student, search bot, provide the selective pressure to improve it. Please write to the lecturer if you find fault of any kind with it.

2 The Floquet theory of the action potential

2.1 Periodic event

Of many natural systems we observe only some drastic events.

The clockwork-regular pulses of pulsars [@]

Action potentials
There is a voltage threshold. The dynamics is described by conductance-based equations.
Wake-up time
Melatonin level below threshold (suprachiasmatic nucleus)
Cell division
Gap phase 1 $\to$ synthesis $\to$ gap phase 2 $\to$ mitosis

2.2 Tonic spikes are limit cycles

Many interesting differential equations describing biological dynamics can be cast into the following form

  1. $\dot{\vec x} = \vec F(\vec x,\vec\theta) + \vec\eta(\vec x,t)$

where $\vec x$ is a vector of state variables, e.g., voltage and kinetic variables. $\vec F$ is the time-independent (homogeneous) flow field. $\vec\eta$ is a (often assumed small), possibly time-dependent (inhomogeneous) perturbation to the system. $\vec\theta$ are the system's parameters. The change of solutions in response to variations in the system's parameters will be studied later.

Example: Hodgkin & Huxley equations
From Kirchhoff's law of current conservation at the membrane follows a dynamics for the membrane voltage, $c\dot v+\sum_k^{K}I_k^\mathrm{ion}(v,\vec m_k)={I_{\mathrm{in}}}$. The first term is the capacitive (pseudo) current, followed by a sum of currents from voltage-dependent ion channels and the applied input current ${I_{\mathrm{in}}}$. The ion channel dynamics is governed by kinetic equations, i.e., chemical reactions between open and closed states. The reactions often follow first-order kinetics, ${\underline \tau}_k(v)\dot{\vec m}_k = {\underline M}_k^{(\infty)}(v) - \vec m_k$. ${\underline \tau}_k(v)$ is the matrix of kinetic time constants and ${\underline M}_k^{(\infty)}(v)$ are the steady-state activation curves. ${\underline \tau}$ and ${\underline M}^{(\infty)}(v)$ are typically diagonal if the channels are independent. These equations will be derived more formally in Sec. XX. The state vector is then $\vec x=[v,m_{11},...,m_{K1},...]^{\dagger}$. In the absence of perturbations Eq. (1) becomes a homogeneous equation

  1. $\dot{\vec x} = \vec F(\vec x,\vec\theta)$

and we assume the existence of a ${P}$-periodic limit cycle solution, ${\vec x_{\mathrm{LC}}}(t)={\vec x_{\mathrm{LC}}}(t+{P})$, also known as a periodic orbit.
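To make the example concrete, here is a minimal numerical sketch of Eq. (2) for the classical Hodgkin & Huxley model (the standard 1952 parameter values; the constant input current is an assumed value that puts the model into the tonically spiking regime), integrated with scipy:

    # Minimal sketch: tonic spiking (limit cycle) of the classical Hodgkin-Huxley
    # equations. Parameters are the standard 1952 values; the input current
    # I_in = 10 uA/cm^2 is an assumed value in the spiking regime.
    import numpy as np
    from scipy.integrate import solve_ivp

    C, gNa, gK, gL = 1.0, 120.0, 36.0, 0.3      # uF/cm^2, mS/cm^2
    ENa, EK, EL = 50.0, -77.0, -54.387          # mV
    I_in = 10.0                                  # uA/cm^2 (assumed)

    def rates(v):
        am = 0.1*(v+40)/(1-np.exp(-(v+40)/10)); bm = 4*np.exp(-(v+65)/18)
        ah = 0.07*np.exp(-(v+65)/20);           bh = 1/(1+np.exp(-(v+35)/10))
        an = 0.01*(v+55)/(1-np.exp(-(v+55)/10)); bn = 0.125*np.exp(-(v+65)/80)
        return am, bm, ah, bh, an, bn

    def F(t, x):
        """Homogeneous flow field F(x) of Eq. (2); state x = [v, m, h, n]."""
        v, m, h, n = x
        am, bm, ah, bh, an, bn = rates(v)
        I_ion = gNa*m**3*h*(v-ENa) + gK*n**4*(v-EK) + gL*(v-EL)
        return [(I_in - I_ion)/C, am*(1-m)-bm*m, ah*(1-h)-bh*h, an*(1-n)-bn*n]

    sol = solve_ivp(F, (0, 200), [-65, 0.05, 0.6, 0.32], max_step=0.05)
    v = sol.y[0]
    # crude spike detection (upward crossings of 0 mV) gives the orbit period P
    spikes = sol.t[1:][(v[1:] >= 0) & (v[:-1] < 0)]
    print("approximate period P =", np.diff(spikes)[-1], "ms")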

A neurobiological dogma states that action potentials follow the all-or-nothing principle1. This can be construed to mean that the exact shape of the action potential does not matter2 and that information is stored in the exact spike times3. It also means that the action potential should be stable under perturbations, but that inputs ought to be able to shift the occurrence of spikes in time. If you want, from the dogma and the intent to code into spike times follow two desiderata:

  1. The action potential must be stable in amplitude,
  2. yet neutrally stable in phase.

To wit, desideratum 2 just means that stimulus-induced phase shifts should neither decay nor blow up.

2.3 Limit cycle stability

Stability is probed by studying small perturbations to an invariant-set solution. Our invariant set is the limit cycle (periodic orbit) ${\vec x_{\mathrm{LC}}}$. Assuming there was a small perturbation to the system, the solution can be decomposed as

  1. $\vec x(t)={\vec x_{\mathrm{LC}}}(t)+\vec y(t)$

with $\forall t:\|\vec y(t)\|<\epsilon$ some “small” perturbation to the orbit. What “small”, i.e., $\epsilon$, means we do not want to say now, maybe later, let's see …

Assuming the perturbation was transient (only in initial conditions) and the system is homogeneous again we plug the Ansatz of Eq. (3) into Eq. (2) and get

$\frac{\mathrm{d}}{{\mathrm{d}}t}{\vec x_{\mathrm{LC}}}(t)+\dot{\vec y}(t)=\vec F({\vec x_{\mathrm{LC}}})+\underbrace{\nabla\vec F({\vec x_{\mathrm{LC}}})}_{{{{\underline J}}}({\vec x_{\mathrm{LC}}}(t))} \cdot \vec y(t)$

The Jacobi matrix evaluated on the limit cycle can be written as a function of time, ${{{\underline J}}}(t)=\nabla\vec F({\vec x_{\mathrm{LC}}}(t))$. Note that since the limit cycle solution is ${P}$-periodic, so is ${\underline J}(t)={\underline J}(t+{P})$.

Identifying the limit cycle solution above we are left with the first variational equation of Eq. (2)

  1. $\dot{\vec y}(t)={{{\underline J}}}(t)\vec y(t).$

Hence, one needs to study a linear system with periodic coefficients. One solution of Eq. (4) can be guessed; let us try the time derivative of the orbit, $\frac{{\mathrm{d}}{\vec x_{\mathrm{LC}}}}{{\mathrm{d}}t}$

  1. $\frac{\mathrm{d}}{{\mathrm{d}}t}\Big(\frac{{\mathrm{d}}{\vec x_{\mathrm{LC}}}}{{\mathrm{d}}t}\Big)=\frac{\mathrm{d}}{{\mathrm{d}}t}\vec F({\vec x_{\mathrm{LC}}})=\nabla\vec F({\vec x_{\mathrm{LC}}})\frac{\mathrm{d}}{{\mathrm{d}}t}{{\vec x_{\mathrm{LC}}}}={{{\underline J}}}(t)\frac{{\mathrm{d}}{\vec x_{\mathrm{LC}}}}{{\mathrm{d}}t}.$

So it is a solution all right, and it happens to be a ${P}$-periodic solution. This solution is called the Goldstone mode. But for arbitrary initial conditions not all solutions should be periodic.

Def: Floquet Ansatz
According to Floquet theory the solution to Eq. (4) can be written in the form of a ${P}$-periodic similarity matrix4 and a matrix exponential
  1. $\vec y(t)={\underline R}(t)e^{t{\underline \Lambda}}.$

For the proof please consult (Chicone 2006); here we are just going to work with this as an Ansatz. The constant matrix ${\underline \Lambda}$ is called the Floquet matrix.

Recall the matrix exponential

Def: Matrix Exp
Let ${\underline A}\in{I\!\!\!\!C}^{n\times n}$, then $\exp{\underline A}=\sum_{k=0}^\infty\frac1{k!}{\underline A}^k$. A useful corollary of this definition is that the eigenvectors of an exponentiated matrix are the same as those of the original, and the eigenvalues become exponentiated. If $\lambda_i$, $\vec w_i$ are the eigenvalue/eigenvector pairs of the matrix ${\underline A}$, i.e., ${\underline A}\vec w_i=\lambda_i\vec w_i$, then by using this identity $k$ times
  1. $e^{{\underline A}}\vec w_i=\left(\sum_{k=0}^\infty\frac1{k!}{\underline A}^k\right)\vec w_i=\sum_{k=0}^\infty\frac1{k!}\lambda_i^k\vec w_i=e^{\lambda_i}\vec w_i$
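A quick numerical check of this corollary, using scipy's expm on an arbitrary example matrix:

    # Check: expm(A) has the same eigenvectors as A and eigenvalues exp(lambda_i).
    # The 3x3 matrix is an arbitrary random example.
    import numpy as np
    from scipy.linalg import expm

    rng = np.random.default_rng(0)
    A = rng.normal(size=(3, 3))
    lam, W = np.linalg.eig(A)            # A w_i = lambda_i w_i
    for i in range(3):
        lhs = expm(A) @ W[:, i]
        rhs = np.exp(lam[i]) * W[:, i]
        print(np.allclose(lhs, rhs))     # should print True for every eigenpair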

Inserting the Floquet Ansatz into Eq. (4) yields

$\dot{{\underline R}}e^{t{\underline \Lambda}}+{\underline R}(t){\underline \Lambda} e^{t{\underline \Lambda}}={\underline J}(t){\underline R}(t)e^{t{\underline \Lambda}}.$

Multiplying with $e^{-t{\underline \Lambda}}$ from the right results in a dynamics equation for the similarity matrix ${\underline R}$.

  1. $\dot{{\underline R}}={\underline J}(t){\underline R}(t)-{\underline R}(t){\underline \Lambda}.$

Remember that ${\underline R}$ was invertible, so one can also derive an equation for the inverse

  1. $\frac{\mathrm{d}}{{\mathrm{d}}t}{\underline R}^{-1}=-{\underline R}^{-1}\dot{{\underline R}}\,{\underline R}^{-1}={\underline \Lambda}\,{\underline R}^{-1}(t)-{\underline R}^{-1}(t){\underline J}(t).$
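Numerically, the Floquet matrix is usually obtained by integrating the variational equation, Eq. (4), together with the orbit over one period and diagonalising the resulting monodromy matrix ${\underline R}(P)e^{P{\underline \Lambda}}=e^{P{\underline \Lambda}}$ (since ${\underline R}(P)={\underline R}(0)={\underline I}$). The sketch below does this for the Stuart-Landau oscillator, an assumed stand-in example whose period and Floquet exponents (0 and $-2$) are known in closed form:

    # Sketch: Floquet multipliers from the variational equation, Eq. (4).
    # The Stuart-Landau oscillator (an assumed example) has the limit cycle
    # x_LC = (cos wt, sin wt) with period P = 2*pi/w and exponents 0 and -2.
    import numpy as np
    from scipy.integrate import solve_ivp

    w = 1.0
    P = 2*np.pi/w

    def F(x, y):
        return np.array([x - w*y - x*(x*x+y*y), w*x + y - y*(x*x+y*y)])

    def J(x, y):                       # Jacobian of F evaluated on the orbit
        return np.array([[1-3*x*x-y*y, -w-2*x*y],
                         [ w-2*x*y,     1-x*x-3*y*y]])

    def rhs(t, z):
        x, y = z[:2]
        Y = z[2:].reshape(2, 2)        # fundamental matrix of dot(y) = J(t) y
        return np.concatenate([F(x, y), (J(x, y) @ Y).ravel()])

    z0 = np.concatenate([[1.0, 0.0], np.eye(2).ravel()])   # start on the orbit
    sol = solve_ivp(rhs, (0, P), z0, rtol=1e-10, atol=1e-12)
    M = sol.y[2:, -1].reshape(2, 2)    # monodromy matrix M = Y(P)
    mults = np.linalg.eigvals(M)
    print("Floquet multipliers:", mults)                   # ~ [1, exp(-2P)]
    print("Floquet exponents:", np.log(np.abs(mults))/P)   # ~ [0, -2]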

2.3.1 Eigensystem of the Floquet matrix

The Floquet matrix, ${\underline \Lambda}$, is constant, though not necessarily symmetric. Hence it has a biorthogonal left and right eigensystem

  1. $\Lambda\vec w_i=\mu_i\vec w_i$ and $\vec z_i\Lambda=\mu_i\vec z_i$ with $\vec z_i\cdot\vec w_j=\delta_{ij}$

One can define a “rotated” eigensystem

Def: (“Rotated” eigensystem):
$\vec W_i(t)={\underline R}(t)\vec w_i$ and $\vec Z_i(t)=\vec z_i{\underline R}^{-1}(t)$, which ipso facto obey
  1. $\forall t:\vec Z_i(t)\cdot\vec W_j(t)=\delta_{ij}$.

$\vec W_i(t)$ and $\vec Z_i(t)$ are also called the Floquet modes.

If we project Eqs. (8) and (9) onto the eigenvectors and use these definitions we get

  1. $\frac{\mathrm{d}}{{\mathrm{d}}t}{\vec W_k}=({\underline J}(t)-\mu_k{\underline I})\vec W_k(t)$

and

  1. $\frac{\mathrm{d}}{{\mathrm{d}}t}{\vec Z_k}=(\mu_k{\underline I}-{\underline J}^{\dagger}(t))\vec Z_k(t).$

If one projects the general solution $\vec y(t)$ from Eq. (6) onto the adjoint Floquet modes

$\vec Z_k(t)\cdot\vec y(t)=\vec z_k{\underline R}^{-1}(t){\underline R}(t)e^{t{\underline \Lambda}}=\vec z_ke^{t{\underline \Lambda}}=\vec z_ke^{t\mu_k}$

If $\mu_k<0$ the perturbation decays exponentially in this rotated coordinate frame.

Poincare Section
bla

Note that if $\mu_0=0$, then according to Eq. (5)

  1. $\vec W_0(t)=\frac{\mathrm{d}}{{\mathrm{d}}t}{\vec x_{\mathrm{LC}}}(t)$

is the Goldstone mode, and from Eq. (11)

  1. $\vec Z_0(\phi)\cdot\frac{\mathrm{d}}{{\mathrm{d}}t}{\vec x_{\mathrm{LC}}}(\phi)=1$

2.4 Neutral dimension and phase shifts

The evolution of the phase of Eq. (1) is given by

$\frac{{\mathrm{d}}\phi}{{\mathrm{d}}t}=\nabla\phi(\vec x)\cdot\frac{{\mathrm{d}}\vec x}{{\mathrm{d}}t}$.

To first order this is

$\frac{{\mathrm{d}}\phi}{{\mathrm{d}}t}=\nabla\phi({\vec x_{\mathrm{LC}}})\cdot\frac{{\mathrm{d}}{\vec x_{\mathrm{LC}}}}{{\mathrm{d}}t} =\nabla\phi\cdot\vec F({\vec x_{\mathrm{LC}}})+\nabla\phi\cdot\vec\eta({\vec x_{\mathrm{LC}}},t).$

There are several ways to define a phase (Hilbert transform, linear interpolation, …). A desideratum could be to have a linear phase increase in the unperturbed case ($\vec\eta=0$), say $\phi(t)={f_0}t$. [… proto-phase]. From this desideratum it follows that one must have $\forall t:\nabla\phi\cdot\vec F({\vec x_{\mathrm{LC}}})={f_0}$. Given Eq. (16), this is easily achieved with the following identification

  1. $\nabla\phi=\vec Z_0(\phi){f_0}=\vec Z(\phi)$

The input-output (I/O) equivalent phase oscillator to Eq. (1) can then be written as

  1. $\dot\phi={f_0}+ \vec Z(\phi)\cdot \vec\eta(t)$

A spike train can be written as

  1. $y(t)=\sum_k\delta(t-{t^{\mathrm{sp}}}_k)=\sum_k\delta(\phi(t)-k)$
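A minimal simulation sketch of the phase oscillator, Eq. (17), and the spike train, Eq. (18): here the vector perturbation is reduced to a single scalar input, and the sinusoidal PRC and the noise amplitude are assumptions made purely for illustration:

    # Sketch: Euler-Maruyama integration of the I/O-equivalent phase oscillator
    # (Eq. 17) with an assumed PRC Z(phi) = 1 + sin(2*pi*phi) and white-noise
    # input; a spike (Eq. 18) is emitted whenever phi crosses an integer.
    import numpy as np

    rng = np.random.default_rng(1)
    f0, dt, T = 10.0, 1e-4, 2.0                 # natural rate 10/s, step, duration
    Z = lambda phi: 1.0 + np.sin(2*np.pi*phi)   # assumed phase response curve
    sigma = 0.5                                  # assumed input amplitude

    phi, spikes, t = 0.0, [], 0.0
    while t < T:
        eta = sigma*rng.normal()/np.sqrt(dt)     # white-noise input sample
        phi_new = phi + dt*(f0 + Z(phi % 1.0)*eta)
        if np.floor(phi_new) > np.floor(phi):    # phase passed an integer -> spike
            spikes.append(t)
        phi, t = phi_new, t + dt

    print(len(spikes), "spikes, mean rate =", len(spikes)/T, "1/s")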

3 Fourier theory

To proceed with the analysis we need some results from Fourier theory.

3.1 The Fourier base

One may think of the Fourier transform as a projection onto a new basis $e_\omega(t)=e^{{\mathrm{i}}\omega t}$. Define the projection of a function as

$F(\omega)={\langle}e_\omega\cdot f{\rangle}=\int_{-\infty}^\infty dt\,e_\omega^*(t)f(t)$

The inverse Fourier transform is a projection onto $e_t^*(\omega)$

$f(t)={\langle}e_t^*\cdot F{\rangle}=\int_{-\infty}^\infty d\omega\,e_t(\omega)F(\omega)$

3.2 Existence of the Fourier integral

Usually all transcendental pondering about the existence of mathematical objects is the subject of pure mathematics and should not bother us too much (we trust the work has been done properly). But in the case of the Fourier transform it motivates a subject we need to address, mainly because of $\delta$-functions, which are heavily used in theoretical neurobiology because they are reminiscent of action potentials, or indicate the time of such a spike event. What does it mean for the Fourier transform to exist? It means that the integrals involved in the definition of the Fourier transform converge to finite values. OK. So let us look at the magnitude of the transform of a function $f$. It can be upper-bounded by the triangle inequality for integrals

$|{\langle}e_\omega\cdot f{\rangle}|=\left|\int_{-\infty}^\infty dt\; e^{-{\mathrm{i}}\omega t}f(t)\right| \leqslant \int_{-\infty}^\infty\underbrace{|e^{-{\mathrm{i}}\omega t}|}_{=1} |f(t)|\;dt= \int_{-\infty}^\infty |f(t)|\;dt.$

This means that if one assumes the function $f(t)$ is absolutely integrable,

$\int_{-\infty}^\infty|f(t)|\;dt<\infty,$

then the Fourier integral exists – hurray. Of course, the same works for the integral involved in the inverse Fourier transform. Note that this is an implication in one direction. All functions satisfying absolute integrability are lumped together in $L^1({I\!\!R})$.

The bad news is that applying a Fourier transform to one of the members of $L^1({I\!\!R})$ can throw you out of it. And then? Well, luckily there is the set of Schwartz functions, which is closed under the Fourier transform.

3.2.1 Orthonormality

The issue that absolute integrability is insufficient already manifests when trying to Fourier transform its own basis, since $\int_{-\infty}^\infty dt\,|e^{{\mathrm{i}}\omega t}|=\infty$. Let's give it a name anyway

${\langle}e_\omega\cdot e_\nu{\rangle}=\int_{-\infty}^\infty dt\,e^{{\mathrm{i}}(\nu-\omega)t}=\delta(\nu-\omega)$

It follows that the delta function is symmetric

$\delta(\omega)=\int_{-\infty}^\infty dt\,e^{{\mathrm{i}}\omega t}=\int_{-\infty}^\infty dt\,e^{-{\mathrm{i}}\omega t}=\delta(-\omega)$

Note also that

  1. ${\langle}e_\omega\cdot 1{\rangle}=\delta(\omega)$

3.2.2 Convolution

The convolution of two functions is defined as

$h(t)=(f*g)(t)=\int_{-\infty}^\infty dr\, f(t-r)g(r)$

$H(\omega)=\int dt\,e^{-{\mathrm{i}}\omega t}\int dr\,f(t-r)g(r)=\int dr\,g(r)\int dt\,e^{-{\mathrm{i}}\omega(t+r)}f(t)$

$=\int dr\,g(r)e^{-{\mathrm{i}}\omega r}\int dt\,e^{-{\mathrm{i}}\omega t}f(t)=G(\omega)F(\omega)$
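A quick numerical check of the convolution theorem with the discrete Fourier transform (which realises circular convolution, hence the periodic indices in the explicit sum):

    # Check of the convolution theorem H = F*G using the DFT. The DFT implements
    # *circular* convolution, so we compare against an explicitly computed
    # circular convolution of two random sequences.
    import numpy as np

    rng = np.random.default_rng(2)
    N = 128
    f, g = rng.normal(size=N), rng.normal(size=N)

    # circular convolution h(t) = sum_r f(t-r) g(r), indices taken mod N
    h = np.array([sum(f[(t - r) % N]*g[r] for r in range(N)) for t in range(N)])

    H = np.fft.fft(f)*np.fft.fft(g)              # product of the transforms
    print(np.allclose(h, np.fft.ifft(H).real))   # True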

3.2.3 Derivative

What is $\delta'(t)$?

$(\delta'*f)(t)=\int\limits_{-\infty}^\infty\delta'(r)f(t-r)dr=[\delta(r)f(t-r)]_{r=-\infty}^\infty-\int\limits_{-\infty}^\infty\delta(r)\frac{\mathrm{d}}{{\mathrm{d}}r}f(t-r)dr=\int\limits_{-\infty}^\infty\delta(r)f'(t-r)dr=f'(t)$

Hence

  1. $(\delta'*f)(t)=f'(t)$

4 The continuum limit of a membrane patch

Motivation and aim

In this lecture ion channels are introduced as stochastic devices floating in the membrane of a nerve cell. It should motivate why the analysis techniques and models introduced in this lecture need to deal with fluctuations and noise. (Nerve) cells produce stochastic processes on several levels:

  1. The ion channels in their membrane stochastically jump between conformations, governed by Master equations.

  2. On a more macroscopic level, their membrane voltage fluctuations show properties of coloured noise, well described by diffusion processes and stochastic differential equations.

  3. The trains of action potentials they emit form point processes.

In the stationary state these processes can be subjected to spectral analysis.

Note that in 1952, the first equations describing membrane voltage dynamics were the deterministic rate equations by Hodgkin & Huxley (Hodgkin and Huxley 1952). Only in 1994 did Fox and Lu derive these equations from the continuum limit of an ensemble of stochastic ion channels (Fox and Lu 1994), essentially by doing a diffusion approximation.

Since then the nuances of applying diffusion approximations to neurons have been investigated (Linaro, Storace, and Giugliano 2011, Orio and Soudry (2012)) and reviewed (Goldwyn et al. 2011, Goldwyn and Shea-Brown (2011), Pezo, Soudry, and Orio (2014)).

4.1 The ion channel as a Markov model

Proteins change conformation on various triggering signals:

Such dynamics can be described by a finite state Markov model which is mathematically described as a Master equation.

Starting with a simple ion channel that has an open conformation, $O$, in which ions can pass (say K+ ions), and a closed state, $C$, which blocks ion flow

$O\underset{\beta}{\overset{\alpha}{\rightleftharpoons}}C$

We can define a vector of the probabilities of being open and closed, $\vec p=[p_O,p_C]^{\dagger}$, respectively.

An ionic current produced in the open state would be

$I^\mathrm{ion}=\gamma_{\mathrm K^+}N_O(E_{\mathrm K^+}-v)$

$\gamma_{\mathrm K^+}$ and $E_{\mathrm K^+}$ are the unitary conductance and the Nernst potential respectively. The average such current would be

${\langle}I^\mathrm{ion}{\rangle}=\gamma_{\mathrm K^+}Np_O(E_{\mathrm K^+}-v)$

where NN is the total number of channels in the membrane patch under consideration. But what about a particular stochastic realisation of the current, what about the fluctuations around the mean?

If we have $N$ channels, then the number $k$ of them being open is binomially distributed

$P_O(k)={N\choose k}p_O^kp_C^{N-k}$

In actuality the channels are part of the membrane dynamical system, where $\alpha$ and $\beta$ depend at least on $v$ and hence are not constant during a spike. We need an update rule for how to get from the probability of being open at time $t$ to the probability of being open at time $t+dt$. This is given by

$\begin{pmatrix}p_O(t+dt)\\p_C(t+dt)\end{pmatrix}=\begin{pmatrix}1-\alpha\,dt&\beta\,dt\\\alpha\,dt&1-\beta\,dt\end{pmatrix}\begin{pmatrix}p_O(t)\\p_C(t)\end{pmatrix}$

or in vector form

$\vec p(t+dt)=\left({\underline I}+{\underline Q}\, dt\right)\vec p(t)$

The infinitesimal limit is

$\frac{\mathrm{d}}{{\mathrm{d}}t}\vec p={\underline Q}\vec p$

With $p_C=1-p_O$ we can express the first row of this equation as

$\dot p_O=-\alpha p_O + \beta(1-p_O)$

or

$\tau\dot p_O=p_O^{(\infty)}-p_O$ with $\tau=\frac1{\alpha+\beta}$ and $p_O^{(\infty)}=\frac\beta{\alpha+\beta}$

which, for the initial condition $p_O(0)=0$, has the solution

$p_O(t)=p^{(\infty)}_O(1-e^{-t/\tau})$

Fourier transformation leads to

$\tau{\mathrm{i}}\omega\tilde p(\omega)=p^{(\infty)}_O-\tilde p(\omega)$

or

$\tilde p(\omega)=\frac{p^{(\infty)}}{1+{\mathrm{i}}\tau\omega}$

$|\tilde p(\omega)|^2=\tilde p(\omega)\tilde p^*(\omega)=\frac{\left(p^{(\infty)}\right)^2}{1+(\tau\omega)^2}$

a Lorentzian spectrum. The inverse Fourier transform yields the covariance function

$c(\tau)={\langle}p_O(t)p_O(t+\tau){\rangle}$
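The following sketch simulates the two-state channel under voltage clamp as a telegraph process and compares the estimated spectrum of the open/closed fluctuations with the Lorentzian shape derived above; the rates are assumed values and the overall prefactor depends on the Fourier convention:

    # Sketch: two-state channel O <-> C under voltage clamp as a telegraph process.
    # The spectrum of the fluctuations is compared to a Lorentzian ~ 1/(1+(tau*w)^2).
    # alpha (O->C, closing) and beta (C->O, opening) are assumed rates; the
    # spectral prefactor depends on the Fourier convention used in the text.
    import numpy as np
    from scipy.signal import welch

    rng = np.random.default_rng(3)
    alpha, beta = 40.0, 10.0          # 1/s, assumed
    dt, T = 1e-4, 50.0
    n = int(T/dt)

    state = np.empty(n)               # 1 = open, 0 = closed
    s = 1.0
    for i in range(n):                # small-dt approximation of the jump process
        if s == 1.0 and rng.random() < alpha*dt:
            s = 0.0
        elif s == 0.0 and rng.random() < beta*dt:
            s = 1.0
        state[i] = s

    tau = 1.0/(alpha+beta)
    p_inf = beta/(alpha+beta)
    print("empirical p_O =", state.mean(), " theory =", p_inf)

    f, Pxx = welch(state - state.mean(), fs=1/dt, nperseg=4096)
    lorentz = 1.0/(1.0 + (tau*2*np.pi*f)**2)
    lorentz *= Pxx[1]/lorentz[1]      # match the amplitude at the lowest frequency
    # Pxx and lorentz now have the same shape up to estimation noise.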

4.1.1 The nn-state channel

Let us try to calculate the statistics of the current originating from an $n$-state channel (just like in the two-state case). Why would one do this? The idea, later on, is to be able to find a continuous stochastic process that we can simulate and analyse easily.

Let $K(t)\in[1,...,n]$ be the realisation of the $n$-state Markov channel. For example a K+ channel with four subunits

$1 \underset{\beta}{\overset{4\alpha}{\rightleftharpoons}} 2 \underset{2\beta}{\overset{3\alpha}{\rightleftharpoons}} 3 \underset{3\beta}{\overset{2\alpha}{\rightleftharpoons}} 4 \underset{4\beta}{\overset{\alpha}{\rightleftharpoons}} 5$

Assuming the ion channel has nn conformations, of which one is conducting, let us further define

$G=\delta_{1K(t)}=\begin{cases}1&:K(t)=1\\0&:K(t)>1.\end{cases}$

The single channel current at a voltage clamp $v(t)=v$ is then

  1. $I(t)=\gamma G(t)(E-v).$

How does it evolve? Define $p_{i}(t)=P(K(t)=i)$ and $\vec p(t)=[p_1,...,p_n]^{\dagger}$, then

  1. $\frac{\mathrm{d}}{{\mathrm{d}}t}\vec p={\underline Q}\vec p$

With formal solution

$\vec p(t)=e^{{\underline Q}t}\vec p(0)={\underbrace{(e^{{\underline Q}})}_{{\underline M}}}^t\vec p(0)={\underline M}^t\vec p(0)$

Using the eigendecomposition ${\underline M}={\underline U}\,{\underline \Sigma}\,{\underline V}^{\dagger}$, with ${\underline V}^{\dagger}={\underline U}^{-1}$ containing the left eigenvectors as rows, the matrix power can be written as ${\underline M}^t={\underline U}\,{\underline \Sigma}^t{\underline V}^{\dagger}$. Or (recall Eq. (7))

  1. $\vec p(t)=\sum_{k=1}^n\vec u_k\vec v_k^{\dagger}e^{\nu_kt}\vec p(0)$.

If Eq. (22) has a stationary distribution in the $t\to\infty$ limit, then this must correspond to the eigenvalue $\nu_1=0$ (let us assume they are ordered). So $\frac{\mathrm{d}}{{\mathrm{d}}t}\vec p(\infty)=0$ means that $\vec p(\infty)$ is the right eigenvector $\vec u_1$ belonging to $\nu_1=0$: ${\underline Q}\vec u_1=\nu_1\vec u_1=0$. Therefore, the solution can be written as

  1. $\vec p(t)=\vec p(\infty)+\sum_{k=2}^n\vec u_k\vec v_k^{\dagger}e^{\nu_kt}\vec p(0)$.

The average channel current of Eq. (21) is

${\langle}I(t){\rangle}=\gamma(E-v)\sum_{k=1}^n p_k(t)\delta_{1k}=\gamma(E-v)\Big(p_1(\infty)+\sum_{k=2}^nu_{1k}v_{1k}p_1(0)e^{\nu_kt}\Big),$

which, if the chain is stable ($\nu_k<0\ \forall k>1$), has the steady state

${\langle}I{\rangle}=\lim_{t\to\infty}{\langle}I(t){\rangle}=\gamma(E-v)\,p_1(\infty)$.

The steady-state covariance $C(\Delta)=\lim_{t\to\infty}C_t(\Delta)$ in this case is

$C_t(\Delta)={\langle}I(t)I(t+\Delta){\rangle}-{\langle}I{\rangle}^2=\gamma^2(E-v)^2\sum_{j,k=1}^n\delta_{1j}\delta_{1k}\,p_j(t)p_k(t+\Delta)-{\langle}I{\rangle}^2$

Hence

$C_t(\Delta)=p_1(t)p_1(t+\Delta)-{\langle}I{\rangle}^2=\sum_{i,j=2}^n[\vec u_i \underbrace{\vec v_i^{\dagger}\vec v_j}_{=\delta_{ij}}\vec u_i^{\dagger}]_{11}e^{\nu_it+\nu_j(t+\Delta)}$

If $\nu_k<0\ \forall k>1$ then

  1. $C(\Delta)= \sum_{i=2}^nu_{1i} u_{1i}e^{\nu_i\Delta}$

is a sum of exponentials. The spectral density is a superposition of Lorentzians.

4.1.2 Simulation of a jump process

Do not track individual channels but the channel numbers. Starting in a particular state $\vec N=[N_1,...,N_n]$ at time $t$, the waiting time $\tau$ for staying in that state until $t+\tau$ is distributed as

$f(\tau)=\lambda e^{-\lambda\tau}$

The escape rate of the state (any reaction occurring) is

$\lambda=\sum_{k=1}^nN_ka_k$

where $a_k$ is the rate of leaving state $k$. For example, $a_3$ in the K+ channel is

$a_3=2\beta+2\alpha$

But which reaction did occur? Let $j$ be the reaction (not the state!). For example, there are 8 reactions in the K+ channel. The probability of occurrence associated with any one of them is

$p(j)=\frac{N_j\zeta_j}{\sum_{k=1}^{N_{\mathrm{reac}}}N_k\zeta_k}=N_j\zeta_j/\lambda$

In a computer algorithm one can draw the waiting time for the next reaction simply by generating a uniform random number $r_1\sim U(0,1)$ and then

$\tau\leftarrow\ln(r_1^{-1})/\lambda$

The reaction is determined with a second random number $r_2\sim U(0,1)$ via the cumulative probabilities

$P(j)=\sum_{k=1}^jp(k)$

$j\leftarrow\min\{j:P(j)\geqslant r_2\}$

while t < T
  r1 = rand
  r2 = rand
  tau = ln(1/r1) / lambda
  t = t + tau
  j = smallest j with P(j) >= r2
  apply reaction j to the channel numbers N
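A runnable sketch of this jump-process (Gillespie) simulation for an ensemble of channels of the five-state K+ scheme from Sec. 4.1.1; the rates and the channel count are assumed values:

    # Sketch: Gillespie simulation of N independent 5-state K+ channels,
    # tracking the channel numbers N_1..N_5 (state 1 is the conducting one).
    # alpha, beta and the channel count are assumed values (voltage clamp).
    import numpy as np

    rng = np.random.default_rng(4)
    alpha, beta = 20.0, 30.0                 # 1/s, assumed
    Nch = 100
    # reactions: (source state, target state, rate per channel)
    reactions = [(1, 2, 4*alpha), (2, 3, 3*alpha), (3, 4, 2*alpha), (4, 5, 1*alpha),
                 (2, 1, 1*beta),  (3, 2, 2*beta),  (4, 3, 3*beta),  (5, 4, 4*beta)]

    N = np.array([Nch, 0, 0, 0, 0])          # all channels start in state 1
    t, T = 0.0, 1.0
    times, open_count = [0.0], [N[0]]

    while t < T:
        a = np.array([N[src-1]*rate for src, dst, rate in reactions])  # propensities
        lam = a.sum()                        # total escape rate of the current state
        if lam == 0:
            break
        t += np.log(1/rng.random())/lam      # exponential waiting time with rate lam
        j = np.searchsorted(np.cumsum(a)/lam, rng.random())   # which reaction fired
        src, dst, _ = reactions[j]
        N[src-1] -= 1
        N[dst-1] += 1
        times.append(t); open_count.append(N[0])

    # time-weighted average of the number of open (state-1) channels
    avg = np.sum(np.diff(times)*np.array(open_count[:-1]))/times[-1]
    print("average number of open channels:", avg)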

4.2 Statistically equivalent diffusion process (Ornstein-Uhlenbeck process)

The jump process discussed in the previous sections is a continuous-time Markov process on a discrete domain, $K(t)\in{I\!\!N}$ with $t\in{I\!\!R}$. A diffusion process is a continuous-time Markov process on a continuous domain, $\eta(t)\in{I\!\!R}$ with $t\in{I\!\!R}$.

We are in search of a diffusion process such that

$I=\gamma\left(p_1(\infty) + \sum_{k=2}^n\eta_k(t)\right)(E-v)$

has the same first- and second-order statistics (in voltage clamp) as Eq. (21). Let us try

  1. $\tau_k(v)\dot\eta_k=-\eta_k+\sigma(v)\,\xi(t)$ where ${\langle}\xi(0)\xi(\Delta){\rangle}=\delta(\Delta)$

To solve it, use the Fourier Transform

${\mathrm{i}}\omega\tau\tilde\eta(\omega)=-\tilde\eta(\omega) + \sigma\,\chi(\omega)$

The placeholder symbol $\chi$ was introduced for the Fourier transform of the stochastic process $\xi(t)$. Rearranging yields

$\tilde\eta=\frac{\sigma\chi(\omega)}{1+{\mathrm{i}}\omega\tau}$

The spectrum is

$\tilde \eta(\omega)\tilde \eta^*(\omega)=\frac{\sigma^2\chi(\omega)\chi^*(\omega)}{1+(\tau\omega)^2}$

By definition $\chi(\omega)\chi^*(\omega)$ is the Fourier transform of the covariance function $\delta(\Delta)$, and from Eq. (19) this is one. Hence,

$\tilde \eta(\omega)\tilde \eta^*(\omega)=\frac{\sigma^2}{1+(\tau\omega)^2}$

Applying the inverse Fourier transform results in the correlation function

$C(t)=\frac{\sigma^2}\tau e^{-|t|/\tau}$.

A superposition of independent such OU processes

$\sum_{i=2}^n\eta_i(t)$

leads to a correlation function with the same structure as in Eq. (25). We identify $\tau_i=-1/\nu_i$ and $\sigma_i=u_{1i}$.

The idea of matching the second-order statistics can be formulated in a far more abstract way in terms of the Kramers-Moyal-van-Kampen expansion of the Master equation

$\dot p=\int w(x'\to x)p(x',t)-w(x\to x')p(x,t)\, dx'$

$\frac{\partial}{\partial t}p(x,t)=\sum_{n=1}^\infty\left(-\frac{\partial}{\partial x}\right)^n K_n(x,t)\,p(x,t)$

from

5 Information theory for the living

This theory was never meant to be used to describe living systems, in which meaning, i.e., the content of a message, actually matters. Information theory deals with optimal compression and lossless transmission of signals, irrespective of whether the content is relevant or just a TV programme.

Nonetheless, a look at a quantitative science of communication may be insightful.

Consult []

Since there is a relation between the PRCs introduced in XXX and the filtering properties of a neuron, one can seek to do the same for phase dynamics and ask which PRC would maximise information transmission. But first, one needs to establish how the PRC predicts a lower bound on the mutual information rate. We begin with a short review of the basic tenets of information theory. Within information theory, a neural pathway is treated as a noisy communication channel in which inputs are transformed into neuronal responses and sent on:

5.1 The communication process

The science of communication is concerned with at least two subtopics: $(i)$ the efficient representation of data (compression); and $(ii)$ the safe transmission of data through unreliable channels.

A source (the message) typically runs through the following processing sequence:

Source $\to$ Compress $\to$ Encode $\to$ Channel $\to$ Decode $\to$ Decompress $\to$ Receiver

One of the formal assertions of information theory is that these two problems can be addressed separately (without loss of generality or efficiency): meaning, first one compresses by removing redundancies; then one again adds failsafe redundancy to combat the unreliability.

However convenient for engineering, this does not mean that biological systems have to make use of this fact (source coding and channel coding could be very intertwined).

Also note that many of the mathematical results in information theory are bounds, inequalities not achievable in real systems. To get an idea of the mindset of information theory, check-out Shannon’s source coding theorem.

5.2 Source coding, data compression and efficient representation

Remember Morse code. Why does the character E only have a single symbol ($\cdot$), while the character Q ($--\cdot\,-$) has so many?

The idea could be that highly probable symbols should have the shortest representation, while unlikely ones may occupy more space. What we want to compress is called the source, and data compression depends on the source distribution. This sounds like a good idea for a biological system: do not spend resources on rare events. Well, not quite. Cicadas hear the sounds of their mating partners only once, at the very end of a possibly 19-year-long life.

Example (Genetic code):
Cells code 20 amino acids with words composed of a four-letter5 alphabet $A=\{$A, G, C, T$\}$. Words of length 3 are sufficient, while length 2 is not: $4^2=16<20<64=4^3$. If nature were to require only 16 amino acids, two-character words would be sufficient. Only four to drop: discarding the 4 least probable amino acids to occur in proteins, tryptophan 1.1%, methionine 1.7%, histidine 2.1% and cysteine 2.8%, having no code words for them would make for an error of 7.7%.

In general, we can define a risk $\delta$, i.e., the probability of not having a code word for a letter. Then, to be efficient, one should choose the smallest $\delta$-sufficient subset $S_\delta\subset A$ such that $p(x\in S_\delta)\geqslant1-\delta$.

Def. (Essential bit content):
For an alphabet, $S_\delta$, the essential bit content, i.e., the number of binary questions asked to identify an element of the set, is $H_\delta=\log_2|S_\delta|$

In the case of the genetic code the essential bit content for $\delta = 0.077$ is $H_{0.077}=\log_2 16=4$, or, if we use the base $b=4$ for the logarithm, it is 2, which is the length of the code words required for 16 amino acids.



Take an even simpler example: we have the alphabet $A=\{0,1\}$ with probability distributions A: $\{p_0,p_1\}=\{\frac12,\frac12\}$ and B: $\{p_0,p_1\}=\{\frac34,\frac14\}$. In figure~ you can see a plot of $\frac1NH_\delta$ over the allowed error $\delta$ for words of different length $N$.

For increasing $N$ the curves depend on $\delta$ to a lesser degree. Even more, they converge to a line around the entropy

  1. ${H}=-\sum_k p_k \log_2(p_k)$

which is $H_{\text A}=1$ and $H_{\text B}\approx0.81$.

This is closely related to the concept of the typical set $T^N$. Members of the typical set $(x_1,..,x_N)=\vec x\in T^N$ have $-\frac1N\log p({\vec x})\approx H(x)$. Hence their probability is $p({\vec x})\approx 2^{-N\,H(x)}$, the same for all of them, which implies that the cardinality of the typical set is about $2^{N\,H(x)}$. There holds a kind of law of large numbers (Asymptotic Equipartition Property, AEP) for the typical set, stating that for $N$ i.i.d. random variables ${\vec x}=(x_1,..,x_N)$ with $N$ large, ${\vec x}$ is almost certainly a member of $T^N$ ($p({\vec x}\in T^N)=1-\varepsilon$).

Shannon argued that one should therefore base the compression on the typical set and showed that we can achieve a compression down to NHNH bits.

Asymptotic Equipartition Property (AEP):
For i.i.d. random variables $-\frac1n\log p(X_1,..,X_n)=-\frac1n\sum_i\log p(X_i)\to-{\langle}\log p(X){\rangle}=H(X)$

Note that the i.i.d. assumption is not too restrictive, as we may represent correlated processes in terms of the i.i.d. coefficients of their transform (or the empirical counterparts).

The typical set $T^\varepsilon_n=\{(x_1,..,x_n):|-1/n\log p(x_1,..,x_n)-H(X)|<\varepsilon\}$ allows Shannon's source coding algorithm: the encoder checks whether the input sequence lies within the typical set; if yes, it outputs the index of the input sequence within the typical set; if not, the encoder outputs an arbitrary $n(H+\varepsilon)$-digit number.

By formalising this argument Shannon proved that compression rates up to the source entropy are possible. The converse, that compression below the entropy is impossible, is a bit more involved.
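A small numerical illustration of the AEP for source B from above: the empirical value $-\frac1N\log_2 p(\vec x)$ concentrates around $H\approx0.81$ as $N$ grows:

    # Sketch of the AEP: for i.i.d. sequences from source B ({p0,p1}={3/4,1/4})
    # the quantity -1/N log2 p(x) concentrates around the entropy H ~ 0.81 bits.
    import numpy as np

    rng = np.random.default_rng(5)
    p = np.array([0.75, 0.25])
    H = -(p*np.log2(p)).sum()                       # ~0.811 bits

    for N in (10, 100, 1000):
        x = rng.choice(2, size=(2000, N), p=p)      # 2000 sequences of length N
        logp = (x == 0).sum(axis=1)*np.log2(p[0]) + (x == 1).sum(axis=1)*np.log2(p[1])
        val = -logp/N
        print(N, "mean:", round(val.mean(), 3), "std:", round(val.std(), 3),
              "H:", round(H, 3))
    # the spread around H shrinks ~ 1/sqrt(N): almost all sequences become typical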

5.3 Channel coding

Here we consider a “memoryless” channel:

$\overset{\text{message}}{W}\overset{\text{encode}}{\to}X\ni x\to\overset{\text{noisy channel}}{p(y|x)}\to y\in Y\overset{\text{decode}}{\to}\overset{\text{est. message}}{\hat W}$

The rate $R$ with which information can be transmitted over a channel without loss is measured in $\frac{\text{bits}}{\text{transmission}}$ for a discrete-time channel or $\frac{\text{bits}}{\text{second}}$ for a continuous-time channel. Operationally, we wish all bits that are transmitted to be recovered with negligible probability of error.

A measure of information could be:

The average reduction in the number of binary-questions needed to identify xXx\in X before and after observing yYy\in Y.

This would just be the difference in entropies:

${M}(X;Y)={H}(X)-{H}(X|Y)= -\sum\limits_{x\in X} p(x)\log_2 p(x) + \sum\limits_{x\in X\atop y\in Y}p(x,y)\log_2p(x|y)$.

This goes by the name of mutual information or transinformation. Remember marginalisation

$p(x)=\sum_{y\in Y}p(x,y)$.

So the mutual information is

${M}=-\sum_{x\in X\atop y\in Y} p(x,y)\log_2 p(x) + \sum_{x\in X\atop y\in Y}p(x,y)\log_2\left(\frac{p(x|y)p(y)}{p(y)}\right)$

or

${M}(X;Y)=\sum_{x\in X\atop y\in Y} p(x,y)\log\frac{p(x,y)}{p(x)p(y)}$

From the complete symmetry of this quantity we can also write it as

${M}(X;Y)={H}(Y)-{H}(Y|X)$.

The following Figure illustrates how the mutual information is related to respective (conditional) entropies of the input and output ensemble.


We have heard that $I(X;Y)$ quantifies the statistical dependence of $X$ and $Y$, but how is that related to error-free communication?

$I(X;Y)$ depends on the input ensemble. To focus on the properties of the channel we can simply take an “optimal” input ensemble and define the channel capacity

$C=\max_{p(x)}I(X;Y).$

It will be left to the sender to actually find the optimal input statistics. Note that $I(X;Y)$ is a concave function ($\cap$) of $p(x)$ over a convex set of probabilities $\{p(x_i)\}$ (this is relevant for procedures like the Arimoto-Blahut algorithm for estimating $C$, sketched below), and hence a local maximum is a global maximum.

p = uniform distribution over X
while not converged
  q(x|y) = p(x) p(y|x) / sum_x' p(x') p(y|x')
  p(x) = normalise( exp( sum_y p(y|x) log q(x|y) ) )
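A runnable sketch of this Arimoto-Blahut iteration, checked against the binary symmetric channel, for which the capacity $1-H_2(\varepsilon)$ is known in closed form (the channel and its parameters are assumed test values):

    # Sketch of the Arimoto-Blahut iteration for the capacity C = max_p I(X;Y),
    # verified on the binary symmetric channel with flip probability eps.
    import numpy as np

    def blahut_arimoto(P_yx, tol=1e-12, max_iter=10000):
        """P_yx[x, y] = p(y|x); returns capacity in bits and the optimal input p(x)."""
        nx = P_yx.shape[0]
        p = np.full(nx, 1.0/nx)                      # start from the uniform input
        for _ in range(max_iter):
            q = p[:, None]*P_yx                      # joint p(x)p(y|x)
            q /= q.sum(axis=0, keepdims=True)        # posterior q(x|y)
            logr = (P_yx*np.log(q, where=q > 0, out=np.zeros_like(q))).sum(axis=1)
            r = np.exp(logr)
            r /= r.sum()                             # updated input distribution
            converged = np.max(np.abs(r - p)) < tol
            p = r
            if converged:
                break
        q = p[:, None]*P_yx
        py = q.sum(axis=0)                           # output distribution p(y)
        with np.errstate(divide="ignore", invalid="ignore"):
            I = np.nansum(q*np.log2(P_yx/py))        # I(X;Y) in bits
        return I, p

    eps = 0.1
    P = np.array([[1-eps, eps], [eps, 1-eps]])       # binary symmetric channel
    C, p_opt = blahut_arimoto(P)
    H2 = -(eps*np.log2(eps) + (1-eps)*np.log2(1-eps))
    print(C, 1 - H2, p_opt)                          # capacity matches 1 - H2(eps)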

Shannon established that this capacity indeed measures the maximum amount of error-free information that can be transmitted. A trivial upper bound on the channel capacity is

$C\leqslant\min\{\log|X|,\log|Y|\}$.

This is due to the maximum entropy property of the uniform distribution in the discrete case:

Example (Maximum entropy, discrete case):
For the derivative of the entropy from Eq. (27) one gets $\frac\partial{\partial p_i}{H}=-\log_2 p_i-1$, which (together with the normalisation constraint) leads to $p_i\propto e^{-1}\quad\forall i$. After normalisation one has $p_i=1/N$, so the uniform distribution maximises the entropy in the discrete case.

5.4 Information transmission (continuous case)

Information theory has long been in the discussion as an ecological theory upon which to judge the performance of sensory processing (Atick 1992). This led Joseph Atick, Horace Barlow and others to postulate using this theory to study how nervous systems adapt to the environment. The goal is to make quantitative predictions about what the connectivity in the nervous system and the structure of receptive fields should look like, for instance. In this sense, information theory was hoped to become the basis for an ecological theory of adaptation to environmental statistics (Atick 1992, Barlow (1961)).

Some of the signals in nature (and those applied in the laboratory) are continuous in time and alphabet. Does it make sense to extend the definition of the entropy as

  1. ${H}(x) =-\int {\mathrm{d}}x\,p(x)\ln p(x)$?

Maybe. Let us see how far one gets with this definition. It is called the differential entropy, by the way. Through quantisation it can be related back to the entropy of discrete alphabets.

If $p(x)$ is smooth, then one associates the probability of being in $i\Delta\leqslant x\leqslant(i+1)\Delta$ with

$p_i=p(x_i)\Delta = \int_{i\Delta}^{(i+1)\Delta}{\mathrm{d}}x\,p(x)$

The entropy of the quantised version is

${H}_\Delta=-\sum\limits_{i=-\infty}^\infty p_i\ln p_i=-\sum\limits_{i=-\infty}^\infty \Delta p(x_i)\ln(p(x_i)\Delta)$

$=-\sum\limits_{i=-\infty}^\infty \Delta p(x_i)\ln p(x_i)-\ln\Delta$

This is problematic as the second term goes to infinity for small quantisations. Formally, if $p(x)$ is Riemann integrable, then

$\lim_{\Delta\to0}\left({H}_\Delta+\ln\Delta\right)=-\int {\mathrm{d}}x\,p(x)\ln p(x)$

Since the infinitesimal limit is taken, we can relate $\Delta$ to the number of bits of the quantisation: an $n$-bit quantisation (of a unit range) has $\Delta=2^{-n}$, so that

$-\log_2\Delta= n$.

So an $n$-bit quantisation of a continuous random variable $x$ has entropy (in bits)

${H}(x)+n$.

With the mutual information being the difference of entropies the quantisation term vanishes.

A first example can be taken from rate coding6. In terms of our spike trains from Eq. (18), the instantaneous firing rate can be defined as

Example (Optimal rate coding in the retina (Laughlin 1981)):
In short, Simon Laughlin measured the distribution of light intensities (contrasts) that would impinge on the fly's compound eye in a natural environment: it was more or less normally distributed. He postulated that an efficient allocation of resources (firing rate) would be to spread more likely inputs over a broader range of firing rates. Why? It makes them easier to differentiate by upstream neurons. Via experiments he found that the fly's compound eye approximates the cumulative probability distribution of contrast levels in natural scenes.
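A sketch of the argument: if the contrast ensemble is (assumed) Gaussian, mapping contrasts through their own cumulative distribution function spreads the responses uniformly over the available firing-rate range:

    # Sketch of Laughlin's argument: for an assumed Gaussian contrast ensemble,
    # the response curve that equalises the use of the firing-rate range is the
    # cumulative distribution function of the contrasts.
    import numpy as np
    from scipy.stats import norm

    rng = np.random.default_rng(6)
    mu, sd = 0.0, 0.3                       # assumed natural-scene contrast statistics
    c = rng.normal(mu, sd, size=100000)     # contrast ensemble

    r_max = 100.0                           # assumed maximal firing rate (1/s)
    rate = r_max*norm.cdf(c, loc=mu, scale=sd)   # CDF as the contrast-response curve

    # the histogram of the used rates is approximately flat: every rate bin is
    # used equally often, a maximum entropy allocation for a bounded output
    counts, _ = np.histogram(rate, bins=10, range=(0, r_max))
    print(counts/counts.sum())              # ~0.1 in every bin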

Waterfilling argument
Let us pause and ask: What is the maximum entropy distribution for a continuous alphabet?

Example (Maxent, continuous with variance constraint):
Take a random variable $x\in{I\!\!R}$. It cannot be the uniform distribution as in the discrete case. In fact we need additional constraints. For example, one may ask for the maximum entropy distribution, $p(x)$, given a fixed mean, $\mu$, and variance, $\sigma^2$. Using Lagrange multipliers to formulate the constrained optimisation problem,

$C=H+\lambda_0\int{\mathrm{d}}x\,p(x)+\lambda_1\left(\int{\mathrm{d}}x\,p(x)x-\mu\right)+\lambda_2\left(\int{\mathrm{d}}x\,p(x)(x-\mu)^2-\sigma^2\right)$

one searches for the $p(x)$ that fulfils $\delta C/\delta p(x)=0$ and $\partial C/\partial\lambda_i=0$, where $\delta/\delta p(x)$ is the variational derivative. One finds

$p(x)=\exp(\lambda_0-1+\lambda_1x+\lambda_2(x-\mu)^2)$.

With the normalisation from $\partial C/\partial\lambda_i=0$ we get the normal distribution ($e^{\lambda_0-1}=1/\sqrt{2\pi\sigma^2}$, $\lambda_1=0$ and $\lambda_2=-1/(2\sigma^2)$).

What did we learn? Well

  1. $H(x)\leqslant H_\text{Gauss}(x)$.

If we consider the whole spike train $y(t)=\sum_k\delta(t-{t^{\mathrm{sp}}}_k)$ (see Eq. (18)) as the output, not just its mean input intensity and mean firing rate, we have a continuous, time-dependent stochastic process to deal with. Note that the definition of the spike train, if integrated, is related to the empirical distribution function.

If we average over infinitely many trials we get the instantaneous firing rate $r(t)={\langle}y(t){\rangle}_{y|x}$. We will give a mathematical definition of $r(t)$ later. Our communication channel looks like this:

$x(t)\in{I\!\!R}$ (input signal) $\to$ pathway $\to$ $y(t)\in{I\!\!R}$ (neural response)

The entropy rate of an ergodic process can be defined as the entropy of the process at a given time conditioned on its past realisations, in the limit of large time

  1. $H[x]=\lim_{t\to\infty}{H}[x_t|x_\tau:\tau<t].$

The amount of information a neural pathway transmits about an input signal $x(t)$ is measured by the mutual information rate,

  1. ${M}[x,y]={H}[y]-\underbrace{{H}[y|x]}_\text{encoding}= {H}[x]-\underbrace{{H}[x|y]}_\text{decoding},$

between the stochastic process, x(t)x(t), and the stochastic response process, y(t)y(t). The entropy rate H{H} measures the number of discriminable input or output states, either by themselves, or conditioned on other variables.

The mutual information rate, which is the difference between unconditional and conditional entropy rates, characterises the number of input states that can be distinguished upon observing the output. The response entropy rate ${H}[y]$, for instance, quantifies the number of typical responses per unit time, while ${H}[x|y]$ is a measure of the decoding noise in the model. If this noise is zero, then the mutual information rate is simply ${H}[x]$, provided that this is finite.

The conditional entropy rates H[y|x]{H}[y|x] and H[x|y]{H}[x|y], characterising the noise in the encoding and decoding model respectively, are each greater than zero. In information theory, these quantities are also called equivocation. Hence, both the stimulus and response entropy rates, H[x]{H}[x] and H[y]{H}[y], are upper bounds for the transmitted information.

Example (Info rate continuous discrete-time Gaussian process):
The OU process from Eq. (26) is an example of a Gaussian process. Take a discretised version, $\vec x=[x(t_1),x(t_2),...]$, of it such that

$p(\vec x)=|2\pi{\underline K}|^{-1/2}\exp\left(-\frac12(\vec x-\vec\mu)^{\dagger}{\underline K}^{-1}(\vec x-\vec\mu)\right)$

$H_n=\frac12\ln\left((2\pi)^n|{\underline K}|\right) + \frac n2$

This is similar to the quantisation problem. It might be reasonable to drop the $\frac n2$ term (sometimes this is done, sometimes not). For the one-dimensional case we have

$H=\frac12(1+\ln2\pi\sigma^2)$

or, if we drop the $\frac n2$ term (here $n=1$) and solve for the variance,

  1. $\sigma^2=\frac1{2\pi}e^{2H_\text{Gauss}(x)}$

Any practical consequences?

Def. (Estimation error):
For a random variable $x$ the estimation error of an estimator $\hat x$ is

${\langle}(x-\hat x)^2{\rangle}$

The best estimator is the mean, so the statisticians say. Therefore a lower bound to the estimation error is given by

  1. (xx^)2(xx)2=σ2=12πe2HGauss(x)12πe2H(x){\langle}(x-\hat x)^2{\rangle}\geqslant{\langle}(x-{\langle}x{\rangle})^2{\rangle}=\sigma^2 =\frac1{2\pi}e^{2H_\text{Gauss}(x)}\geqslant\frac1{2\pi}e^{2H(x)}.

The lasst inequality followed from Eq. (29).

Example (Info rate continuous continuous-time Gaussian process):

Up to an additive constant the entropy of a multivariate Gaussian was78

$H=\frac12\ln|{\underline K}|=\frac12\tr\ln{\underline K}=\frac12\tr\ln{\underline \Lambda}=\frac12\sum_k\ln\lambda_k$.

First let us observe the process forever, $\vec x=[x(-\infty),...,x(\infty)]$, a bi-infinite series with countable elements. The elements of the covariance matrix are ${\underline K}_{ij}=c(i\,dt-j\,dt)$. The orthogonal eigenfunctions of the continuous covariance operator on $t\in{I\!\!R}$ are the Fourier bases. It can be shown that in the continuum limit

$H=\int df\,\ln\lambda(f)$

The result is due to Kolmogorov see also (???,Golshani and Pasha (2010)).

5.5 Linear stimulus reconstruction and a lower bound on the information rate (decoding view)

Without a complete probabilistic description of the model the mutual information cannot be calculated. And even with a model, the involved integrals may not be tractable. At least two strategies to estimate it exist, though: either create a statistical ensemble of inputs and outputs by stimulation, followed by (histogram-based) estimation techniques for the mutual information; or find bounds on the information that can be evaluated more easily. In general, the estimation of mutual information from empirical data is difficult, as the sample size should be much larger than the size of the alphabet. Indeed, each element of the alphabet should be sampled multiple times so that the underlying statistical distribution can, in principle, be accurately estimated. But this prerequisite is often violated, so some techniques of estimating the information from data directly rely on extrapolation (???). The problem becomes particularly hairy when the alphabet is continuous or a temporal process has to be discretised, resulting in large alphabets.

Another approach, which will allow us to perform a theoretical analysis of phase dynamics, relies on a comparison of the neuronal “channel” to the continuous Gaussian channel (???, Cpt.~13), which is analytically solvable (Cover and Thomas 2012). The approach can be used to estimate the information transmission of neuronal models (???). Also experimental systems have been analysed in this way, e.g.:

  1. the spider’s stretch receptor (???);
  2. the electric sense of weakly electric fish (???) and paddle fish (???);
  3. the posterior canal afferents in the turtle (???).

It was proven that this method leads to a guaranteed lower bound on the actual information transmitted (???).

If one has experimental control over the stimulus ensemble, it can be chosen to be a Gaussian process with a flat spectrum up to a cutoff, so as not to introduce biases for certain frequency bands. The mutual information between stimulus $x(t)$ and response $y(t)$ can be bounded from below as

  1. ${M}[x,y] = {H}[x] - {H}[x|y] \geqslant {H}[x] - {H}_\text{gauss}[x|y],$

Here, ${H}_\text{gauss}[x|y]$ is the equivocation of a process with the same mean and covariance structure as the original decoding noise, but with Gaussian statistics. The conditional entropy of the stimulus given the response is also called reconstruction noise entropy. It reflects the uncertainty remaining about the stimulus when particular responses have been observed.

It turns out that the inequality in Eq. (33) also holds if the estimator is conditioned. Say from the output of the neuron we estimate its input

x^(t)=x^t[y].\hat x(t) = \hat x_t[y].

So if the process has a stationary variance

(x(t)x^(t))2x|yinfx^(x(t)x^(t))2x|y=(x(t)x(t)x|y)2x|y=e2Hgauss[x|y].{\langle}(x(t)-\hat x(t))^2{\rangle}_{x|y}\geqslant\inf\limits_{\hat x}{\langle}(x(t)-\hat x(t))^2{\rangle}_{x|y}={\langle}(x(t)-{\langle}x(t){\rangle}_{x|y})^2{\rangle}_{x|y}=e^{2{H}_\text{gauss}[x|y]}.

The second line uses the fact that in this case the optimal estimator is given by the conditional mean. We have the following bound on the equivocation

  1. ${H}[x|y] \leqslant {H}_\text{gauss}[x|y] \leqslant {{\textstyle\frac12}}\ln{\langle}(x(t)-\hat x(t))^2{\rangle}\leqslant\ln{\langle}n^2(t){\rangle},$

The deviation between stimulus and its estimate, n(t)=x(t)x^(t)n(t)=x(t)-\hat x(t), is treated as the noise process.

In order to obtain a tight bound, the estimator $\hat x(t)$ should be as close to optimal as possible. For the case of additional information given by the response of the neural system $y(t)$ to the process $x(t)$, the estimator should make use of it, $\hat x_t[y]$. For simplicity one can assume it is carried out by a filtering operation, $\hat x(t)= (f*y)(t)$, specified later (Gabbiani and Koch 1998). Like the whole system, the noise process is stationary, with power spectral density $P_{nn}(\omega)$, so that

${H}_\text{gauss}[x|y]\leqslant {{\textstyle\frac12}}\ln{\langle}n^2(t){\rangle}={{\textstyle\frac12}}\int_{-\infty}^\infty\frac{{\mathrm{d}}\omega}{2\pi}\ln P_{nn}(\omega).$

Together

  1. ${M}[x,y]\geqslant \frac12\int_{-\infty}^\infty\frac{{\mathrm{d}}\omega}{2\pi}\;\ln\left(\frac{P_{xx}(\omega)}{P_{nn}(\omega)}\right)$

So as to render the inequality in Eq. (35) as tight a bound as possible one should use the optimal reconstruction filter from $y$ to $\hat x$. In other words, it is necessary to extract as much information about $x$ from the spike train $y$ as possible.

The next step should be to find an expression for the noise spectrum, $P_{nn}(\omega)$, based on the idea of ideal reconstruction of the stimulus. As opposed to the forward filter, the reconstruction filter depends on the stimulus statistics (even without effects such as adaptation). We seek the filter $h$ that minimises the mean square error

  1. ${\langle}n^2(t) {\rangle}= {\langle}(x(t) - \hat x(t))^2{\rangle}$, with $\hat x(t)=\int{\mathrm{d}}\tau\,h(\tau)\, y(t-\tau)$.

Taking the variational derivative of the error w.r.t. the filter (coefficients) $h(\tau)$ and equating this to zero, one obtains the orthogonality condition for the optimal Wiener filter (???)

  1. ${\langle}n(t)\, y(t-\tau){\rangle}= 0$, $\forall\tau$.

Inserting the definition of the error, $n(t)=x(t)-\hat x(t)$, into Eq. (38) yields

${\langle}x(t)y(t-\tau){\rangle}-\int{\mathrm{d}}\tau_1\,h(\tau_1) {\langle}y(t-\tau_1)y(t-\tau){\rangle}= R_{xy}(\tau) - (h\ast R_{yy})(\tau) = 0$

In order to obtain hh we need to deconvolve the equation, which amounts to a division in the Fourier domain

  1. $P_{xy}(\omega)= H(\omega) P_{yy}(\omega)\Longrightarrow H^\text{opt}(\omega) = \frac{P_{xy}(\omega)}{P_{yy}(\omega)}$.

To compute the mutual information rate, we now calculate the full auto-correlation of the noise when the filter is given by Eq. (39). For an arbitrary filter h(t)h(t), we have

$R_{nn}(\tau)={\langle}n(t)n(t+\tau){\rangle}={\langle}n(t)x(t+\tau){\rangle}-\int{\mathrm{d}}\tau_1\,h(\tau_1) {\langle}n(t)y(t+\tau-\tau_1){\rangle}.$

When the orthogonality condition of Eq. (38) holds, the right-most term vanishes. Proceeding by expanding the first term algebraically leads to an expression for the noise correlations

$R_{nn}(\tau)={\langle}n(t)x(t+\tau){\rangle}=R_{xx}(\tau)-\int{\mathrm{d}}\tau_1\,h(\tau_1)R_{xy}(\tau-\tau_1).$

This expression can be Fourier transformed in order to obtain the required noise spectrum

$P_{nn}(\omega)=P_{xx}(\omega)-H(\omega)P_{xy}(\omega)=P_{xx}(\omega)-\frac{|P_{xy}(\omega)|^2}{P_{yy}(\omega)},$

where the definition of the optimal filter, Eq. (39), was utilised. This result can then be inserted into Eq. (36) to obtain the following well known bound on the information rate (???,lindner2005j:mi,holden1976b,stein1972j:coherenceInfo)

  1. $\mathcal M[x,y]\geqslant-\frac12\int_{-\omega_c}^{\omega_c}\frac{{\mathrm{d}}\omega}{2\pi}\ln\left(1-\frac{|P_{xy}(\omega)|^2}{P_{xx}(\omega)P_{yy}(\omega)}\right)$.

This information bound involves only spectra and cross-spectra of the communication channel's input and output processes, which are experimentally measurable in macroscopic recordings. The channel, in this case the neuron, can remain a black box. But since we can bridge the divide between microscopic, biophysical models and their filtering properties, we will, in the following section, derive the mutual information rates.

Def. (spectral coherence):
The expression in Eq. (40) is termed the squared signal response coherence
  1. ${c}^2(\omega)=\frac{|P_{xy}(\omega)|^2}{P_{xx}(\omega)P_{yy}(\omega)}$.

It quantifies the linearity of the relation between $x$ and $y$, in the sense that it equals 1 if there is no noise and a linear filter transforms input into output. Both nonlinearities and noise reduce the coherence. The coherence can be estimated from data using the FFT algorithm and spectrum estimation. It is implemented in the free software packages scipy and matplotlib.
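A sketch of this estimation with scipy.signal.coherence, applied to an assumed toy encoder (a linear filter plus additive noise, not a model from this script), together with a numerical evaluation of the information bound of Eq. (40):

    # Sketch: estimate the squared coherence c^2(f) with scipy and evaluate the
    # coherence-based lower bound on the information rate. The "neuron" is an
    # assumed toy encoder (first-order low-pass filter plus Gaussian noise).
    import numpy as np
    from scipy.signal import coherence, lfilter

    rng = np.random.default_rng(7)
    fs, T = 1000.0, 200.0                       # sampling rate (Hz), duration (s)
    n = int(fs*T)
    x = rng.normal(size=n)                      # broadband Gaussian stimulus
    y = lfilter([1.0], [1.0, -0.9], x)          # toy encoding: low-pass filtering
    y += 2.0*rng.normal(size=n)                 # encoding noise

    f, C2 = coherence(x, y, fs=fs, nperseg=1024)   # squared coherence c^2(f)
    # lower bound on the information rate in bits per second (rectangle rule
    # over the one-sided frequency axis)
    rate = -np.sum(np.log2(1.0 - C2))*(f[1] - f[0])
    print("coherence-based bound:", rate, "bits/s")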

What renders the coherence a useful quantity? While the cross-spectrum tells us whether stimulus and output have correlated power in a spectral band, the normalisation with the output auto-spectrum can be crucial. Say we find a particularly high power in $P_{xy}(\omega)$; this may not be related to the stimulus but could just be the intrinsic frequency of the neuron itself.

The coherence is a quantity that makes no mention of the explicit decoding filter; in fact it is symmetric in input and output, just as the mutual information is. This is beneficial because one can now take the encoding view in the next chapter.

6 Linear response filter

The stimulus spectral density is given by the environment or controlled by the experimental setup, while cross- and output spectra need to be measured or calculated from the model in question. In this lecture, cross-spectral and spike-train spectral densities are derived from phase oscillators, see Eq. (17), that are in turn derived from biophysical models. This means we do not treat the channel as a black box but assume a particular model.

The first quantity we need in order to calculate Eq. (41) is the cross-spectrum. On the one hand it is the Fourier transform of the cross-correlation; on the other it can be written as an average of Fourier transforms (FT and average are linear operations).

  1. $P_{yx}(\omega)={\langle}{\langle}{{\tilde y}}(\omega){{\tilde x}}^*(\omega){\rangle}_{y|x}{\rangle}_x= {\langle}{\langle}{{\tilde y}}(\omega){\rangle}_{y|x}{{\tilde x}}^*(\omega){\rangle}_x.$

What has happened here? The cross-spectrum can be obtained by averaging the Fourier-transformed quantities over trials and over the stimulus ensemble. The average can be split into the conditional average over trials, ${\langle}\cdot{\rangle}_{y|x}$, given a fixed, frozen stimulus, and the average over the stimulus ensemble, ${\langle}\cdot{\rangle}_x$. The former is essentially an average over the encoding noise (Chacron, Lindner, and Longtin 2004, Lindner, Chacron, and Longtin (2005)).
Observe that ${\langle}\tilde y(\omega){\rangle}_{y|x}$ is the Fourier transform of the trial-averaged firing rate conditional on a frozen stimulus

$r(t) = {\langle}y(t) {\rangle}_{y|x}.$

Thus, it is sufficient to derive a filter that maps input x(t)x(t) to a firing rate, not an individual spike train.

Def. (forward, encoding filter):
Let g(t)g(t) be the filter kernel that maps stimulus into instantaneous firing rate
  1. $r(t) = (g*x)(t) = \int_{-\infty}^t{\mathrm{d}}r\, g(r)x(t-r)$

The filter is causal, since it is implemented by a differential equation and the Laplace transform yields

  1. $R(s) = G(s)X(s)$,

where G(s)G(s) denotes the transfer function of the encoding filter.

With this definition the cross-spectrum is written as

  1. $P_{yx}(\omega)={\langle}{\langle}{{\tilde y}}(\omega){\rangle}_{y|x}{{\tilde x}}^*(\omega){\rangle}_x = G({\mathrm{i}}\omega)P_{xx}(\omega).$

This shows us that although we are computing the cross-spectrum of stimulus and spike train, the response filter $G({\mathrm{i}}\omega)$ for the instantaneous firing rate suffices. This simple relation reminds us of the fact that the cross-spectrum is not really a second-order quantity, but can be exactly determined by linear response theory. The spike train spectrum $P_{yy}(\omega)$, on the other hand, is truly a second-order quantity, viz. the Fourier transform of the auto-covariance, and cannot be related to the linear response filter without further approximations.

6.1 Instantaneous firing rate in the continuum limit

The instantaneous firing rate can be estimated via binning and trial averaging:

$4\Delta\,r(k\Delta)$: 0 3 1 0 0 0 0 3 1 0 0 0 1 2 1 0
Trial 1: 0 1 0 0 0 0 0 1 0 0 0 0 0 1 0 0
Trial 2: 0 0 1 0 0 0 0 1 0 0 0 0 0 0 1 0
Trial 3: 0 1 0 0 0 0 0 0 1 0 0 0 0 1 0 0
Trial 4: 0 1 0 0 0 0 0 1 0 0 0 0 1 0 0 0
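In code, the binning estimate of the table reads as follows (the bin width $\Delta$ is an assumed value):

    # Sketch: instantaneous firing rate r(k*Delta) estimated by binning and trial
    # averaging, using exactly the 4 trials of the table above.
    import numpy as np

    Delta = 0.005                     # bin width in seconds (assumed)
    trials = np.array([[0,1,0,0,0,0,0,1,0,0,0,0,0,1,0,0],
                       [0,0,1,0,0,0,0,1,0,0,0,0,0,0,1,0],
                       [0,1,0,0,0,0,0,0,1,0,0,0,0,1,0,0],
                       [0,1,0,0,0,0,0,1,0,0,0,0,1,0,0,0]])

    counts = trials.sum(axis=0)          # the row labelled 4*Delta*r(k*Delta)
    r = counts/(trials.shape[0]*Delta)   # spikes per second in each bin
    print(counts)
    print(r)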

Two equivalent descriptions9 of Markov dynamics exist

  1. the path view,
  1. $\dot{x}= u(x,t) = f(x) + g(x)\xi(t)$;
  2. the ensemble view,

$\dot{p}(x,t)=\,?$

For (i) one can simulate many paths of a stochastic differential equation, with different initial conditions and noise realisations. Histograms can then provide the ensemble statistics. But it is also possible to find an evolution equation for the whole ensemble.

The relation between the two can be formally established by the

Def. (empirical measure):
Given a stochastic path realisation, $x(t)$, of Eq. (46), the empirical measure is
  1. $\varrho(y,t) = \delta(x(t)-y)$.

With all the lovely properties of a δ\delta-function.

The probability density

  1. $p(y,t) = {\langle}\varrho(y,t){\rangle},$

where the average is over all paths, $x(t)$, and therefore over all realisations of the noise process $\xi(t)$.

The chain rule yields

t𝜚(y,t)=ẋ(t)x𝜚(y,t)=y𝜚(y,t)u(x(t),t)\frac\partial{\partial t}\varrho(y,t)=\dot x(t)\frac\partial{\partial x} \varrho(y,t) =-\frac\partial{\partial y}\varrho(y,t)u(x(t),t)

Solving such a PDE involves time integration or other integral transformations (Fourier and Laplace transforms). Since

dyδ(x(t)y)f(y)=f(x(t))=dyδ(x(t)y)f(x(t))\int{\mathrm{d}}y\,\delta(x(t)-y)f(y)=f(x(t))=\int{\mathrm{d}}y\,\delta(x(t)-y)f(x(t))

Therefore

  1. t𝜚(y,t)=y𝜚(y,t)u(y,t)=yf(y)𝜚(y,t)yg(y)ξ(t)𝜚(y,t)\frac\partial{\partial t}\varrho(y,t)=-\frac\partial{\partial y}\varrho(y,t)u(y,t) =-\frac\partial{\partial y}f(y)\varrho(y,t)-\frac\partial{\partial y}g(y)\xi(t)\varrho(y,t)

Averaging on both sides results in

tp(y,t)=yf(y)p(y,t)yg(y)ξ(t)𝜚(y,t)\frac\partial{\partial t}p(y,t)=-\frac\partial{\partial y}f(y)p(y,t)-\frac\partial{\partial y}g(y){\langle}\xi(t)\varrho(y,t){\rangle}.

The correlation between a stochastic process ξ(t)\xi(t) and a nonlinear functional of it is given by the Novikov-Furutsu-Donsker formula

  1. {\langle}\xi(t)\varrho(y,t){\rangle}=\frac12{\langle}\frac{\delta\varrho}{\delta\xi(t)}{\rangle}=-\frac12\frac\partial{\partial y}g(y)p(y,t)

All together we have the

Def. (Fokker-Planck equation):
The FPE corresponding to Eq. (46) is
  1. tp(y,t)=12yg(y)yg(y)p(y,t)yf(y)p(y,t)\frac\partial{\partial t}p(y,t)=\frac12\frac\partial{\partial y}g(y)\frac\partial{\partial y}g(y)p(y,t)-\frac\partial{\partial y}f(y)p(y,t).

This is a diffusion equation. It can be written in the form

  1. tp(y,t)=yJ(y,t)\frac\partial{\partial t}p(y,t)=-\frac\partial{\partial y}J(y,t).

J(y,t)=f(y)p(y,t)12g(y)yg(y)p(y,t)J(y,t)=f(y)p(y,t)-\frac12g(y)\frac\partial{\partial y}g(y)p(y,t)

One needs boundary conditions and initial conditions to solve this PDE.
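A minimal finite-difference sketch of how such an initial-boundary value problem could be integrated (assuming numpy): an explicit Euler step of the flux form above, with the Stratonovich flux and reflecting (zero-flux) boundaries. Drift, noise amplitude, grid and time step are arbitrary choices, and the explicit scheme is only stable for sufficiently small dt.

```python
import numpy as np

def fpe_step(p, y, dt, f, g):
    """One explicit Euler step of dp/dt = -dJ/dy with J = f p - (1/2) g d/dy (g p),
    zero flux at both ends (reflecting boundaries)."""
    dy = y[1] - y[0]
    dgp = np.gradient(g(y) * p, dy)                   # d/dy (g p)
    J = f(y) * p - 0.5 * g(y) * dgp                   # probability flux at the nodes
    J_edge = np.concatenate(([0.0], 0.5 * (J[1:] + J[:-1]), [0.0]))
    return p - dt * np.diff(J_edge) / dy              # conserves total probability

y = np.linspace(-3, 3, 301)
p = np.exp(-(y - 1.0) ** 2 / 0.02); p /= np.trapz(p, y)   # narrow initial density
f_drift = lambda y: -y                                    # placeholder drift
g_diff = lambda y: 0.5 * np.ones_like(y)                  # placeholder noise amplitude
dt = 1e-4
for _ in range(20_000):                                   # integrate to t = 2
    p = fpe_step(p, y, dt, f_drift, g_diff)
```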

6.2 Phase flux = firing rate

For convenience rewrite the I/O-equivalent phase oscillator from Eq. (17) as

  1. ϕ̇=f0+z(ϕ)x(t)+σξ(t)\dot\phi={f_0}+z(\phi)x(t) + \sigma\,\xi(t).

Here, as opposed to Eq. (17), Z(ϕ)η(ϕ,t)\vec Z(\phi)\cdot\vec\eta(\phi,t) was split into the part that results from the presented stimulus, now denoted x(t)x(t), and the part that originates from, e.g., intrinsic noise. From Eq. (26) the perturbation vector has

η=(x(t)σ1(v(ϕ))ξ1(t))\vec\eta = \begin{pmatrix} x(t)\\\sigma_1(v(\phi))\xi_1(t)\\\vdots\end{pmatrix}.

As long as the intrinsic noise is fast enough compared to the stimulus an averaging can be applied10 to obtain an effective diffusion

σ2=01dϕiσi2(vLC(ϕ))\sigma^2 = \int_0^1{\mathrm{d}}\phi\sum_i\sigma_i^2(v_\mathrm{LC}(\phi)),

which enters Eq. (53). The benefit is that the corresponding Fokker-Planck equation

  1. tp(ϕ,t)=σ222ϕ2p(ϕ,t)ϕ(f0+Z(ϕ)x(t))p(ϕ,t)=ϕJ(ϕ,t)\frac\partial{\partial t}p(\phi,t)=\frac{\sigma^2}2\frac{\partial^2}{\partial\phi^2}p(\phi,t)-\frac{\partial}{\partial\phi}({f_0}+Z(\phi)x(t))p(\phi,t)=-\frac\partial{\partial\phi}J(\phi,t)

is tractable in a perturbation expansion. But first, remember the goal: identification of the forward filter g(t)g(t) in r(t)=tdrg(r)x(tr)r(t)=\int_{-\infty}^t{\mathrm{d}}r\,g(r)x(t-r).

The firing rate is the flux of the phase density through the firing threshold ϕ=1:

r(t)=J(1,t)=(f0+Z(ϕ)x(t))p(ϕ,t)σ22ϕp(ϕ,t)|ϕ=1r(t)=J(1,t)=({f_0}+Z(\phi)x(t))p(\phi,t)-\frac{\sigma^2}2\frac\partial{\partial\phi}p(\phi,t)\Big|_{\phi=1}

The equation is solved with the following perturbation ansatz

  1. p(ϕ,t)=p0(ϕ)+ipi(ϕ,t)p(\phi,t)=p_0(\phi) + \sum_i p_i(\phi,t),

with the normalisation requirement

  1. dϕp0(ϕ)=1\int{\mathrm{d}}\phi\,p_0(\phi)=1 and iIN,tIR:dϕpi(ϕ,t)=0\forall i\in{I\!\!N},t\in{I\!\!R}:\int{\mathrm{d}}\phi\,p_i(\phi,t)=0.

The pi(ϕ,t)p_i(\phi,t) are assumed to be small correction terms, given that the stimulus x(t)x(t) is also small.

At the same time one gets

J(ϕ,t)=J0(ϕ)+iJi(ϕ,t)J(\phi,t) = J_0(\phi) + \sum_iJ_i(\phi,t)

In perturbation theory one solves iteratively the equations of the same order

O(0)O(0):
For the lowest order the stimulus is set to x(t)=0x(t)=0 and the FPE is

ṗ0=σ222ϕ2p0f0ϕp0\dot p_0 = \frac{\sigma^2}2\frac{\partial^2}{\partial\phi^2}p_0-{f_0}\frac\partial{\partial\phi}p_0.

This equation is homogeneous so we find a time independent steady state solution

p0(ϕ)=1p_0(\phi)=1.

One may test this by back insertion. Here both the boundary conditions and Eq. (56) are enforced. The solution can be inserted into the definition of the flux to obtain the zeroth order flux

J0=f0p0=f0J_0 = f_0p_0 = f_0

The next order involves the time-dependent stimulus

O(x)O(x):
Note that multiplying two terms of order O(x)O(x) yields a term of order O(x2)O(x^2), which is discarded. One is left with

ṗ1=σ222ϕ2p1f0ϕp1p0x(t)ϕZ(ϕ)\dot p_1= \frac{\sigma^2}2\frac{\partial^2}{\partial\phi^2}p_1 -{f_0}\frac{\partial}{\partial\phi}p_1 -p_0x(t)\frac{\partial}{\partial\phi}Z(\phi)

To turn the PDE into an algebraic equation one can apply both a Fourier series expansion and the Laplace transform. For this the Laplace transform of the stimulus is denoted as X(s)X(s) and the periodic function is expanded as Z(ϕ)=k=ckei2πkϕZ(\phi)=\sum_{k=-\infty}^\infty c_ke^{{\mathrm{i}}2\pi k\phi}

sP1(k,s)=(2πkσ)22P1(k,s)f0i2πkP1(k,s)X(s)i2πkcksP_1(k,s)=-\frac{(2\pi k\sigma)^2}{2}P_1(k,s)-{f_0}{\mathrm{i}}2\pi kP_1(k,s)-X(s){\mathrm{i}}2\pi kc_k

Solving for P1(ϕ,s)P_1(\phi,s)

P1(ϕ,s)=k=P1(k,s)ei2πkϕ=k=i2πkckX(s)ei2πkϕs+(2πkσ)2/2+i2πkf0P_1(\phi,s)=\sum_{k=-\infty}^\infty P_1(k,s)e^{{\mathrm{i}}2\pi k\phi}=\sum_{k=-\infty}^\infty\frac{{\mathrm{i}}2\pi kc_kX(s)e^{{\mathrm{i}}2\pi k\phi}}{s+(2\pi k\sigma)^2/2 + {\mathrm{i}}2\pi k{f_0}}

For brevity define the pole νk=(2πkσ)2/2i2πkf0\nu_k=-(2\pi k\sigma)^2/2-{\mathrm{i}}2\pi k{f_0}

The first order flux is

J1(k,s)=f0P1(k,s)+ckX(s)i2πkσ22P1(k,s)J_1(k,s)={f_0}P_1(k,s)+c_kX(s)-\frac{{\mathrm{i}}2\pi k\sigma^2}2P_1(k,s)

=f0P1(k,s)+sνki2πkP1(k,s)i2πkσ22P1(k,s)={f_0}P_1(k,s)+\frac{s-\nu_k}{{\mathrm{i}}2\pi k}P_1(k,s)-\frac{{\mathrm{i}}2\pi k\sigma^2}2P_1(k,s)

and

i2πkJ1(k,s)=i2πkf0P1(k,s)+(2πkσ)22P1(k,s)+(sνk)P1(k,s)=sP1(k,s){\mathrm{i}}2\pi kJ_1(k,s)={\mathrm{i}}2\pi k{f_0}P_1(k,s)+\frac{(2\pi k\sigma)^2}2P_1(k,s)+(s-\nu_k)P_1(k,s)=sP_1(k,s)

J1(1,s)=k=scksνkX(s)J_1(1,s)=\sum_{k=-\infty}^\infty\frac{sc_k}{s-\nu_k}X(s)

Happily and consistently one finds

G(s)=k=scksνkG(s)=\sum_{k=-\infty}^\infty\frac{sc_k}{s-\nu_k}

The frequency response corresponds to evaluating the transfer function on the imaginary axis, G(iω)G({\mathrm{i}}\omega). The low frequency limit is

limω0G(iω)=c0=Z(ϕ)\lim_{\omega\to0}G({\mathrm{i}}\omega)=c_0={\langle}Z(\phi){\rangle}.

With ck=ak+ibkc_k=a_k+{\mathrm{i}}b_k, the high frequency limit is

limωG(iω)=k=ak=Z(0)\lim_{\omega\to\infty}G({\mathrm{i}}\omega)=\sum_{k=-\infty}^\infty a_k=Z(0).
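A small sketch evaluating this filter (assuming numpy), here for the SNLC-type PRC Z(ϕ)=1−cos2πϕ, i.e. c₀=1 and c₊₁=c₋₁=−1/2; the values of f₀ and σ are arbitrary. The printed values illustrate the two limits derived above.

```python
import numpy as np

f0, sigma = 10.0, 0.3                        # firing rate and effective noise (assumed)
c = {0: 1.0, 1: -0.5, -1: -0.5}              # Fourier coefficients of Z = 1 - cos(2*pi*phi)

def nu(k):
    """Pole nu_k = -(2 pi k sigma)^2/2 - i 2 pi k f0 from the text."""
    return -(2 * np.pi * k * sigma) ** 2 / 2 - 1j * 2 * np.pi * k * f0

def G(omega):
    """Linear response filter G(i omega) = sum_k i omega c_k / (i omega - nu_k)."""
    s = 1j * omega
    return sum(s * ck / (s - nu(k)) for k, ck in c.items())

w = 2 * np.pi * np.logspace(-2, 4, 400)
gain = np.abs(G(w))
print(G(1e-6), G(1e8))   # low-frequency limit ~ c0 = <Z>, high-frequency limit ~ Z(0) = 0
```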

7 Numerical continuation of fixpoints and orbits

In the past lectures functional consequences (e.g. filter properties) were derived from the phase response curve of neurons. The PRC's particular shape (e.g. its mean or its value at ϕ=0) has consequences for what the neuron can do computationally. Next we need to gain some insight into what a PRC looks like in particular dynamical regimes. These regimes are reached by changing some system parameter, e.g. the membrane's leak conductance or capacitance, the composition of ion channels or their kinetic properties.

Often numerical solutions to nonlinear ordinary differential equations are found by (forward) time-integration. An interesting alternative is to track a found solution through parameter space, for which the solution must be persistent. If it is not, then a bifurcation occurs and one observes a qualitative change of the solution.

For a book on bifurcation theory consult (Izhikevich 2007), and for numerical bifurcation analysis (Kielhöfer 2011).

7.1 Continuation of fixed points

Assume for

ẋ=f(x,p)\dot{\vec x}=\vec f(\vec x,p)

there is a steady state solution

  1. f(x,p)=0\vec f(\vec x, p)=0

The fixpoint solution x(p)\vec x(p) depends on the parameter pp. The existence of the solution as a function of the parameter is governed by the implicit function theorem.

Implicit function theorem
Consider a system of equations

\vec f(\vec x,p)=0, with \vec f\in I\!\!R^n, \vec x\in I\!\!R^n, p\in I\!\!R and \nabla_{x,p}\vec f\in I\!\!R^{n\times(n+1)}.

Let \vec f be smooth near \vec x. Then, if the Jacobian \nabla_x\vec f is nonsingular, there exists a unique, smooth solution family \vec x(p) such that

f(x(p),p)=0\vec f(\vec x(p),p)=0.

This establishes the existence of lines in a bifurcation diagram.
The method of continuation is a predictor-corrector method. In practice, assume the fixpoint is known for one particular parameter value p0p_0; then, for a small change in parameter, the solution is predicted.

Predictor step:
Taylor’s linearisation
  1. x(p+δp)x(p)+δpxp\vec x(p+\delta p)\approx\vec x(p)+\delta p\frac{\partial\vec x}{\partial p}.

To predict the new solution, the old solution is required and the derivative of the solution w.r.t. the parameter that is changed. How to compute the latter? Take the total derivative of Eq. (57) w.r.t. the parameter

(xf)xp+fp=0(\nabla_xf)\frac{\partial\vec x}{\partial p}+\frac{\partial\vec f}{\partial p}=0.

with formal solution

xp=(xf)1fp\frac{\partial\vec x}{\partial p}=-(\nabla_x\vec f)^{-1} \frac{\partial\vec f}{\partial p}.

If xf\nabla_x\vec f is full rank one can use some efficient linear algebra library to find the vector xp\frac{\partial\vec x}{\partial p} and back insert it into Eq. (58). For too large δp\delta p the predicted solution will be wrong. Yet, it is a good initial guess from which to find the correct version.

Corrector step:
Newton iterations to find the root of f(x,p)\vec f(\vec x,p)
  1. xn+1=xn(xf)1f(xn,p)\vec x_{n+1}=\vec x_n-(\nabla_x\vec f)^{-1}\vec f(\vec x_n,p)

Note that Newton's iterations are also obtained by linearisation

0=f(xn+1,p)=f(xn)+(xf)(xn+1xn)0=\vec f(\vec x_{n+1},p)=\vec f(x_n)+(\nabla_x\vec f)(\vec x_{n+1}-x_{n}),

which if solved for xn+1\vec x_{n+1} yields Eq. (59).

Convergence analysis of Newton iterations shows that with each Newton iteration the number of correct decimal places doubles. Hence a low number of iterations (3-7) often suffices to achieve sufficient numerical accuracy.

Example:
Say f(x,p)=x2+p21f(x,p)=\sqrt{x^2+p^2}-1, then the solution branches are x(p)=±1p2x(p)=\pm\sqrt{1-p^2}. The linear stability analysis xf=xx2+p2\partial_xf=\frac x{\sqrt{x^2+p^2}} shows that

x>0xf>0x>0\to\partial_xf>0\to unstable
x<0xf<0x<0\to\partial_xf<0\to stable

Also \partial_pf=\frac p{\sqrt{x^2+p^2}} and thus \partial_px=-\frac px=\frac{-p}{\pm\sqrt{1-p^2}}. What happens at p=±1p=\pm1 or x=0x=0?

Numerical continuation

Or more generally: What happens if we find that the condition number of f\nabla\vec f explodes?

In the example above the branch switches its stability and bends back in a fold or turning point. In general, folds can occur with and without stability change.
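A minimal sketch of the predictor-corrector idea on the example above (assuming numpy). It uses natural-parameter continuation, so it is expected to break down when ∂ₓf→0 near the fold at p=±1, which is exactly the situation that motivates arclength continuation; step size and iteration counts are arbitrary choices.

```python
import numpy as np

def f(x, p):   return np.sqrt(x**2 + p**2) - 1.0     # the example above
def fx(x, p):  return x / np.sqrt(x**2 + p**2)
def fp(x, p):  return p / np.sqrt(x**2 + p**2)

def continue_branch(x, p, dp=0.02, n_steps=40, newton_iter=5):
    """Natural-parameter predictor-corrector continuation of f(x,p) = 0."""
    branch = [(p, x)]
    for _ in range(n_steps):
        dxdp = -fp(x, p) / fx(x, p)        # tangent from implicit differentiation
        p, x = p + dp, x + dp * dxdp       # predictor: Taylor step in the parameter
        for _ in range(newton_iter):       # corrector: Newton iterations in x at fixed p
            x -= f(x, p) / fx(x, p)
        branch.append((p, x))
    return np.array(branch)

upper = continue_branch(x=1.0, p=0.0)      # follows x = +sqrt(1 - p^2) up to p = 0.8
# pushing n_steps/dp towards p = 1 makes fx -> 0 and the corrector ill-conditioned,
# i.e. the condition number explodes at the fold
```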

7.2 Local bifurcations: What happens if f\nabla\vec f is singular?

Bifurcation analysis is one big “case discrimination”.

7.2.1 Folds and one-dimensional nullspaces

Recall some definitions

Def. (nullspace, kernel):
The kernel or nullspace of a matrix J̲{\underline J} is

N(J̲)={xIRn|J̲x=0}N({\underline J})=\{\vec x\in I\!\!R^n|{\underline J}\vec x=0\}

Def. (range, image):
The range of a matrix J̲{\underline J} is

R(J̲)={y|xIRn:J̲x=y}R({\underline J})=\{\vec y|\exists x\in I\!\!R^n:{\underline J}\vec x=\vec y\}

Def:
Let QQ be the projection onto the range
Def:
Eigensystem at the fold. Let the Jacobian matrix be J̲=f(x(0),p(0)){\underline J}=\nabla\vec f(\vec x(0),p(0))

J̲rk=λkrk{\underline J}\vec r_k=\lambda_k\vec r_k and lkJ̲=λklk\vec l_k{\underline J}=\lambda_k\vec l_k.

So that the nullspace is spanned by \vec r_0 and the left nullspace by \vec l_0.

This section considers dimN(J̲)=1\dim N({\underline J})=1. Hence, the implicit function theorem is not applicable. From the example above it is apparent that x(p)=\vec x'(p)=\infty at the bifurcation. The problem can be circumvented by defining a new “arclength parameter”, p(s)p(s). The bifurcation branch is then a parametric curve, (x(s),p(s))(\vec x(s),p(s)). Without loss of generality the bifurcation is taken to be at s=0s=0.

If the Jacobian matrix J̲=f{\underline J}=\nabla\vec f is rank-deficient the Lyapunov-Schmidt reduction can be applied. Intuitively, the problem is reduced from a high-dimensional, possibly infinite-dimensional, one to one that has as many dimensions as the defect of f\nabla\vec f.

The nullspace is spanned by the eigenvector, r0r_0 of J̲{\underline J}, corresponding to the eigenvalue 0.

Assume that ff is twice differentiable w.r.t. x\vec x, then differentiate f(x(s),p(s))=0\vec f(\vec x(s),p(s))=0 w.r.t. ss and evaluate at s=0s=0

ddsf(x(s),p(s))=xfx(s)+pfp(s)=J̲x(s)+pfp(s)\frac{\mathrm{d}}{{\mathrm{d}}s}\vec f(\vec x(s),p(s))=\nabla_x\vec fx'(s)+\partial_p\vec fp'(s)={\underline J}\vec x'(s) + \partial_p\vec fp'(s)

Let H̲=xf{\underline H}=\nabla\nabla_x\vec f be the Hessian tensor.

\frac{{\mathrm{d}}^2}{{\mathrm{d}}s^2}\vec f={\underline H}\vec x'(s)\vec x'(s)+{\underline J}\vec x''(s)+2\,\partial_p\nabla_x\vec f\,\vec x'(s)\,p'(s)+\partial^2_p\vec f\,(p'(s))^2+\partial_p\vec f\,p''(s)=0

Projecting the first-derivative equation onto the left eigenvector \vec l_0 of the eigenvalue 0 gives \vec l_0\cdot\partial_p\vec f\,p'(0)=0; since \partial_p\vec f\not\in R({\underline J}) this implies p'(0)=0, and then {\underline J}\vec x'(0)=0, so \vec x'(0) can be normalised to \vec r_0. Projecting the second-derivative equation onto \vec l_0 at s=0 one finds

l0H̲r0r0+l0pfp(0)=0\vec l_0{\underline H}\vec r_0\vec r_0 + \vec l_0\partial_p\vec fp''(0)=0

or with pfR(J̲)\partial_p\vec f\not\in R({\underline J})

p(0)=l0H̲r0r0l0pfp''(0)=-\frac{\vec l_0{\underline H}\vec r_0\vec r_0}{\vec l_0\partial_p\vec f}.

This is a test function of whether the bifurcation is

  1. subcritical (p''(0)<0)
  2. supercritical (p''(0)>0)
  3. transcritical

Folds

Def:
Let PP be the projection onto the null space

There are several cases to be distinguished.

The fold:
If the rank deficiency is one-dimensional, dimN(J̲)=1\dim N({\underline J})=1.
The Andronov-Hopf:
If the rank deficiency is two-dimensional.

7.3 Stability exchange

At a simple fold one eigenvalue is zero. Study the eigensystem of J̲=f(x(s),p(s)){\underline J}=\nabla\vec f(\vec x(s),p(s)) near s=0s=0

J̲(r0+w(s))=λ(s)(r0+w(s)){\underline J}(\vec r_0+\vec w(s)) = \lambda(s)(\vec r_0+\vec w(s)).

With a bit of analysis

λ(0)=l0pf(x(0),p(0))p(0)\lambda'(0)=-\vec l_0\cdot\partial_p\vec f(\vec x(0),p(0))p''(0)

7.3.1 Extended system

Continue the extended system (linear and nonlinear)

f(x(p),p)=0\vec f(\vec x(p),p)=0

f(x,p)w=0\nabla\vec f(\vec x,p)\vec w=0

(f(x,p))𝖳z=0(\nabla\vec f(\vec x,p))^{{\mathsf{T}}}\vec z=0

7.4 Continuation of boundary value problems and periodic orbits

The same procedure as above can be applied to the continuation of periodic orbits and boundary value problems. Define the implicit function to encompass the time derivative

g(x,p)=ddtxTf(x,p)=0\vec g(\vec x,p)=\frac{\mathrm{d}}{{\mathrm{d}}t}\vec x-T\vec f(\vec x,p)=0, with t[0,1]t\in[0,1].

Then proceed as above. Note that the time derivative d/dt{\mathrm{d}}/{\mathrm{d}}t is a linear operator which has a matrix representation just like the Jacobian. In that sense

xddtx=ddtI̲\nabla_x\frac{\mathrm{d}}{{\mathrm{d}}t}\vec x=\frac{\mathrm{d}}{{\mathrm{d}}t}{\underline I}

8 PRC near the centre manifold

8.1 Dynamics in the centre manifold of a saddle-node

At arbitrary parameters the periodic solution of the adjoint of the first variational equation on the limit cycle yields the PRC. It can be calculated numerically with the continuation method. Near a bifurcation, however, if major parts of the dynamics happen in the centre manifold, the PRC can be calculated analytically. As an example take the saddle-node on limit cycle bifurcation (SNLC). The spike in this case is a homoclinic orbit to a saddle-node, which enters and leaves via the semi-stable (centre) manifold that is associated with the eigenvalue λ0=0\lambda_0=0 of the Jacobian at the saddle.

Saddle-node on a limit cycle (SNLC). The dynamics on the stable manifold is fast ẋ=λ1x\dot x=\lambda_1x, while the dynamics along the centre subspace is slow ẋ=bx2\dot x=bx^2.

  1. ẋ=f(x)+pg(x)\dot{\vec x}=\vec f(\vec x)+p\,\vec g(\vec x)

(cv̇τṅ)=(II(n,v)n(v)n){c\dot v\choose\tau\dot n}={I-I(n,v)\choose n_\infty(v)-n}

Let there be a saddle-node bifurcation at some value p0p_0.

Expanding the right-hand side around the saddle-node fixpoint, x0\vec x_0, yields (Ermentrout 1996)

f(x)=J̲(xx0)+H̲(xx0)(xx0)+...\vec f(\vec x)={\underline J}(\vec x-\vec x_0)+{\underline H}(\vec x-\vec x_0)(\vec x-\vec x_0)+...

Establish the eigensystem at the saddle-node J̲=f(x0){\underline J}=\nabla\vec f(\vec x_0)

lkJ̲=λklk\vec l_k{\underline J}=\lambda_k\vec l_k and J̲rk=λkrk{\underline J}\vec r_k=\lambda_k\vec r_k with ljrk=δjkl_j\cdot r_k=\delta_{jk}.

By assumption of a saddle-node the Jacobian has a simple zero with an associated eigenvector.

Def (centre subspace):
The subspace spanned by r0\vec r_0 is called the centre subspace or slow subspace.

Write the dynamics around the saddle-node as x(t)=x0+yr0\vec x(t) = \vec x_0 + y\vec r_0. Then Eq. (60) is

x0̇+ẏr0=yJ̲r0+pg(x0)+y2H̲r0r0\dot{\vec{x}_0}+\dot y\vec r_0=y{\underline J}\vec r_0+p\vec g(\vec x_0)+y^2{\underline H}\vec r_0\vec r_0.

Projecting this equation onto the left eigenvector l0\vec l_0 yields the isolated dynamics along the centre manifold:

  1. ẏ=a+by2\dot y = a + by^2 with a=pl0g(x0)a=p\vec l_0\cdot\vec g(\vec x_0) and b=l0H̲r0r0b=\vec l_0{\underline H}\vec r_0\vec r_0.
Note:
In the literature yy is often suggestively written as vv, assuming the quadratic dynamics is along the voltage dimension. However, it can be shown that the centre manifold of a conductance-based neuron is never parallel to the voltage dimension.

Solution of Eq (61)

The centre manifold is only tangential to the spiking limit cycle dynamics near the saddle. Although the proportion of time spent near the saddle is large, at some point the trajectories depart, so the reduction is only locally valid.

The formal solution of Eq. (61) with initial condition y(0)=y(0)=-\infty is

  1. y(t)=abtan(abtπ/2)y(t)=\sqrt{\frac ab}\tan(\sqrt{ab}\,t-\pi/2).

Away from the saddle-node the slow manifold accelerates again and the variable yy diverges as the argument of the tangent approaches π/2\pi/2. This blowing up is like a spike. So for the SNLC centre manifold one can think of the spike at y=y=\infty and a reset to y=y=-\infty. The time it takes from y=y=-\infty to y=y=\infty is finite and given by

Tp=πab{{T_\text{p}}}=\frac\pi{\sqrt{ab}}.

Note that for y(0)=0y(0)=0 it is

  1. y(t)=abtan(abt)y(t)=\sqrt{\frac ab}\tan(\sqrt{ab}\,t)

and

Tp=π2ab{{T_\text{p}}}=\frac\pi{2\sqrt{ab}}.

The bifurcation parameter enters in aa. If p\,\vec g(\vec x)=[c^{-1}I,0,0,...]^{{\mathsf{T}}}, then the firing rate has the typical square-root scaling

f0=1Tp=π1bl00c1(II0){f_0}=\frac1{{T_\text{p}}}=\pi^{-1}\sqrt{b\vec l_{00}c^{-1}(I-I_0)}.

Let us check this numerically in the practical.
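A minimal sketch of such a check (assuming numpy): the quadratic normal form ẏ=a+by² is integrated from a finite reset value to a finite “spike” value and the passage time is compared with T_p=π/√(ab). The reset/spike values, the step size and the values of a are arbitrary; the finite integration window introduces small corrections relative to the ±∞ limits.

```python
import numpy as np

def period_quadratic(a, b, y_reset=-50.0, y_spike=50.0, dt=1e-4):
    """Euler-integrate y' = a + b y**2 from reset to 'spike' and return the elapsed time."""
    y, t = y_reset, 0.0
    while y < y_spike:
        y += (a + b * y**2) * dt
        t += dt
    return t

b = 1.0
for a in [0.01, 0.05, 0.2, 1.0]:            # a plays the role of c^-1 (I - I0)
    T_num = period_quadratic(a, b)
    T_theory = np.pi / np.sqrt(a * b)       # exact for reset/spike at -/+ infinity
    print(a, T_num, T_theory)               # square-root scaling f0 = 1/T ~ sqrt(a)
```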

8.2 PRC near the centre manifold

PRCs of simple 1D reset-models like the quadratic dynamics of Eq. (61) can be calculated. The PRC is defined as the phase gradient (Winfree 2001; Kuramoto 1984) with respect to the state variables. In the unperturbed case we have ϕ̇=f0\dot\phi=f_0, where f0f_0 is the frequency. The PRC is

Z(\phi)=\frac{\partial\phi}{\partial y}=\dot\phi\frac{\partial t}{\partial y}=\frac{{f_0}}{\dot y}=\frac{{f_0}}{a+by^2(\phi/{f_0})}.

Inserting the solution with y(0)=y(0)=-\infty from above yields

Z(\phi)=\frac{{f_0}}{a+a\tan^2(\pi\phi-\pi/2)}=a^{-1}{f_0}\cos^2(\pi\phi-\pi/2)

=a^{-1}{f_0}\sin^2(\pi\phi)=\frac{{f_0}}{2a}(1-\cos(2\pi\phi)).

Note that aa depends on the bifurcation parameter, yet bb does not. Hence it may be preferable to write

Z_\mathrm{SNLC}(\phi)=\frac b{2\pi^2{f_0}}(1-\cos(2\pi\phi)).

A similar calculation at the saddle-node loop bifurcation yields

Z_\mathrm{SNL}(\phi)=\frac b{2\pi^2{f_0}}(1-\cos(\pi(1+\phi))).

9 Phase susceptibility to channel noise near saddle-node loop bifurcations

The deterministic part of the conductance-based model neuron is

{\dot v\choose\dot a_i}=\vec F(v,\vec a)

The Jacobian J̲=F{\underline J}=\nabla\vec F has the eigensystem lkJ̲=λklk\vec l_k{\underline J} = \lambda_k\vec l_k and J̲rk=λkrk{\underline J}\vec r_k = \lambda_k\vec r_k, with lkrj=δkj\vec l_k\cdot\vec r_j=\delta_{kj}.

Proposition 1
For neuron models at a saddle-node on invariant cycle bifurcation, i.e., with a simple zero eigenvalue, the centre (slow) manifold is given by

r0=(1ddva,k(v))\vec r_0={1\choose\frac{\mathrm{d}}{{\mathrm{d}}v}a_{\infty,k}(v)}.

For the usual strictly increasing activation curves, a,k(v)>0a_{\infty,k}'(v)>0, the centre manifold is not parallel to the voltage or any other state direction.

Proof 1
The special structure of the Jacobian in a conductance-based neuron model, a rank-1 updated diagonal matrix, allows for an explicit calculation of the eigenvector \vec r_0 corresponding to the eigenvalue λ0=0\lambda_0=0. This gives the direction of the centre manifold. At the stationary point this is r0=(1a,k(v))\vec r_0={1\choose a_{\infty,k}'(v)}. As long as the activation functions are strictly monotone functions of the voltage, a,k(v)>0a_{\infty,k}'(v)>0, the centre manifold has a non-zero component in this gating direction.

The PRC for perturbations along the centre manifold of the SNIC bifurcation has long been known. In order to understand perturbations originating from the gating dynamics of different ion channels we need the PRC for arbitrary perturbations.

Proposition 2
The vector of phase-response curves at a SNIC bifurcation is given by

Z(ϕ)=f12(1cos2πϕ)l0\vec Z(\phi)=f^{-\frac12}(1-\cos2\pi\phi)\vec l_0.

The PRCs in all dimensions have the same shape, but their relative scaling differs.

Proof 2
See (Ermentrout and Kopell 1986; Ermentrout 1996; Brown, Moehlis, and Holmes 2004; Ermentrout, Glass, and Oldeman 2012) for a derivation of the phase response curve along the centre manifold of the SNIC. Due to Prop. 1 all dimensions will have the typical quadratic slowing near the saddle-node and can therefore be arbitrarily slower than any other state direction. A perturbation ek\vec e_k along the kthk^\mathrm{th} state direction has, according to Prop. 1, always components along the centre manifold (l0ek)r0(\vec l_0\cdot\vec e_k)\vec r_0. All other directions are exponentially contractive and fast compared to the slow dynamics on the centre manifold. A perturbation p\vec p can therefore be decomposed into

j(ljp)rj\sum_j(\vec l_j\cdot\vec p)\vec r_j.

Along the stable manifolds, λj<0\lambda_j<0, all (ljp)eλt(\vec l_j\cdot\vec p)\sim e^{\lambda t}. Compared to the slow dynamics along the centre manifold the exponential reset back to the limit cycle is instantaneous and, hence, does not affect the phase. It is known that along the centre manifold the quadratic dynamics yields a phase model with a phase-response curve that has the functional form 1cos2πϕ1-\cos2\pi\phi.

Theorem 3
For both the SNIC and the SNL bifurcations, the peak of the PRC resides at the saddle-node, i.e.,

dZdϕ(ϕ)|ϕ=ϕSN=0\frac{dZ}{d\phi}(\phi)|_{\phi=\phi_\mathrm{SN}}=0.

Proof 3
In the limit f0f\to0 the isochron density can be made arbitrarily high near the saddle-node. The phase susceptibility in this region will be maximal.

The following holds not only for homoclinic orbits approaching the saddle-node via the semistable centre manifold (the SNIC case), but for any homoclinic orbit that approaches via any of the stable manifolds.

The Floquet modes are periodic solutions of the first variational system on the limit cycle

  1. Wk(ϕ)=(J̲(ϕ)λkI̲)Wk(ϕ)\vec W_k'(\phi)=({\underline J}(\phi)-\lambda_k{\underline I})\vec W_k(\phi) and Zk(ϕ)=Zk(ϕ)(λkI̲J̲(ϕ))\vec Z_k'(\phi)=\vec Z_k(\phi)(\lambda_k{\underline I}-{\underline J}(\phi))
Theorem 4
For homoclinic orbits attached to a saddle-node the tangent plane to the isochrons at the saddle is spanned by the stable manifolds of the saddle, i.e.
  1. J(ϕSN)Wk(ϕSN)=λkWk(ϕSN)J(\phi_\mathrm{SN})W_k(\phi_\mathrm{SN})=\lambda_kW_k(\phi_\mathrm{SN}).

Hence, Wk(ϕSN)\vec W_k(\phi_\mathrm{SN}) is an eigenvector to the Jacobian. The tangent space to the isochrons is thus T={rk:J̲SNrk=λkrkλk<0}T=\{\vec r_k : {\underline J}_\mathrm{SN}\vec r_k=\lambda_k\vec r_k\wedge\lambda_k<0\}. The PRC is then Z(ϕSN)T\vec Z(\phi_{\mathrm{SN}})\in T^\bot.

Proof 4
For this one shows that the linearised isochron at ϕ=ϕSN\phi=\phi_\mathrm{SN} is a solution to the right eigenvalue problem of the Jacobian at the saddle. Since according to Thm. 3 the maximum of Z\vec Z resides at ϕ=ϕSN\phi=\phi_\mathrm{SN}, Eq. (65) follows immediately from Eq. (64) with k=0k=0.

10 Event time series and phase descriptions

10.1 Synchronisation and phase locking

If biological oscillators interact, their emitted event time series may synchronise. Start with conductance-based models

  1. \dot{\vec x}_i=\vec f(\vec x_i) + \vec G(\vec x_i,\vec x_j)

and couple them with synapses

G(xi,xj)=gsyn(vj)(viEsyn)G(x_i,x_j)=g_\mathrm{syn}(v_j)(v_i-E_\mathrm{syn}).

In Eq (17) the I/O equivalent phase oscillator was derived. Take two phase oscillators ii and jj that are I/O equivalent to Eq (66)

ϕ̇i=fi+Z(ϕi)G(ϕi,ϕj)\dot\phi_i = f_i + Z(\phi_i)G(\phi_i,\phi_j)

ϕ̇j=fj+Z(ϕj)G(ϕj,ϕi)\dot\phi_j = f_j + Z(\phi_j)G(\phi_j,\phi_i)

Def (phase locking)
Two oscillators ii and jj are phase locked (a form of synchronisation) if their phase difference ψij(t)=ϕi(t)ϕj(t)\psi_{ij}(t)=\phi_i(t)-\phi_j(t) is constant in time. This implies there should be a fixpoint ψ̇=0\dot\psi=0.

Two small (n=5n=5) all-to-all coupled networks, showing a fixed phase relation (SNL) and no locking (SNIC).

Def (phase locking index)
Phase locking can be quantified by evaluating the following temporal average

Lij=ei(ϕi(t)ϕj(t))L_{ij}={\langle}e^{{\mathrm{i}}(\phi_i(t)-\phi_j(t))}{\rangle}.

The evolution of the phase difference is

  1. ψ̇=Δf+Z(ϕj+ψij)G(ϕj+ψij,ϕj)Z(ϕj)G(ϕj,ϕj+ψij)\dot\psi=\Delta f+Z(\phi_j+\psi_{ij})G(\phi_j+\psi_{ij},\phi_j)-Z(\phi_j)G(\phi_j,\phi_j+\psi_{ij}).
Def (frequency detuning)
The difference in intrinsic frequencies of the oscillators Δf\Delta f is called frequency detuning.

If the frequency detuning Δf\Delta f is small, then ψ\psi is a much slower variable than ϕi\phi_i and ϕj\phi_j. Therefore, the variable ϕj\phi_j in Eq. (67) traverses many periods before ψ\psi changes. In other words: ψ\psi “sees” only the average of ϕj\phi_j. One may apply averaging theory to define

H(ψ)=01dϕZ(ϕ+ψ)G(ϕ+ψ,ϕ)H(\psi)=\int_0^1{\mathrm{d}}\phi\,Z(\phi+\psi)G(\phi+\psi,\phi),

then

ψ̇=Δf+H(ψ)H(ψ)=Δf+Hodd(ψ)\dot\psi=\Delta f+H(\psi)-H(-\psi)=\Delta f+H^\mathrm{odd}(\psi).

Note that

HSNLCodd(ψ)=0H_\mathrm{SNLC}^\mathrm{odd}(\psi)=0

and

HSNLodd(ψ)=(1cosπ(1+ψ))(1cos(π(1+ψ)+π))=2cosπψH_\mathrm{SNL}^\mathrm{odd}(\psi)=(1-\cos\pi(1+\psi))-(1-\cos(\pi(1+\psi)+\pi)) = 2\cos\pi\psi,

only the latter has a fixpoint at ψ=1/2\psi=1/2

Stable and unstable fixpoints in the SNL coupling function

\Longrightarrow

  1. SNLC shows no synchronisation for Δf>0\Delta f>0
  2. SNL shows “antiphase” synchronisation. In a network this may lead to phenomena similar to frustration.
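A small numerical check of the two statements above (assuming numpy). It assumes pulse coupling, G(ϕ_i,ϕ_j)=Σ_k δ(ϕ_j−k), so that H(ψ)=Z(ψ), which is the same simplification used in the calculation of H^odd above; the PRC shapes are the SNLC and SNL forms from Sec. 8.

```python
import numpy as np

Z_snlc = lambda p: 1 - np.cos(2 * np.pi * p)          # SNLC shape (Sec. 8)
Z_snl  = lambda p: 1 - np.cos(np.pi * (1 + p))        # SNL shape (Sec. 8)

def H_odd(Z, psi):
    """H(psi) - H(-psi) for pulse coupling, where H(psi) = Z(psi); phases taken mod 1."""
    return Z(psi % 1.0) - Z((-psi) % 1.0)

psi = np.linspace(0, 1, 201)
print(np.max(np.abs(H_odd(Z_snlc, psi))))             # ~ 0: no locking for Delta f > 0
print(psi[np.argmin(np.abs(H_odd(Z_snl, psi)))])      # ~ 0.5: the 'antiphase' fixpoint
```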

10.2 Spike reliability and stimulus entrainment

Not all neuronal populations are coupled. Some sensory neurons like auditory receptors do not have lateral inhibition and just project to the next processing stage. Still, many of these neurons receive the same sensory input. Therefore, one can study two neurons that share a common input, or, equivalently, one neuron that is stimulated in repetitive trials with the same protocol.

Take two neurons ii and jj. Each neuron has its own intrinsic noise level σi\sigma_i, but all share a common stimulus x(t)x(t) and mean firing rate ff,

  1. ϕ̇i=f+Z(ϕi)x(t)+σiξi(t)\dot\phi_i=f+Z(\phi_i)x(t)+\sigma_i\,\xi_i(t).

Remember the neuronal codes. A spike time code was a mapping from a time continuous stimulus x(t)x(t) to a spike train y(t)y(t), i.e., the output is a high dimensional vector of spike times. In the following the stimulus that is common to all neurons ii is assumed to be a small amplitude zero-mean Gaussian process, x(t)=0{\langle}x(t){\rangle}=0, with wide-sense stationary covariance C(tr)=x(t)x(r)C(t-r)={\langle}x(t)x(r){\rangle}. The intrinsic noise has a different realisation for each neuron.

Q:
How reliable is this mapping? How close in spike-train space are two stimuli? How well is an idealised up-stream neuron able to distinguish stimuli, based on the information it receives from the down-stream neurons?

These are decoding questions. They are quantified by looking at spike-train metrics (Rossum 2001). In a non-stationary environment another question may be useful to ask:

Q:
Given neurons do lock to a stimulus11, but that they are in a random state before the stimulus is presented: How long does it take to reliably lock?

This question is important for up-stream neurons, too, since it determines the minimal window of integration that is required.

Neurons12 are perfectly in-phase locked if their phase difference is zero, ψ=ϕiϕj=0\psi=\phi_i-\phi_j=0, and stays that way, ψ̇=0\dot\psi=0. For simplicity look at the σ=0\sigma=0 case. WLOG take ϕj\phi_j as the reference. So, in the present case the phase difference evolves as

  1. ψ̇=(Z(ϕj+ψ)Z(ϕj))x(t)=g(ψ,t)\dot\psi=(Z(\phi_j+\psi)-Z(\phi_j))x(t)=g(\psi,t)

In a homogeneous (time independent) system, information about how fast the locked state is reached and how stable it is is given by the Lyapunov exponent, λ\lambda, of the phase difference.

Def. (Lyapunov exponent):
For a deterministic and autonomous13 system and a small enough initial phase difference, the Lyapunov exponent is the exponential attraction or divergence rate, ψ(t)eλt\psi(t)\propto e^{\lambda t}, which is the solution of the linearised dynamics around the phase fixpoint ψ0\psi_0:

\dot\psi=\underbrace{g'(\psi_0)}_\lambda\psi or \lambda=\frac{\mathrm{d}}{{\mathrm{d}}t}\ln\psi=g'(\psi_0).

Since there is a time-continuous stimulus present one can only define a (time-)averaged Lyapunov exponent

Def. (average Lyapunov exponent):
For a time-dependent system, the averaged Lyapunov exponent is
  1. \bar\lambda={\langle}\frac{\mathrm{d}}{{\mathrm{d}}t}\ln\psi{\rangle}

Assume that the neurons are already similar in their phase dynamics; then the right-hand side of the phase-difference equation, Eq. (69), can be expanded around ϕj\phi_j to obtain

  1. ψ̇=ψZ(ϕj)x(t)\dot\psi=\psi Z'(\phi_j)x(t).

In the absence of intrinsic noise, σ=0\bar\sigma=0, the averaged Lyapunov exponent from Eq (70) is

λ=Z(ϕj)x(t)\bar\lambda={\langle}Z'(\phi_j)x(t){\rangle}.

Note that this is a case for the Novikov-Furutsu-Donsker formula, because Z(ϕ(t))Z'(\phi(t)) is a nonlinear function of a stochastic process, ϕ(t)\phi(t) that depends on the stochastic process x(t)x(t). Therefore,

\bar\lambda={\langle}Z'(\phi_j)x(t){\rangle}=\int_{-\infty}^t{\mathrm{d}}r\,C(t-r)\left\langle\frac{\delta Z'(\phi_j(t))}{\delta x(r)}\right\rangle.

With the chain-rule and the definition in Eq (68) this yields

λ=tdrC(tr)Z(ϕj(t))Z(ϕj(r))\bar\lambda=\int_{-\infty}^t{\mathrm{d}}r\,C(t-r){\langle}Z''(\phi_j(t))Z(\phi_j(r)){\rangle}.

There are different approaches to calculate the remaining average. In an ergodic system the ensemble average can be replaced by temporal averaging, =limT1T0Tdt{\langle}{\rangle}=\lim_{T\to\infty}\frac1T\int_0^T{\mathrm{d}}t, so one gets

λ=limT1T0TdttdrC(tr)Z(ϕj(t))Z(ϕj(r))\bar\lambda=\lim_{T\to\infty}\frac1T\int_0^T{\mathrm{d}}t\int^t_{-\infty}{\mathrm{d}}r\,C(t-r)Z''(\phi_j(t))Z(\phi_j(r)).

Expanding further in x(t)x(t), i.e. to lowest order ϕj(t)=f0t\phi_j(t)={f_0}t, one finds

\bar\lambda=\lim_{T\to\infty}\frac1T\int_0^T{\mathrm{d}}t\int_{-\infty}^t{\mathrm{d}}r\,C(t-r)Z''({f_0}t)Z({f_0}r).

Example (white-noise stimulus)
In the special case of a white noise simulus, C(tr)=ϵ2δ(tr)C(t-r)=\epsilon^2\delta(t-r), one has

\bar\lambda=\lim_{T\to\infty}\frac{\epsilon^2}T\int_0^T{\mathrm{d}}t\,Z''({f_0}t)Z({f_0}t).

Since now we are dealing with periodic functions under the integral this is

λ=f0ϵ20f01dtZ(f0t)Z(f0t)\bar\lambda={f_0}\epsilon^2\int_0^{{f_0}^{-1}}{\mathrm{d}}t\,Z''({f_0}t)Z({f_0}t).

In phase variables this is

λ=ϵ201dϕZ(ϕ)Z(ϕ)=ϵ201dϕ(Z(ϕ))2\bar\lambda=\epsilon^2\int_0^1{\mathrm{d}}\phi\,Z''(\phi)Z(\phi)=-\epsilon^2\int_0^1{\mathrm{d}}\phi\,(Z'(\phi))^2.

Phase locking is thus guaranteed (\bar\lambda\leqslant0), and the locked state is reached faster if the PRC has large derivatives, i.e. higher Fourier components!

Several alternative derivations exist (Teramae and Tanaka 2004,D. S. Goldobin and Pikovsky (2005),Denis S. Goldobin and Pikovsky (2005))
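A minimal simulation sketch of this result (assuming numpy): two identical phase oscillators receive the same white-noise stimulus, and the decay rate of ln|ψ| is compared with λ̄=−ε²∫(Z′)²dϕ for the SNLC-type PRC. The Euler-Maruyama (Itô) discretisation and the one-sided δ-convention can shift the prefactor, so only order-of-magnitude agreement should be expected; all parameter values are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(3)
f0, eps, dt, T = 10.0, 0.2, 1e-4, 20.0
Z  = lambda p: 1.0 - np.cos(2 * np.pi * p)            # SNLC-type PRC (assumption)
dZ = lambda p: 2 * np.pi * np.sin(2 * np.pi * p)

# theory: averaged Lyapunov exponent of the phase difference
grid = np.linspace(0, 1, 1001)
lam_theory = -eps**2 * np.trapz(dZ(grid)**2, grid)

# simulation: two oscillators driven by a common white-noise stimulus, no intrinsic noise
n = int(T / dt)
phi = np.array([0.0, 0.02])                           # small initial phase difference
log_psi = np.empty(n)
for k in range(n):
    dx = eps * np.sqrt(dt) * rng.normal()             # common stimulus increment
    phi = phi + f0 * dt + Z(phi) * dx                 # Euler-Maruyama step
    log_psi[k] = np.log(abs(phi[1] - phi[0]))

lam_sim = np.polyfit(dt * np.arange(1, n + 1), log_psi, 1)[0]   # slope of log|psi|
print(lam_theory, lam_sim)
```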

Note (intrinsic noise)
In the presence of intrinsic noise (σ>0\sigma>0) there is no perfect locking. Nonetheless, the phase difference ψ\psi may converge to a unimodal distribution, peaked around zero. Given a uniform phase density to start with: How long does it take to converge to the stationary density of phase differences?

10.3 Inter-spike statistics

10.3.1 First passage time (no input and constant noise)

In the case σ>0\sigma>0 and x=0x=0 we have

ϕ̇=f0+σξ(t)\dot\phi={f_0}+\sigma\,\xi(t).

The adjoint Fokker-Planck equation

ṗ(ϕ,t)=σ22ϕ2p(ϕ,t)+fϕp(ϕ,t)\dot p(\phi,t)=\frac{\sigma^2}2\partial^2_\phi p(\phi,t)+f\partial_\phi p(\phi,t) s.t. BC p(1,t)=1p(1,t)=1

Laplace transform the equation

sP(ϕ,s)=σ22ϕ2P(ϕ,s)+fϕP(ϕ,s)sP(\phi,s)=\frac{\sigma^2}2\partial^2_\phi P(\phi,s)+f\partial_\phi P(\phi,s)

A solution is

P(ϕ,s)=exp{f0σ2(11+2sσ2/f02)(1ϕ)}P(\phi,s)=\exp\{\frac{{f_0}}{\sigma^2}(1-\sqrt{1+2s\sigma^2/{f_0}^2})(1-\phi)\}

The inverse Laplace transform of P(0,s)P(0,s) is the inverse Gaussian distribution

p(t)=\frac{\exp\{-\frac{(t{f_0}-1)^2}{2\sigma^2t}\}}{\sigma\sqrt{2\pi t^3}}.

Many neurons that do not have slow channels (adaptation, pumps, …) can be fitted with this ISI distribution.
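A minimal sketch comparing simulated first-passage times of ϕ̇=f₀+σξ with the inverse Gaussian density above (assuming numpy); f₀, σ, the time step and the trial number are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(2)
f0, sigma, dt, n_trials = 10.0, 0.5, 1e-4, 5000

# simulate all phases in parallel until each has crossed the threshold phi = 1
phi = np.zeros(n_trials)
isi = np.zeros(n_trials)
active = np.ones(n_trials, dtype=bool)
t = 0.0
while active.any():
    t += dt
    phi[active] += f0 * dt + sigma * np.sqrt(dt) * rng.normal(size=active.sum())
    crossed = active & (phi >= 1.0)
    isi[crossed] = t
    active &= ~crossed

# compare the histogram of passage times with the inverse Gaussian density
tt = np.linspace(1e-3, 3.0 / f0, 400)
p_ig = np.exp(-(tt * f0 - 1) ** 2 / (2 * sigma**2 * tt)) / (sigma * np.sqrt(2 * np.pi * tt**3))
hist, edges = np.histogram(isi, bins=50, density=True)
```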

10.3.2 Phase dependent noise

Ion channel noise is not constant throughout the inter spike interval. In a simple two state channel the voltage dependent noise variance is

σ2(v)=1Nn(v)(1n(v))\sigma^2(v)=\frac1Nn_\infty(v)(1-n_\infty(v)),

where NN is the number of ion channels and nn_\infty is the steady state activation. Hence one may wish to analyse

ϕ̇=f0+σ(ϕ)ξ\dot\phi={f_0}+\sigma(\phi)\xi.

This cannot be solved in general, but the moments of the ISI distribution are simpler.

The following concerns equations for the statistics of waiting times as in Ref. (Gardiner 2004). Let y(t)y(t) be the process in question, denoting a voltage or a phase variable. The process starts at a particular y0y_0, i.e. the initial distribution is

  1. p(y,t0)=δ(yy0)p(y,t_0)=\delta(y-y_0),

and one is interested in the distribution of times for the process to reach y=y1y=y_1.

If the process is governed by a stochastic differential equation, then it was shown that the associated density p(y,t)p(y,t) is propagated by a specific evolution operator

  1. $\dot p(y,t)={{\mathcal F}}(y)p(y,t)$.

This equation was called the Fokker-Planck equation (FPE, see Eq (51)). Denote the solution of a homogeneous FPE with starting distribution concentrated at one value y0y_0 by p(y,t;y0,t0)p(y,t;y_0,t_0) such that p(y,t0;y0,t0)=δ(yy0)p(y,t_0;y_0,t_0)=\delta(y-y_0) and write its formal solution as

  1. $p(y,t;y_0,t_0) = e^{(t-t_0){{\mathcal F}}(y)}\delta(y-y_0)$.

The goal is to find a relation between the ISI distribution of the neuron model and the FP operator. For that assume the process lives in an interval (y1,y2)(y_1,y_2), where y2y_2 could denote the spike threshold and y1y_1 the resting potential to which an IF-like neuron resets, or the two boundaries encapsulating the periodic domain of the phase oscillator interval, e.g. y1=0y_1=0 and y2=1y_2=1. At time t0t_0, the system is supposed to start inside the interval, y1y0y2y_1\leqslant y_0\leqslant y_2. The probability at time t>t0t>t_0 of still being inside the interval (y1,y2)(y_1,y_2), and thus no spike occurring, is (Gardiner 2004)

G(y0,t)=Pr(y1y(t)y2)=y1y2dỹp(ỹ,t;y0,t0)G(y_0,t)=\Pr(y_1\!\leqslant\!y(t)\!\leqslant\!y_2)=\int\limits_{y_1}^{y_2}{\mathrm{d}}\tilde y\;p(\tilde y,t;y_0,t_0),

with the additional condition G(y0,t0)=1G(y_0,t_0)\!=\!1 because we started with y0[y1,y2]y_0\in[y_1,y_2]. The decrease of G(y0,t)G(y_0,t) over time, i.e. the negative time derivative, measures the rate at which probability leaves the interval. It is also called the

Def (first-passage time density)
Thinking of G(y0,t)G(y_0,t) as the fraction of neurons still inside [y1,y2][y_1,y_2], its rate of decrease is the flux out of the interval:
  1. q(t,y_0)=-\frac{\partial G(y_0,t)}{\partial t}.

The goal is to find an evolution equaiton for q(t,y0)q(t,y_0). With the help of the formal solution in Eq. (74), it can be shown that the inner product14 of h(y,t)=G(y,t)h(y,t)=G(y,-t) and p(y,t;y0,t0)p(y,t;y_0,t_0) is constant

h,p=dyh(y,t)p(y,t;y0,t0)=dydỹp(ỹ,t;y,t0)p(y,t;y0,t0){\langle}h,p{\rangle}=\int{\mathrm{d}}y\;h(y,t)p(y,t;y_0,t_0)=\iint{\mathrm{d}}y{\mathrm{d}}\tilde y\,p(\tilde y,-t;y,t_0)p(y,t;y_0,t_0)

$=\iint{\mathrm{d}}y{\mathrm{d}}\tilde y\,e^{-t{{\mathcal F}}(\tilde y)}\delta(\tilde y-y)e^{t{{\mathcal F}}(y)}\delta(y-y_0) =\int{\mathrm{d}}y\,\delta(y-y_0)=1$.

Note that the operator $e^{t{{\mathcal F}}}$ commutes with the identity operator δ(yỹ)\delta(y-\tilde y). Taking the time derivative of this constant and using $\dot p={{\mathcal F}}p$ one obtains

$\partial_t{\langle}h,p{\rangle}= {\langle}\dot h,p{\rangle}+{\langle}h,\dot p{\rangle}={\langle}\dot h,p{\rangle}+{\langle}{{\mathcal F}}^\dagger h,p{\rangle}=0$.

Because pp may change according to its initial conditions, the last expression implies that $\dot h=-{{\mathcal F}}^\dagger h$, or that G(y,t)G(y,t) is a solution to the equation (Gardiner 2004)

$\dot G(y,t)={{\mathcal F}}^\dagger G(y,t)$, s.t. G(y,T0)=1I[y1,y2](y)G(y,T_0)=1\!\mathrm{I}_{[y_1,y_2]}(y).

The adjoint operator ${{\mathcal F}}^\dagger$ is also called the infinitesimal generator of the stochastic process. In addition to the boundary condition above, trivially stating that if we start in the boundary the initial probability of inside is one, one may include reflecting boundary conditions at the lower end yG(y,t)|y=y1=0\partial_yG(y,t)|_{y=y_1}=0 and absorbing boundary conditions at the upper end G(y2,t)=0G(y_2,t)=0.

Because partial derivatives and integrals are both linear operators, the equation for qq directly reads the same

  1. $\dot q(y,t)={{\mathcal F}}^\dagger q(y,t)$,

just the boundary condition should read q(y2,t)=1q(y_2,t)=1.

Since one of the main objectives in this document is to establish links between microscopic noise sources such as channel noise and the macroscopic spike jitter, one may immediately pose the question: How much information about the underlying diffusion process can we extract from first passage time densities like the ISI distribution? Might there be a unique diffusion process generating it? A sobering answer to the second question was given in (???): No, the solution is not unique; there are several possible diffusion processes that may lead to one and the same passage time density.

Yet, not all is lost. If one takes into account constraints from membrane biophysics, then the derived diffusion process is not completely arbitrary. In fact, if the model is derived from first principles, then the free parameters in the model can be related to the ISI statistics.

10.3.3 Moments of the ISI distribution

Instead of attempting to obtain the complete ISI distribution by solving the adjoint Fokker-Planck equation, Eq. (76), one may content oneself with the first two moments or the coefficient of variation, which one uses to quantify spike jitter in experiments. Let us set t0=0t_0=0 and denote the nthn^\text{th} moment of the ISI distribution as (Gardiner 2004)

T_n(y)=\int_0^\infty{\mathrm{d}}\tau\;\tau^n q(\tau,y)=n\int_0^\infty{\mathrm{d}}\tau\;\tau^{n-1}G(y,\tau),

where the fact was used that for the finite interval [y1,y2][y_1,y_2] exit is guaranteed, i.e., G(y0,)=0G(y_0,\infty)=0. One may multiply both sides of Eq. (76) with tnt^n and integrate to obtain a recursive ordinary differential equation for the moments

  1. $nT_{n-1}+{{\mathcal F}}^\dagger T_n=0$, s.t. Tn(y1)=Tn(y2)=0T_n'(y_1)=T_n(y_2)= 0 and T0=1T_0=1.

Here we have imposed reflecting boundary conditions on the lower limit y1y_1 and absorbing boundary conditions on the upper limit y2y_2. These conditions are in agreement with an IF model, which, once reaching the spike threshold, is reset and unable to move backwards through the spike. As discussed in the beginning, they can also be applied as an approximation to the phase oscillator if the noise is weak. This equation is also a direct consequence of the Feynman-Kac formula.

In a later chapter this recursion will be used to calculate ISI moments of conductance-based neurons using a phase reduction. Suppose we have an FP operator ${{\mathcal F}}(\phi)$ for the equivalent phase variable that is accurate to order εk\varepsilon^k in the noise. Then all moments TkT_k up to the kthk^{\textrm{th}} order can be obtained accurately. For example, if one is interested in the ISI variance, the method will require finding a suitable SDE for the phase variable ϕ(t)\phi(t) that gives the FP operator to second order.
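A minimal sketch of how the n=1 member of this recursion could be solved numerically (assuming numpy): the generator F†=(σ²/2)d²/dy²+f₀ d/dy with constant coefficients is discretised by finite differences on [0,1] with the reflecting and absorbing boundary conditions stated above, and F†T₁=−1 is solved as a linear system. The constant-coefficient drift-diffusion model and all parameter values are assumptions for illustration only.

```python
import numpy as np

f0, sigma = 10.0, 0.5
n = 401
y = np.linspace(0.0, 1.0, n)
h = y[1] - y[0]

# discretise F^dagger = (sigma^2/2) d^2/dy^2 + f0 d/dy on the interior nodes
A = np.zeros((n, n))
for i in range(1, n - 1):
    A[i, i - 1] = sigma**2 / (2 * h**2) - f0 / (2 * h)
    A[i, i]     = -sigma**2 / h**2
    A[i, i + 1] = sigma**2 / (2 * h**2) + f0 / (2 * h)
A[0, 0], A[0, 1] = -1.0 / h, 1.0 / h     # reflecting boundary: T1'(y1) = 0
A[-1, -1] = 1.0                          # absorbing boundary:  T1(y2) = 0

b = -np.ones(n)                          # F^dagger T1 = -1 in the interior
b[0] = b[-1] = 0.0                       # boundary rows carry the BC right-hand sides

T1 = np.linalg.solve(A, b)               # mean first-passage time from each start point y
print(T1[0], 1.0 / f0)                   # close to the drift-dominated estimate 1/f0
```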

10.4 Moments of the inter-spike interval distribution

The nthn^\mathrm{th} moment, Tn(ϕ)|ϕ=1T_n(\phi)|_{\phi=1}, of the first passage time density is the solution to (Gardiner 2004)

nTn1+F(ϕ)Tn=0nT_{n-1}+F(\phi)T_{n}=0, s.t. BC: Tn(1)=0T_n(1)=0 and T0=1T_0=1,

where the Fokker-Planck backward operator for the Stratonovich phase SDE above is

F(ϕ)=σ(ϕ)ddϕσ(ϕ)ddϕ+f0ddϕ.F(\phi)=\vec\sigma(\phi)\cdot\frac{\mathrm{d}}{{\mathrm{d}}\phi}\vec\sigma(\phi)\frac{\mathrm{d}}{{\mathrm{d}}\phi}+{f_0}\frac{{\mathrm{d}}}{{\mathrm{d}}\phi}.

Assuming that ϕ:ϵ(ϕ)=f01σ(ϕ)1\forall\phi:\epsilon(\phi)={f_0}^{-1}\vec\sigma(\phi)\ll1, solutions T_n(\phi)=T_{n0}+T_{n1}+T_{n2}+\dots can be sought in a perturbative manner.

10.5 Renewal equation

In a renewal process all inter-spike intervals are independent, as though each is separately drawn from the ISI distribution. But slow kinetic processes in the neuronal dynamics or long-term correlations in the external stimulus could give the spike train negative or positive correlations. A point process with such properties would be called non-renewal.

The ISI distribution alone does not tell us about the correlation between consecutive interspike intervals. Are they independent, negatively or positively correlated? Several types of adaptation currents have time scales spanning orders of magnitude above the spiking period and indeed their contribution to ISI correlations has been analysed. But, for the sake of simplicity, we will ignore the effects on longer time scales and consider a spike train as arising from a renewal process.

In the following we treat the neuron as a threshold device such as an integrate-and-fire neuron or a phase model neuron. We compile here a few known results on renewal processes that we will need in later chapters.

The transition probability p(\theta,t;y_0,t_0) describes the probability that a spike occurring at time t=t_0, when the neuron was in state y_0, is followed by a spike at time t, when the neuron crosses the threshold \theta. Here q_\theta(\tau) is shorthand for the interspike interval distribution from y_0 at t_0=0 to threshold \theta, corresponding to the transition probability p(\theta,\tau;y_0,0). For a stationary renewal process the transition probability after a spike can be decomposed into the probability that the next spike is the first one, plus the probability that an earlier first spike at time \tau is followed by a further threshold-to-threshold transition in the remaining time,

p(\theta,t)=q_\theta(t)+\int_0^t{\mathrm{d}}\tau\,p(\theta,t-\tau)\,q_\theta(\tau),

where the second term is a convolution due to stationarity.

The spike autocorrelation C(\tau) is the probability that, given a spike at t, there is another spike at t+\tau. This is equivalent to the transition probability C(\tau)=p(\theta,\tau;\theta,0) of being back at the spike threshold after a time \tau has elapsed. By recursively splitting the transition probability above into all consecutive possible spiking events one ends up with an infinite sum of convolutions,

C(\tau)=q_\theta(\tau)+(q_\theta*q_\theta)(\tau)+(q_\theta*q_\theta*q_\theta)(\tau)+\dots

The typical approach to isolate the ISI density from this relation is by means of the Laplace transform \tilde f(s)=\int_0^\infty{\mathrm{d}}t\,e^{-st} f(t), with s\in{I\!\!C}, which turns the convolutions into products and the sum into a geometric series,

\tilde C(s)=\frac{\tilde q_\theta(s)}{1-\tilde q_\theta(s)}.

In some cases the result may be transformed back into the time domain, if the Mellin-Bromwich integral exists, that is. The constant c in that inversion formula is to be chosen to the right of the real parts of all the integrand's singularities. In cases where this integral cannot be evaluated explicitly one is stuck with an expression in the Laplace domain, which is not all that bad, as at least individual moments of the time-domain distribution as well as the spike power spectrum may be evaluated. Moments are given by derivatives at the origin, {\langle}\tau^n{\rangle}=(-1)^n\frac{{\mathrm{d}}^n}{{\mathrm{d}}s^n}\tilde q_\theta(s)\big|_{s=0}.

The power spectrum of a stationary renewal spike train is the Fourier transform of the spike-train autocorrelation. First, we can identify C(\tau) again in the infinite series and write the linear Volterra integral equation

C(\tau)=q_\theta(\tau)+\int_0^\tau{\mathrm{d}}s\,q_\theta(s)\,C(\tau-s).

Then the Laplace transform can be applied to this equation to solve for the spike-train spectrum in terms of \tilde q_\theta evaluated on the imaginary axis.

10.5.0.1 The first moment (n=1n=1)

The equation for the first moment is

  1. 1+σ(ϕ)ddϕσ(ϕ)dT1dϕ+f0dT1dϕ=01+\vec\sigma(\phi)\cdot\frac{\mathrm{d}}{{\mathrm{d}}\phi}\vec\sigma(\phi)\frac{{\mathrm{d}}T_1}{{\mathrm{d}}\phi}+{f_0}\frac{{\mathrm{d}}T_1}{{\mathrm{d}}\phi}=0
$\mathcal O(\epsilon^0)$
The zeroth order equation reads 1+{f_0}\frac{{\mathrm{d}}T_{10}}{{\mathrm{d}}\phi}=0 and the absorbing BC yields

T10=f01(1ϕ)T_{10}={f_0}^{-1}(1-\phi).

$\mathcal O(\epsilon^2)$
Collecting all second order terms from Eq. (78) and inserting T10T_{10}, results in σ(ϕ)ddϕσ(ϕ)dT10dϕ+f0dT12dϕ\vec\sigma(\phi)\cdot\frac{\mathrm{d}}{{\mathrm{d}}\phi}\vec\sigma(\phi)\frac{{\mathrm{d}}T_{10}}{{\mathrm{d}}\phi}+{f_0}\frac{{\mathrm{d}}T_{12}}{{\mathrm{d}}\phi} =f0dT12dϕf01σ(ϕ)σ(ϕ)=0={f_0}\frac{{\mathrm{d}}T_{12}}{{\mathrm{d}}\phi}-{f_0}^{-1}\vec\sigma(\phi)\cdot\vec\sigma'(\phi)=0. With the BC this is solved by

T_{12}(\phi)=-{f_0}^{-2}\int_\phi^1{\mathrm{d}}\theta\;\vec\sigma(\theta)\cdot\vec\sigma'(\theta)=\frac1{2{f_0}^2}[\sigma^2(\phi)-\sigma^2(1)].

10.5.0.2 The second moment (n=2n=2)

The second moment has to solve

  1. 2T1+σ(ϕ)ddϕσ(ϕ)dT2dϕ+f0dT2dϕ=02T_1+\vec\sigma(\phi)\frac{{\mathrm{d}}}{{\mathrm{d}}\phi}\vec\sigma(\phi)\frac{{\mathrm{d}}T_2}{{\mathrm{d}}\phi} +{f_0}\frac{{\mathrm{d}}T_2}{{\mathrm{d}}\phi}=0
$\mathcal O(\epsilon^0)$
To zeroth order the equation is 2T10+f0dT20dϕ=02T_{10}+{f_0}\frac{{\mathrm{d}}T_{20}}{{\mathrm{d}}\phi}=0, with solution

T20=f02(1ϕ)2T_{20}={f_0}^{-2}(1-\phi)^2.

$\mathcal O(\epsilon^2)$
Second order equation is 2T12+σ(ϕ)ddϕσ(ϕ)dT20dϕ+f0dT22dϕ=02T_{12}+\vec\sigma(\phi)\cdot\frac{\mathrm{d}}{{\mathrm{d}}\phi}\vec\sigma(\phi)\frac{{\mathrm{d}}T_{20}}{{\mathrm{d}}\phi}+{f_0}\frac{{\mathrm{d}}T_{22}}{{\mathrm{d}}\phi}=0, or 3σ2(ϕ)σ2(1)2σ(ϕ)σ(ϕ)(1ϕ)+f03dT22dϕ=03\sigma^2(\phi)-\sigma^2(1)-2\vec\sigma(\phi)\cdot\vec\sigma'(\phi)(1-\phi)+{f_0}^3\frac{{\mathrm{d}}T_{22}}{{\mathrm{d}}\phi}=0. This is solved by

T_{22}(\phi)={f_0}^{-3}\left[2\int_{\phi}^1{\mathrm{d}}\theta\,\sigma^2(\theta)+(\sigma^2(\phi)-\sigma^2(1))(1-\phi)\right].

The ISI variance is given by T22(0)+T20(0)(T10(0)+T12(0))2T_{22}(0)+T_{20}(0)-(T_{10}(0)+T_{12}(0))^2, evaluated to second order in \epsilon.

10.6 Spike auto-spectrum

To calculate the spectral coherence, Eq. (41), the spike auto-spectrum Pyy(ω)P_{yy}(\omega) is still needed for normalisation. In general this is complicated, but it helps if the linearity assumption can be extended to the spike train itself. Assume

y(t)=y0(t)+drg(r)x(tr)y(t)=y_0(t) + \int{\mathrm{d}}r\,g(r)x(t-r).

Then trial averaging, y|x{\langle}\cdot{\rangle}_{y|x} yields a result consistent with our previous linear response setting

r(t)=r0+drg(r)x(tr)r(t)=r_0 + \int{\mathrm{d}}r\,g(r)x(t-r).

Take the Fourier transform Y(\omega)=Y_0(\omega)+G(\omega)X(\omega) and assume that intrinsic noise and stimulus are uncorrelated

Pyy(ω)=(Y0(ω)+G(ω)X(ω))(Y0*(ω)+G*(ω)X*(ω))P_{yy}(\omega)={\langle}(Y_0(\omega)+G(\omega)X(\omega))(Y_0^*(\omega)+G^*(\omega)X^*(\omega)){\rangle}

=Py0y0(ω)+|G(ω)|2Pxx(ω)=P_{y_0y_0}(\omega)+|G(\omega)|^2P_{xx}(\omega)

10.7 Mutual entrainment

Assume two neurons ii and jj, whose spike trains are y(ϕi(t))=kδ(ϕi(t)k)y(\phi_i(t))=\sum_k\delta(\phi_i(t)-k). The spike dynamics is represented by their I/O-equivalent phase oscillators

  1. \dot\phi_i=f_i+Z(\phi_i)y(\phi_j) and \dot\phi_j=f_j+Z(\phi_j)y(\phi_i)

10.7.1 Spike metric

10.7.2 Time to Spike

11 Appendix

11.1 Novikov-Furutsu-Donsker formula

A relation between Gaussian noise sources and functions of the state variables in stochastic systems is given by the Novikov-Furutsu-Donsker (NFD) formula. It examines the correlation of a stochastic process ξ(t)\xi(t) at a fixed instant in time, tt, and a function ff of x(t)x(t), which is another stochastic process that depends on ξ(t)\xi(t) (???,(???),(???)). One of the advantages of the NFD formula is that it is applicable to systems with multiplicative noise, as they arise in several applications involving phase response curves. The result is the following

  1. ${\langle}f[\xi]\xi(t){\rangle}={{\textstyle\frac12}}\int_{-\infty}^\infty {\langle}\xi_t\xi_{t_1}{\rangle}\left{\langle}\left.\frac{{{\updelta}}f[\eta+\xi]}{{{\updelta}}\eta_{t_1}}\right|_{\eta=0}\right{\rangle}{\mathrm{d}}t_1$.

The script uses the formula several times so a formal and very compact derivation follows (???). In many physical examples only the values in the past t1tt_1\leqslant t, influence the functional ff and the integration range can be adjusted accordingly.

Note that this is related to fluctuation-dissipation relations in statistical physics. The state variable xx is a random process that in turn depends on past values of the random process ξ(t)\xi(t). One may, therefore, treat the function ff as a functional of the path ξt1:t1t\xi_{t_1}:\forall t_1\leqslant t. Assuming that the process ξ(t)\xi(t) has a zero mean function, the first step is to write this functional as a Taylor series15 around the deterministic function η(t)=0\eta(t)=0 (omitting the integration domain, which is understood to run from -\infty to tt)

f[η+ξ]=f[η]|η=0+k=11k!dt1dtkξ(t1)ξ(tk)(δkf[η]δη(t1)δη(tk))|η=0=(expdtξ(t)δδη(t))f[η]|η=0f[\eta+\xi]=f[\eta]|_{\eta=0} + \sum_{k=1}^\infty \frac1{k!} \int\!\cdots\!\int{\mathrm{d}}t_1\cdots{\mathrm{d}}t_k\;\xi(t_1)\cdots\xi(t_k) \left(\frac{{{\updelta}}^kf[\eta]}{{{\updelta}}\eta(t_1)\cdots{{\updelta}}\eta(t_k)}\right)\Big|_{\eta=0}=\left(\exp{\int {\mathrm{d}}t'\,\xi(t')\frac{{{\updelta}}}{{{\updelta}}\eta(t')}}\right)f[\eta]\big|_{\eta=0}.

The last expression is just a formal, compressed way of writing it using the definition of the exponential displacement operator. As f[η]f[\eta] is deterministic it can be pulled out of any averaging over the noise ensemble, e.g.

${\langle}f[\eta+\xi]{\rangle}=\left{\langle}\left(\exp{\int {\mathrm{d}}t'\,\xi(t')\frac{{{\updelta}}}{{{\updelta}}\eta(t')}}\right)\right{\rangle}f[\eta]\big|_{\eta=0}$.

Hence, we can formally write

  1. ${\langle}\xi(t)f[\xi]{\rangle}=\left{\langle}\xi(t)\exp{\int {\mathrm{d}}t'\;\xi(t')\frac{{{\updelta}}}{{{\updelta}}\eta(t')}} \right{\rangle}f[\eta]\big|_{\eta=0}$

$=\frac{\left{\langle}\xi(t)\exp\int {\mathrm{d}}t'\;\xi(t')\frac{{{\updelta}}}{{{\updelta}}\eta(t')}\right{\rangle}} {\left{\langle}\exp\int{\mathrm{d}}t'\;\xi(t')\frac{{{\updelta}}}{{{\updelta}}\eta(t')}\right{\rangle}} {\langle}f[\eta+\xi]{\rangle}\big|_{\eta=0}$.

Next, the infinite dimensional Fourier transform of a stochastic process called the characteristic functional is introduced

$\Phi[\lambda]=\left{\langle}\exp\left({\mathrm{i}}\int{\mathrm{d}}t'\;\lambda(t')\xi(t')\right)\right{\rangle}$.

For the, by assumption, Gaussian process ξt\xi_t it is known to be the exponential of a quadratic form

Φ[λ]=exp(12dt1dt2λ(t1)C(t1,t2)λ(t2))\Phi[\lambda]=\exp\left(-\frac12\int{\mathrm{d}}t_1{\mathrm{d}}t_2\; \lambda(t_1)C(t_1,t_2)\lambda(t_2)\right),

which must be real, Φ[λ]IR\Phi[\lambda]\in{I\!\!R}, because the density is symmetric around η(t)=0\eta(t)=0.

With the help of the following identity

$\frac{{\langle}\xi(t)\exp{\mathrm{i}}\int{\mathrm{d}}t'\xi(t')\lambda(t'){\rangle}} {{\langle}\exp{\mathrm{i}}\int{\mathrm{d}}t'\xi(t')\lambda(t'){\rangle}} =\frac{{{\updelta}}}{{\mathrm{i}}{{\updelta}}\lambda} \ln\left{\langle}\exp\;{\mathrm{i}}\int{\mathrm{d}}t'\xi(t')\lambda(t')\right{\rangle}=\frac{{{\updelta}}}{{\mathrm{i}}{{\updelta}}\lambda}\ln\Phi[\lambda]$,

and a formal substitution δ/δη(t)iλ(t){{\updelta}}/{{\updelta}}\eta(t)\to{\mathrm{i}}\lambda(t) we may simplify the expression above. Back substituting iλ(t)δ/δηt{\mathrm{i}}\lambda(t)\to{{\updelta}}/{{\updelta}}\eta_{t} we obtain Eq. (81).

Bibliography

Atick, Joseph J. 1992. “Could Information Theory Provide an Ecological Theory of Sensory Processing?” Network: Computation in Neural Systems 3 (2): 213–51. http://www.tandfonline.com/doi/abs/10.1088/0954-898X_3_2_009.

Barlow, HB. 1961. “Possible Principles Underlying the Transformations of Sensory Messages.” In Sensory Communication, edited by WA Rosenblith, 217–34. MIT Press.

Brown, Eric, Jeff Moehlis, and Philip Holmes. 2004. “On the Phase Reduction and Response Dynamics of Neural Oscillator Populations.” Neural Computation 16 (4): 673–715. doi:10.1162/089976604322860668.

Chacron, Maurice J., Benjamin Lindner, and André Longtin. 2004. “Noise Shaping by Interval Correlations Increases Information Transfer.” Physical Review Letters 92 (8). doi:10.1103/PhysRevLett.92.080601.

Chicone, Carmen. 2006. Ordinary Differential Equations with Applications. Springer Science & Business Media.

Cover, Thomas M., and Joy A. Thomas. 2012. Elements of Information Theory. John Wiley & Sons.

Ermentrout, Bard. 1996. “Type I Membranes, Phase Resetting Curves, and Synchrony.” Neural Computation 8 (5): 979–1001. http://www.mitpressjournals.org/doi/abs/10.1162/neco.1996.8.5.979.

Ermentrout, G. Bard, Leon Glass, and Bart E. Oldeman. 2012. “The Shape of Phase-Resetting Curves in Oscillators with a Saddle Node on an Invariant Circle Bifurcation.” Neural Computation 24 (12): 3111–25. doi:10.1162/NECO_a_00370.

Ermentrout, G., and N. Kopell. 1986. “Parabolic Bursting in an Excitable System Coupled with a Slow Oscillation.” SIAM Journal on Applied Mathematics 46 (2): 233–53. doi:10.1137/0146017.

Fox, Ronald F., and Yan-nan Lu. 1994. “Emergent Collective Behavior in Large Numbers of Globally Coupled Independently Stochastic Ion Channels.” Physical Review E 49 (4): 3421–31. doi:10.1103/PhysRevE.49.3421.

Gabbiani, Fabrizio, and Christof Koch. 1998. “Principles of Spike Train Analysis.” Methods in Neuronal Modeling 12 (4): 313–60. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.383.7571&rep=rep1&type=pdf.

Gardiner, C. W. 2004. Handbook of Stochastic Methods for Physics, Chemistry, and the Natural Sciences. 3rd ed. Springer Series in Synergetics. Berlin ; New York: Springer-Verlag.

Goldobin, D. S., and A. S. Pikovsky. 2005. “Synchronization of Self-Sustained Oscillators by Common White Noise.” Physica A: Statistical Mechanics and Its Applications, New Horizons in Stochastic ComplexityInternational Workshop on New Horizons in Stochastic Complexity, 351 (1): 126–32. doi:10.1016/j.physa.2004.12.014.

Goldobin, Denis S., and Arkady Pikovsky. 2005. “Synchronization and Desynchronization of Self-Sustained Oscillators by Common Noise.” Physical Review E 71 (4). doi:10.1103/PhysRevE.71.045201.

Goldwyn, Joshua H., and Eric Shea-Brown. 2011. “The What and Where of Adding Channel Noise to the Hodgkin-Huxley Equations.” PLoS Comput Biol 7 (11): e1002247. doi:10.1371/journal.pcbi.1002247.

Goldwyn, Joshua H., Nikita S. Imennov, Michael Famulare, and Eric Shea-Brown. 2011. “Stochastic Differential Equation Models for Ion Channel Noise in Hodgkin-Huxley Neurons.” Physical Review E 83 (4): 041908. doi:10.1103/PhysRevE.83.041908.

Golshani, Leila, and Einollah Pasha. 2010. “Rényi Entropy Rate for Gaussian Processes.” Information Sciences 180 (8): 1486–91. doi:10.1016/j.ins.2009.12.012.

Hodgkin, Alan L., and Andrew F. Huxley. 1952. “A Quantitative Description of Membrane Current and Its Application to Conduction and Excitation in Nerve.” The Journal of Physiology 117 (4): 500. http://www.ncbi.nlm.nih.gov/pmc/articles/pmc1392413/.

Izhikevich, Eugene M. 2007. Dynamical Systems in Neuroscience. MIT Press.

Kielhöfer, Hansjörg. 2011. Bifurcation Theory: An Introduction with Applications to Partial Differential Equations. Springer Science & Business Media.

Kuramoto, Y. 1984. Chemical Oscillations, Waves, and Turbulence. Springer Science & Business Media.

Laughlin, S. 1981. “A Simple Coding Procedure Enhances a Neuron’s Information Capacity.” Zeitschrift Fur Naturforschung. Section C, Biosciences 36 (9-10): 910–12.

Linaro, Daniele, Marco Storace, and Michele Giugliano. 2011. “Accurate and Fast Simulation of Channel Noise in Conductance-Based Model Neurons by Diffusion Approximation.” PLoS Comput Biol 7 (3): e1001102. doi:10.1371/journal.pcbi.1001102.

Lindner, Benjamin, Maurice J. Chacron, and André Longtin. 2005. “Integrate-and-Fire Neurons with Threshold Noise: A Tractable Model of How Interspike Interval Correlations Affect Neuronal Signal Transmission.” Physical Review E 72 (2). doi:10.1103/PhysRevE.72.021911.

Orio, Patricio, and Daniel Soudry. 2012. “Simple, Fast and Accurate Implementation of the Diffusion Approximation Algorithm for Stochastic Ion Channels with Multiple States.” PLoS ONE 7 (5): e36670. doi:10.1371/journal.pone.0036670.

Pezo, Danilo, Daniel Soudry, and Patricio Orio. 2014. “Diffusion Approximation-Based Simulation of Stochastic Ion Channels: Which Method to Use?” Frontiers in Computational Neuroscience 8: 139. doi:10.3389/fncom.2014.00139.

Rossum, M. C. W. van. 2001. “A Novel Spike Distance.” Neural Computation 13 (4): 751–63. doi:10.1162/089976601300014321.

Teramae, Jun-nosuke, and Dan Tanaka. 2004. “Robustness of the Noise-Induced Phase Synchronization in a General Class of Limit Cycle Oscillators.” Physical Review Letters 93 (20): 204103. doi:10.1103/PhysRevLett.93.204103.

Winfree, Arthur T. 2001. The Geometry of Biological Time. Springer Science & Business Media.


  1. Fortunately the jellyfish Aglantha digitale does not care much for dogmas and encodes its swimming patterns in different action potential shapes.

  2. Actually it does, if you bring energetic considerations to bear.

  3. There is also plenty of coding in graded potentials going on, for example in some of Drosophila melanogaster's neurons or your retina. C. elegans seems to manage completely without spikes.

  4. For now this means that it is an invertible matrix; t:R̲1(t)\forall t:\exists{\underline R}^{-1}(t)

  5. Four nucleobases A: Adenine, G: Guanine, C: Cytosine, T: Thymine.

  6. Here, the alphabet is a firing rate f0IR{f_0}\in{I\!\!R}. It might be more reasonable to think about spike counts in a given time window, which is still a countable set.

  7. Remember the determinant is |K̲|=i=1nλi|{\underline K}|=\prod_{i=1}^n\lambda_i. So ln|K̲|=i=1nlnλi\ln|{\underline K}|=\sum_{i=1}^n\ln\lambda_i. In terms of the matrix-logarithm and the trace the determinant can be expressed as $|{\underline K}|=\exp\tr\ln{\underline K}$.

  8. Because the trace is invariant under similarity transforms $\tr{\underline K}=\sum_{i=1}^n\lambda_i$.

  9. Similar to the particle and wave duality in quantum mechanics.

  10. This can be done formally using Chapman-Enskog or adiabatic elimination procedures. The derivation may be included in the future.

  11. Not perfectly though, because there is the intrinsic noise, ξi(t)\xi_i(t).

  12. This may now refer to one and the same neuron presented with the same frozen stimulus time and again or a population of non-interacting very similar neurons, which get the same input.

  13. independent of tt

  14. The inner product is the inner product on a function space f,g=dxf(x)g(x){\langle}f,g{\rangle}=\int{\mathrm{d}}x\,f(x)g(x).

  15. We are expanding not a function nor a vector-valued function but a functional.