Neural decoding

From Wikipedia, the free encyclopedia

Neural decoding is a neuroscience-related field concerned with the reconstruction of sensory and other stimuli from information that has already been encoded and represented in the brain by networks of neurons. Reconstruction refers to the ability of the researcher to predict what sensory stimuli the subject is receiving based purely on neuron action potentials. Therefore, the main goal of neural decoding is to characterize how the electrical activity of neurons elicit activity and responses in the brain.^[1]

This article specifically refers to neural decoding as it pertains to the mammalian neocortex.

Overview

When looking at a picture, our brains are constantly making decisions about what object we are looking at, where we need to move our eyes next, and what we find to be the most salient aspects of the input stimulus. As these images hit the back of our retina, these stimuli are converted from varying wavelengths to a series of neural spikes called action potentials. These pattern of action potentials are different for different objects and different colors; we therefore say that the neurons are encoding objects and colors by varying their spike rates or temporal pattern. Now, if someone were to probe the brain by placing electrodes in the primary visual cortex, they may find what appears to be random electrical activity. These neurons are actually firing in response to the lower level features of visual input, possibly the edges of a picture frame. This highlights the crux of the neural decoding hypothesis: that is possible to reconstruct a stimulus from the response of the ensemble of neurons that represent it. By this we mean, it is possible to look at spike train data and say that the person or animal we are recording from is looking at a red ball.

Encoding to decoding

Implicit about the decoding hypothesis is the assumption that neural spiking in the brain somehow represents stimuli in the external world. The decoding of neural data would be impossible if the neurons were firing randomly: nothing would be represented. This process of decoding neural data forms a loop with neural encoding. First, the organism must be able to perceive a set of stimuli in the world - say a picture of a hat. Seeing the stimuli must result in some internal learning: the encoding stage. After varying the range of stimuli that is presented to the observer, we expect the neurons to adapt to the statistical properties of the signals, encoding those that occur most frequently:^[2] the efficient-coding hypothesis. Now neural decoding is the process of taking these statistical consistencies, a statistical model of the world, and reproducing the stimuli. This may map to the process of thinking and acting, which in turn guide what stimuli we receive, and thus, completing the loop.

In order to build a model of neural spike data, one must both understand how information is originally stored in the brain and how this information is used at a later point in time. This neural coding and decoding loop is a symbiotic relationship and the crux of the brain's learning algorithm. Furthermore, the processes that underlie neural decoding and encoding are very tightly coupled and may lead to varying levels of representative ability^[3]^[4]

Spatial resolutions

Much of the neural decoding problem depends on the spatial resolution of the data being collected. The goal here is to answer the question: how many neurons do I need to record in order to reconstruct the stimulus with reasonable accuracy. This question intimately relates to the means by which data is collected as it relates to the area being recorded. Neurons with small areas of coverage such as rods and cones in the retina may require more recordings than simple cells in the primary visual cortex. Here, rods and cones only respond to the color of small visual area, while simple cells respond to the orientation of lines.

Previous recording methods relied on stimulating single neurons over a repeated series of tests in order to generalize this neuron's behavior.^[5] New techniques such as high-density multi-electrode array recordings and multi-photon calcium imaging techniques now make it possible to record from upwards of a few hundred neurons. Even with better recording techniques, the focus of these recordings must be on an area of the brain that is both manageable and qualitatively understood. Many studies look at spike train data gathered from the ganglion cells in the retina. Of all the possible subset of neurons to study, this particular area has the benefits of being strictly feedforward, retinotopic, and amenable to current recording granularities. The duration, intensity, and location of the stimulus can be controlled guaranteeing that a predetermined number of ganglion cells can be sampled within a significantly structured microcosm of the visual system.^[6] In addition to the visual system, other studies evaluate the discriminatory ability of rat facial whiskers^[7] and the olfactory coding of moth pheromone receptor neurons^[8] as mediums for collected spike train data.

Even with ever-improving recording techniques, one will always run into the limited sampling problem: given a limited number of recording trials, it is impossible to completely account for the error associated with noisy data obtained from stochastically functioning neurons (for example, a neuron's electric potential fluctuates around its resting potential due to a constant influx and efflux of sodium and potassium ions). Therefore, it is not possible to perfectly reconstruct a stimulus from spike data. Luckily, even with noisy data, the stimulus can still be reconstructed within acceptable error bounds.^[9]

Temporal resolutions

Another important consideration to take into account when decoding the neural code are the timescales and frequencies of the stimulus being presented to the observer. Quicker timescales and higher frequencies demand faster and more precise responses in neural spike data. In humans, millisecond precision has been observed throughout the visual cortex, the retina,^[10] and the lateral geniculate nucleus, so one would suspect this to be the appropriate measuring frequency. This has been confirmed in studies that quantify the responses of neurons in the lateral geniculate nucleus to white-noise and naturalistic movie stimuli.^[11] At the cellular level, spike-timing-dependent plasticity operates at millisecond timescales;^[12] therefore, models seeking biological relevance should be able to perform at these temporal scales.

Probabilistic decoding

When decoding neural data, arrival times of each spike $t_{1},{\text{ }}t_{2},{\text{ }}...,{\text{ }}t_{n}{\text{ }}={\text{ }}\{t_{i}\}$ , and the probability of seeing a certain stimulus, $P[s(t)]$ may be the extent of the available data. The prior distribution $P[s(t)]$ defines an ensemble of signals, and represents the likelihood of seeing a stimulus in the world based on previous experience. The spike times may also be drawn from a distribution $P[\{t_{i}\}]$ ; however, what we want to know is the probability distribution over a set of stimuli given a series of spike trains $P[s(t)|\{t_{i}\}]$ , which is called the response-conditional ensemble. What remains is the characterization of the neural code by translating stimuli into spikes, $P[\{t_{i}\}|s(t)]$ ; the traditional approach to calculating this probability distribution has been to fix the stimulus and examine the responses of the neuron. Combining everything using Bayes' Rule results in the simplified probabilistic characterization of neural decoding: $P[s(t)|\{t_{i}\}]=P[\{t_{i}\}|s(t)]*(P[s(t)]/P[\{t_{i}\}])$ . An area of active research consists of finding better ways of representing and determining $P[\{t_{i}\}|s(t)]$ .^[13] The following are some such examples.

Spike train number

The simplest coding strategy is the spike train number coding. This method assumes that the spike number is the most important quantification of spike train data. In spike train number coding, each stimulus is represented by a unique firing rate across the sampled neurons. The color red may be signified by 5 total spikes across the entire set of neurons, while the color green may be 10 spikes; each spike is pooled together into an overall count. This is represented by:

$P(r|s)=\prod _{{}}P(n_{{ij}}|s)$

where $r=n=$ the number of spikes, $n_{{ij}}$ is the number of spikes of neuron $i$ at stimulus presentation time $j$ , and s is the stimulus.

Instantaneous rate code

Adding a small temporal component results in the spike timing coding strategy. Here, the main quantity measured is the number of spikes that occur within a predefined window of time T. This method adds another dimension to the previous. This timing code is given by:

$P(r|s)=\prod _{{l}}\left[\prod _{{i,j}}v_{i}(t_{{ijl}}|s)dt\right]exp\left[-\sum _{{i}}\int _{{0}}^{{T}}dtv_{i}(t|s)\right]$

where $t_{{ijl}}$ is the jth spike on the lth presentation of neuron i, $v_{i}(t|s)$ is the firing rate of neuron i at time t, and 0 to T is the start to stop times of each trial.

Temporal correlation

Temporal correlation code, as the name states, adds correlations between individual spikes. This means that the time between a spike $t_{i}$ and its preceding spike $t_{{i-1}}$ is included. This is given by:

$P(r|s)=\prod _{{l}}\left[\prod _{{i,j}}v_{i}(t_{{ijl}},\tau (t_{{ijl}})|s)dt\right]exp\left[-\sum _{{i}}\int _{{0}}^{{T}}dtv_{i}(t,\tau (t)|s)\right]$

where $\tau (t)$ is the time interval between a neurons spike and the one preceding it.

Ising decoder

Another description of neural spike train data uses the Ising model borrowed from the physics of magnetic spins. Because neural spike trains effectively binarized(either on or off) at small time scales (10 to 20 ms), the Ising model is able to effectively capture the present pairwise correlations,^[14] and is given by:

$P(r|s)={\frac {1}{\mathrm{Z} (s)}}exp\left(\sum _{{i}}h_{i}(s)r_{i}+{\frac {1}{2}}\sum _{{i\neq j}}J_{{ij}}(s)r_{i}r_{j}\right)$

where $r=(r_{1},r_{2},...,r_{n})^{T}$ is the set of binary responses of neuron i, $h_{i}$ is the external fields function, $J_{{ij}}$ is the pairwise couplings function, and $\mathrm{Z} (s)$ is the partition function.

Agent-based decoding

In addition to the probabilistic approach, agent-based models exist that capture the spatial dynamics of the neural system under scrutiny. One such model is hierarchical temporal memory, which is a machine learning framework that organizes visual perception problem into a hierarchy of interacting nodes (neurons). The connections between nodes on the same levels and a lower levels are termed synapses, and their interactions are subsequently learning. Synapse strengths modulate learning and are altered based on the temporal and spatial firing of nodes in response to input patterns.^[15]^[16]

While it is possible to take the firing rates of these modeled neurons, and transform them into the probabilistic and mathematical frameworks described above, agent-based models provide the ability to observe the behavior of the entire population of modeled neurons. Researchers can circumvent the limitations implicit with lab-based recording techniques. Because this approach does rely on modeling biological systems, error arises in the assumptions made by the researcher and in the data used in parameter estimation.

Applicability

The advancement in our understanding of neural decoding benefits the development of brain-machine interfaces, prosthetics^[17] and the understanding of neurological disorders such as epilepsy.^[18]

References

↑ Jacobs AL, Fridman G, Douglas RM, et al. (April 2009). "Ruling out and ruling in neural codes". Proc. Natl. Acad. Sci. U.S.A. 106 (14): 5936–41. doi:10.1073/pnas.0900573106. PMC 2657589. PMID 19297621. Cite uses deprecated parameters (help)
↑ Barlow, H. (1961). Possible principles underlying the transformation of sensory messages. Sensory communication.
↑ Chacron MJ, Longtin A, Maler L (2004). "To burst or not to burst?". J Comput Neurosci 17 (2): 127–36. doi:10.1023/B:JCNS.0000037677.58916.6b. PMID 15306735.
↑ Boloori AR, Jenks RA, Desbordes G, Stanley GB (July 2010). "Encoding and decoding cortical representations of tactile features in the vibrissa system". J. Neurosci. 30 (30): 9990–10005. doi:10.1523/JNEUROSCI.0807-10.2010. PMC 2957657. PMID 20668184. Cite uses deprecated parameters (help)
↑ Hubel DH, Wiesel TN, LeVay S (April 1977). "Plasticity of ocular dominance columns in monkey striate cortex". Philos. Trans. R. Soc. Lond., B, Biol. Sci. 278 (961): 377–409. PMID 19791. Cite uses deprecated parameters (help)
↑ Warland DK, Reinagel P, Meister M (November 1997). "Decoding visual information from a population of retinal ganglion cells". J. Neurophysiol. 78 (5): 2336–50. PMID 9356386. Cite uses deprecated parameters (help)
↑ Arabzadeh E, Panzeri S, Diamond ME (September 2006). "Deciphering the spike train of a sensory neuron: counts and temporal patterns in the rat whisker pathway". J. Neurosci. 26 (36): 9216–26. doi:10.1523/JNEUROSCI.1491-06.2006. PMID 16957078. Cite uses deprecated parameters (help)
↑ Kostal L, Lansky P, Rospars JP (April 2008). "Efficient olfactory coding in the pheromone receptor neuron of a moth". PLoS Comput. Biol. 4 (4): e1000053. doi:10.1371/journal.pcbi.1000053. PMC 2291565. PMID 18437217. Cite uses deprecated parameters (help)
↑ Rolls ET, Treves A (November 2011). "The neuronal encoding of information in the brain". Prog. Neurobiol. 95 (3): 448–90. doi:10.1016/j.pneurobio.2011.08.002. PMID 21907758. Cite uses deprecated parameters (help)
↑ Berry MJ, Meister M (March 1998). "Refractoriness and neural precision". J. Neurosci. 18 (6): 2200–11. PMID 9482804. Cite uses deprecated parameters (help)
↑ Butts DA, Weng C, Jin J, et al. (September 2007). "Temporal precision in the neural code and the timescales of natural vision". Nature 449 (7158): 92–5. doi:10.1038/nature06105. PMID 17805296. Cite uses deprecated parameters (help)
↑ Song S, Miller KD, Abbott LF (September 2000). "Competitive Hebbian learning through spike-timing-dependent synaptic plasticity". Nat. Neurosci. 3 (9): 919–26. doi:10.1038/78829. PMID 10966623. Cite uses deprecated parameters (help)
↑ Rieke, F. (1999). Spikes: exploring the neural code. exploring the neural code (p. 395). The MIT Press.
↑ Schaub MT, Schultz SR (February 2012). "The Ising decoder: reading out the activity of large neural ensembles". J Comput Neurosci 32 (1): 101–18. doi:10.1007/s10827-011-0342-z. PMID 21667155. Cite uses deprecated parameters (help)
↑ Hawkins, J., Ahmad, S., & Dubinsky, D. (2006). Hierarchical temporal memory: Concepts, theory and terminology. Whitepaper.
↑ Hawkins, J., & Blakeslee, S. (2005). On intelligence. Owl Books.
↑ Donoghue JP (November 2002). "Connecting cortex to machines: recent advances in brain interfaces". Nat. Neurosci. 5 (Suppl): 1085–8. doi:10.1038/nn947. PMID 12403992. Cite uses deprecated parameters (help)
↑ Rolston JD, Desai SA, Laxpati NG, Gross RE (October 2011). "Electrical stimulation for epilepsy: experimental approaches". Neurosurg. Clin. N. Am. 22 (4): 425–42, v. doi:10.1016/j.nec.2011.07.010. PMC 3190668. PMID 21939841. Cite uses deprecated parameters (help)