Temporal Inhibition in Character Identification
Thomas A. Busey
Indiana University
In press: Perception & Psychophysics
Please send correspondence to: Thomas A. Busey
Department of Psychology
Indiana University
Bloomington, IN 47405
email: busey@indiana.edu
Abstract
Models of information processing tasks such as character identification often do not consider the nature of the initial sensory representation from which task-relevant information is extracted. An important component of this representation is temporal inhibition, in which the response to a stimulus may inhibit, or in some cases facilitate, processing of subsequent stimuli. Three experiments demonstrate the existence of temporal inhibitory processes in information processing tasks such as character identification and digit recall. An existing information processing model is extended to account for these effects, based in part on models from the detection literature. These experiments also discriminate between candidate neural mechanisms of the temporal inhibition. Implications for the transient deficit theory of dyslexia are discussed.
Despite the fact that we see the world as a continuous stream of visual stimulation, the input to the eyes consists of discrete quantal events. No two photons arrive at exactly the same time, and a visual system that reported the locations of arriving photons with infinite temporal fidelity would simply report a flurry of pinpoints, each representing one photon capture. In order to provide a coherent representation of a scene, the visual system must integrate information over some interval to temporally group photons coming from the same object. To quantify the notion of an integration interval, the response to a stimulus may be characterized by a temporal wave form that represents the result of temporally grouping the arriving photons to form some response that varies over time. Upon stimulus onset this response may rise to some level, and then decay away sometime after stimulus offset. This response is termed the initial sensory response and is assumed to represent the output of neural mechanisms located early on in the visual stream (e.g. the retina and primary visual cortex). It has long been known that the response to a stimulus outlasts its offset (see Coltheart, 1980 for an excellent review), and the characteristics of this initial sensory response reflect this fact. All subsequent visual processing is based on this initial representation, and thus the first step in characterizing the mechanisms that underlie object or character identification must address the nature of this initial sensory representation.
The nature of this sensory response function has previously been shown to be important for models of detection (Sperling & Sondhi, 1968; Watson, 1986), intensity-duration tradeoffs in information processing tasks (Loftus & Ruthruff, 1994), metacontrast masking (Breitmeyer, 1984), completeness ratings and temporal integration performance (Loftus and Irwin, submitted), synchrony judgment tasks (Hogben & Di Lollo, 1974; Matin & Bowen, 1976) and more recently character identification tasks (Loftus, Busey & Senders, 1993; Busey & Loftus, 1994). In general the shape of the sensory response function determines the visibility of the stimulus and the duration of its persistence, which in turn affects the amount of information that can be extracted from the stimulus and the amount of interaction with subsequent stimuli such as backward masks. Models of information processing tasks typically don't consider the nature of this initial sensory representation and instead focus on the processes that are involved with information extraction (e.g. Rumelhart, 1970; Townsend, 1981; Fisher, 1982). More recent model have begun to account for the continuous accumulation of information in information processing tasks (e.g. Eriksen & Schultz, 1979; Gegenfurtner & Sperling, 1993).
Characterizing the nature of the initial sensory representation and describing how its properties can affect performance in information processing tasks such as character recognition is the goal of the present article. To accomplish this goal, I consider models that have been applied to simple detection tasks and conjoin these with an existing information processing model. The resulting theory bridges both the detection and information processing literatures, and is used to address whether results derived from the detection of disks and gratings applies to the identification of letters as well.
Many researchers attempting to characterize the nature of the sensory response function have used a common paradigm, called the Two-Pulse paradigm, to measure the temporal properties of the initial sensory representation. In the Two-Pulse paradigm a stimulus is presented twice for two brief pulses, each time in the same location, separated by an interstimulus interval (ISI) that typically ranges from 0-150 ms. Each pulse will produce a response in the visual system, and the two responses will interact in different ways for different ISIs. The time between pulses is varied to find the longest interval that still provides some evidence of interaction between the percepts of the two pulses. From this estimate of the integration interval, mathematical models are proposed that derive the shape of the sensory response function. This paradigm has been used with disks (e.g. Bouman and van den Brink, 1952; Blackwell, 1963; Purcell and Stewart, 1971), gratings (Watson and Nachmias, 1977), gaussian patches (Bergen and Wilson, 1985), two half displays (Hogben & Di Lollo, 1974) and digits (Busey and Loftus, 1994) in both detection and identification tasks.
As researchers derived estimates of the integration interval, several noted that the form of the interaction between the sensations elicited by the two pulses was more complex than simple summation. Ikeda (1965) provided surprising evidence for an inhibitory interaction at some delay intervals: When two 12.5 ms pulses were separated by a 40-70 ms interstimulus interval, performance was worse than what would be expected based on probability summation of the individual responses. It appears from these data that the response to the first pulse interfered with or inhibited the processing of the second pulse. These effects have become known as temporal inhibition. Ikeda's innovation was to include a condition in which the second pulse could be of opposite contrast, which provides evidence that the two responses could facilitate each other at moderate ISI's (30-50 ms) despite the fact that one was light gray and the other was dark gray on a gray background. This facilitation between opposite-contrast pulses was described by Ikeda as additional evidence for temporal inhibition, under the assumption that a light gray stimulus produces a response that is initially excitatory and then inhibitory. This inhibitory portion of the response can facilitate the excitatory portion of a response created by a dark gray pulse, which is assumed to be compatible with the inhibitory portion of the first pulse's response.
Why might the visual system inhibit its own signal? Some clues come from the spatial domain: lateral inhibition sharpens the percept of an edge such that the response from receptors responding to the dark side of an edge is reduced as a result of inhibition coming from receptors responding to the light side of an edge. We might expect to find similar mechanisms in the temporal domain, as the visual system attempts to sharpen a temporal edge caused by the appearance or disappearance of an object. The importance of inhibition has long been known in the masking literature, where lateral inhibition and several different mechanisms of temporal inhibition have accounted for metacontrast masking, backward masking and saccadic suppression (see Breitmeyer, 1984).
The previous work on the effects of temporal inhibition have used simple stimuli in detection tasks. Using disks and spatial frequency gratings of various sizes, researchers found evidence for temporal inhibition in a wide range of conditions. However, none used more complex tasks and stimuli that are commonly associated with information processing tasks. For instance, grating detection requires only distinguishing the presence of some pattern relative to a blank screen, while character identification requires the classification of a pattern into one of many well-learned categories. In addition, letters are spatially compact and contain a broad range of spatial frequencies, while spatial frequency gratings are the opposite. Thus letters are expected to be mediated by multiple visual channels, while gratings might be mediated by only a few. Alexander, Xie & Derlacki (1994) point out that simple stimuli such as gratings may not be sensitive to visual pathologies in persons with disorders of the visual system. Periodic test stimuli such as gratings may not reveal the extent of spatial undersampling in uncorrected refractive error, while the nonperiodic nature of letters may provide a better probe of this disorder. Thus the relatively complex spatial properties of letters suggest that findings from the detection literature may not apply directly to character identification.
There are several reasons why temporal inhibition is important for information processing tasks such as character identification. First, previous models have demonstrated that the nature of the initial sensory representation can affect the rate at which information is extracted from this representation (e.g. Busey & Loftus, 1994). Temporal inhibition qualitatively changes the nature of the initial sensory representation and can therefore affect later processing stages as well. Second, temporal inhibition by definition affects the processing of stimuli that follow the offset of an object. Tasks such as backward masking, metacontrast masking, temporal integration, synchrony judgment, partial report and rapid serial visual processing (RSVP) may all be influenced to some degree by temporal inhibition. Third, temporal inhibition may provide a signature of the underlying pathways or mechanisms at work in a given task. By delimiting the range of stimuli and tasks that show evidence of temporal inhibition, we can draw conclusions about the properties of the mechanisms that produce this effect. Finally, evidence for temporal inhibition in information processing tasks such as character identification may have implications for disabilities such as dyslexia, in which a possible deficit in processing high temporal frequencies has been suggested (Lovegrove, Garzia & Nicholson, 1990; Williams & Lecluyse, 1990). I return to this final point in the General Discussion.
It is not at all clear a priori that information processing tasks such as character identification will produce the temporal inhibition effects seen with disks and gratings. In the spatial domain, Solomon & Pelli (1994) demonstrate that letters and gratings were mediated by the same spatial frequency channels. An important mediator of temporal inhibition is the spatial frequencies that are contained in the stimulus. Ohtani and Ejima (1988) demonstrate that temporal inhibition will only occur for sine-wave gratings with spatial frequencies lower than .75-1.5 cycles per degree, depending on the background luminance. Letter recognition may not depend on these low spatial frequencies. Solomon & Pelli (1994) demonstrate that letters are recognized primarily on the basis of spatial frequencies in an octave range centered at 3.2 cycles per degree (3 cycles per letter), which has a lower cutoff around 1.5 cycles per degree. If the spatial frequencies that underlie letter identification are higher than those that produce temporal inhibition, we will not observe effects consistent with temporal inhibition in a character identification task.
Alexander, Xie & Derlacki (1994) addressed whether the temporal characteristics of letters and gratings differed. They measured the spatial contrast sensitivity functions for Sloan letters at different sizes and D6 gratings at various spatial frequencies, and found systematic differences that depend upon temporal frequency. The size of the Sloan letters is expressed as the minimum angle of resolution (MAR) in units of arc minutes, while a D6 grating is the 6th spatial derivative of a gaussian function in units of cycles per degree. These stimuli were flickered at either 2 or 16 Hz. At the 2 Hz flicker rate, systematic differences exist between the two spatial contrast sensitivity functions, demonstrating that temporal frequency had a systematic effect on the contrast sensitivity function for the two patterns. Given this finding, we might expect differences in the temporal domain between experiments that used gratings or disks and the present experiments that use digits. The letter stimuli selectively improves sensitivity relative to the gratings at the lowest temporal frequencies, suggesting that the letters might rely on lower temporal frequencies than gratings. As a result, letters may not show evidence for temporal inhibition, since temporal inhibition helps the system preserve and respond to higher temporal frequencies.
The results of these studies do not provide a clear answer to the question of whether the mechanisms that produce temporal inhibition contribute to character identification tasks. The current article addresses this question by applying an existing information processing model to tasks adapted from the detection literature. This article is organized as follows. I begin with a brief summary of the experimental variables that influence temporal inhibition in detection tasks. I then briefly describe an existing information processing model that can be used to model the empirical results. In three different paradigms I provide empirical evidence that temporal inhibition does indeed exist in character identification tasks, and I show how the an existing model can account for these effects. Finally, I discuss possible mechanisms that underlie temporal inhibition and its implications for other information processing tasks.
Variable Affecting Temporal Inhibition
Spatial frequency, stimulus size and background adaptation level all have been shown to affect temporal inhibition, and the results of a variety of studies are summarized in Table 1. Based on a meta-analysis of these findings, Ikeda (1986) concluded that for disks, inhibitory phenomena only appear when the stimulus size becomes larger than about 13 min of arc. However, the temporal delay between the pulses at which inhibition is largest is not affected by the size of the stimulus beyond 13 min of arc. A parametric study by Ohtani & Ejima (1988) demonstrated that for sine-wave gratings, temporal inhibition occurs only for gratings with spatial frequencies larger that 1.5-0.75 cycles per degree, depending on the background luminance.
Blackwell (1963) found evidence for temporal inhibition only for stimuli shown at a background luminance of 10 ft-lamberts. Roufs (1973) used 1° stimuli on three different background levels, and found an inhibitory effect for stimuli shown at background levels of 42 trolands and 1200 trolands, but not at 1 troland. Thus it appears that inhibition depends upon the background level, although the critical background level is not fully defined. As the visual system becomes exposed to higher and higher light levels, the integration period grows shorter and shorter. This fits with the intuitive model that a visual system should integrate over longer periods in low light levels to gain sensitivity, but integrate over shorter intervals in higher light levels to gain responsiveness to changes in the visual scene.
Insert Table 1 About Here
A number of different mathematical models have been developed to account for the effects of temporal inhibition, and these are summarized in Table 2. Most of the models follow Ikeda's (1965) proposal that the visual system acted as a temporal filter, which had the effect of temporally blurring the rapid changes created by the onset and offset of each pulse and causing the response to a stimulus to persist and interact with subsequent pulses. The behavior of the temporal filter is characterized by an impulse response function that defines the severity of the temporal blurring. The models then include some form of non-linearity and a decision-making rule that allows prediction of contrast thresholds.
Insert Table 2 About Here
Conjoining Temporal Inhibition with an Information Processing Model
The models of Ikeda (1965), Rashbass (1970), Sperling & Sondhi (1968), and Watson & Nachmias (1977) provide a springboard for the modern linear filter models that have been proposed to account for many of the relationships between stimulus variables and temporal sensitivity summarized in Table 1. While each differ in form of the impulse response function and the nature of the response mechanisms, they have been limited to modeling simple detection tasks. Additional model components are required to account for information processing tasks such as character identification. A model that includes all of these characteristics is developed below as an extension of an extant theory, known as the Linear Systems Theory (LST), proposed by Loftus, Busey & Senders (1993) and Busey & Loftus (1994).
Insert Figure 1 About Here
Figure 1 shows the components of the LST model that can be used to a) account for the temporal summation that occurs in the Two-Pulse data, b) determine the temporal characteristics of the sensory representation used in a given visual task and c) provide evidence for temporal inhibition in information processing tasks. The onset and offset of the digits are represented as changes in contrast over time, defined as a function f(t), which is then temporally low-pass filtered by the model into a sensory representation, a(t), that is a temporally blurred representation of the physical stimulus. Information is then extracted from the sensory response at a rate r(t), which is used to make a performance prediction for a stimulus condition. Each of these components is described below.
Characterization of the Physical Stimulus
The function f(t) represents the contrast of a pulse stimulus over time, as illustrated in the top panel of Figure 1. The negative contrast condition of the two-pulse paradigm sets the contrast to -F. The initial sensory representation engendered by the physical stimulus is generated by convolving the physical stimulus representation with an impulse response function, g(t).
a(t) = f(t) * g(t) =
Eq. 1
The result a(t), called the sensory response function, is shown the bottom panel of Figure 1. The exact form of g(t) is described below, but the effect is that a(t) is a temporal blurring or filtering of the physical stimulus f(t). The shape of the impulse response function, g(t), dictates the type of temporal blurring. The sensory response function component of the LST theory (a(t)) is based on work from Andrew Watson (Watson, 1986), George Sperling (Sperling and Sondhi, 1968) and others working in the temporal domain of perception.
The form of the impulse response function, g(t), is formulated as the difference of two gamma functions, the first of which is large and excitatory and the second is smaller and inhibitory (Watson, 1986):
g(t) =
- s[
] Eq. 2
where t represents the time-constant of the gamma function, n1 and n2 represent the number of exponentially-decaying stages in each gamma function, r represents the delay of the inhibitory component relative to the initial excitatory component and t represents time since stimulus onset. The amount of temporal inhibition is determined by the contribution of the second term, as determined by the parameter s. When s>0, this produces a bi-phasic impulse response function, which is shown as the dashed curve in the middle panel of Figure 1. The left panel of Figure 2 shows two impulse response functions produced by setting s to 0 or greater than zero. The current version of the LST model assumes a monophasic impulse response function, which sets s = 0 and therefore includes just the first term of Eq. 2, as shown as the solid curve in the middle panel of Figure 1. When s=0, the model does not include temporal inhibition.
Insert Figure 2 About Here
Detection data for stimuli such as high spatial frequencies and color are often modeled by setting s to 0, which gives a monophasic impulse response function g(t) as shown as a solid curve in the left panel of Figure 2. An alternative way of representing the same information is by taking the Fourier transform of the impulse response function g(t), which results in the transfer function, G(t), which is analogous to the temporal contrast sensitivity function (TCSF). The TCSF plot shows the sensitivity of a system to different temporal frequencies. The TCSF corresponding to the solid line in the left panel of Figure 2 is given by the solid line in the right panel. These curves represent a purely sustained response, and give a monotonically-decreasing TCSF.
Detection data for stimuli containing mainly low spatial frequencies, or stimuli presented on bright backgrounds, often are modeled by s> 0. In this case the impulse response inhibits processing after an initial excitatory response, which results in an inhibitory lobe in the impulse response function g(t) (dashed line in Figure 2, left panel) and a characteristic TCSF with a decrease in sensitivity at low temporal frequencies (dashed curve in Figure 2, right panel). This decrease is caused by the inhibitory lobe, causes the system to inhibit itself prematurely for slowly changing stimuli.
Information Loss: The Sensory Threshold
All linear filter models include some form of non-linearity after the initial linear filter portions. The LST model has adopted a sensory threshold (Loftus, Busey & Senders, 1993; Busey & Loftus, 1994), such that only that portion of the a(t) function that rises above the threshold contributes to performance. In addition, because information relevant to character identification is present in both positive and negative contrast pulses, we take the absolute value of the signal after assessing the sensory threshold. Mathematically, this is expressed as:
![]()
This rectification simply states that information can be acquired from both positive- and negative-contrast stimuli, not that observers cannot tell the difference between a light and dark digit. The exact form of the sensory non-linearity is not critical: Watson's probability summation in time model (Watson, 1978), which uses a power function as a non-linearity, will make equivalent predictions. The power function acts very similar to a hard threshold, since they both tend to punish weak responses and reward strong responses. For example, a long dim stimulus may never rise above threshold and never produce a response; likewise the same stimulus will produce a very small response in Watson's model, since when the a(t) values are raised to an exponent, small numbers become much smaller while larger numbers close to 1.0 remain relatively unchanged or even become larger. Thus both models selectively impair the responses to weak signals.
Note that the sensory threshold, q, is conceptually quite different than a detection threshold, which is a statistical concept representing a certain performance level. q is also distinct from the decision thresholds used in detection linear filter models, which assume that the stimulus is detected if some representation within the model exceeds the detection threshold.
Acquired Information: The Information Extraction Rate
Once the sensory response function a(t) exceeds the sensory threshold q (as implied by aQ(t) > 0), information is extracted at some acquisition rate . This function represents the instantaneous rate of information acquisition, and is proportional to a) the above-threshold sensory response and b) the proportion of remaining, to-be-acquired, stimulus information. The first part of this expression, the above-threshold sensory response, is given by Eq. 3. For the second half of the expression, define I(t) as the proportion of information acquired by time t. The proportion of remaining stimulus information is simply [1.0 - I(t)]. The derivative of I(t) with respect to time is defined as,
= aQ(t) Eq. 4
where cs represents the constant of proportionality and completes the equality. The model parameter 1/cs also represents the rate at which new (i.e., previously unsampled) features are sampled, although the overall rate of feature acquisition decreases over time as I(t) rises (since 1.0- I(t) decreases).
Busey and Loftus (1994) demonstrated that, with this rate function, the equation relating total acquired information, which we designate I(°), to the above-threshold area under aQ(t), AQ(°), becomes,
I(°) =
. Eq. 5
Performance Predictions
In order to make quantitative predictions the theory requires one additional linking assumption. Total acquired information I(°) is assumed to equal p, the proportion of correctly recalled digits. Thus,
p =
Eq. 6
Equation 6 summarizes an important prediction: that performance is directly related to the above-threshold area under the a(t) function. This relationship is a consequence of Eqs. 1, 3, 4 and 5 and is not an assumption of the theory.
Accounting for Two-Pulse Detection Data: The Role of Temporal Inhibition
Eq. 6 can be used to make predictions for contrast threshold data, which is common in detection experiments. In this case, p is fixed at the performance threshold (usually 75% correct detection), and the predicted contrast F is systematically varied to produce a predicted performance near 75% for a given set of parameters (t, r, s and q). A condition that produces more above-threshold area (for example, a longer stimulus duration) will require a smaller contrast to keep performance at 75% correct. Thus contrast threshold is proportional to the above-threshold area.
Figure 3 plots two-pulse data from Rashbass (1970), along with a description of how the LST model, modified for Temporal Inhibition via Eq 2, accounts for the crossover in sensitivity that occurs between the light/light and light/dark conditions as ISI increases. For a 5 ms ISI, sensitivity is quite good for the light/light condition, but decreases with increasing ISI. This results both from the separation of the two pulses pushing less area about the sensory threshold, as well as the inhibitory lobe from the first pulse reducing the excitatory lobe from the second pulse. Both of these result in less area above threshold, and thus predict a decrease in sensitivity as ISI increases by Eq 6.
The opposite situation exists for the light/dark stimuli. Sensitivity is poor at very short ISIs, since the two pulses tend to cancel. However, as the ISI is increased, the inhibitory negative lobe from the first pulse facilitates the negative-going excitatory lobe from the second pulse, which is assumed to be negative going since it has a negative contrast relative to the first pulse. The facilitation between the two negative-going lobes increases the above-threshold area and thus improves the overall sensitivity to the Light/Dark stimulus at intermediate ISIs by Eq 6.
Insert Figure 3 About Here
Empirical Evidence for Inhibition in Character Detection and
Identification Tasks
The picture that emerges from the two-pulse detection literature is fairly clear: The temporal inhibitory mechanisms depend on the adaptation level, as well as on stimulus size. Stimuli that are shown on adaptation levels higher than about 5 cd/m2 and are larger than about 13 arc min (or containing spatial frequencies larger than .75-1.5 cycles per degree) show evidence of producing inhibitory responses. Delay intervals between the two pulses of about 40-70 ms provide the largest evidence for temporal inhibition. The two-pulse paradigm with both positive- and negative-contrast pulses is perhaps the best paradigm for demonstrating evidence for temporal inhibition.
The goal of the three experiments discussed below is to test these questions: Do character detection and identification tasks show evidence of temporal inhibition that is consistent with the two-pulse detection literature? If so, can the character identification model of Loftus and Busey (Loftus, Busey & Senders, 1993; Busey & Loftus, 1994) be extended to account for this temporal inhibition? Experiment 1 generalizes detection tasks to digit stimuli, using a task in which digits are the to-be-detected stimuli. This verifies that temporal inhibition exists for the detection of characters presented at threshold contrasts. Experiment 2 is a traditional character identification task that examines whether temporal inhibition exists in a digit recall task under supra-threshold conditions. Experiment 3 measures the range of temporal frequencies that contribute to character identification, using digits flickered at different temporal frequencies. Experiment 1 focuses more on the initial visual processing stages, because the identity of the digit is not important. Experiment 2 then examines the temporal inhibition present when the task requires identification of the digit. Experiment 3 provides converging evidence for Experiment 2 using a different paradigm, which discriminates between candidate neural models of temporal inhibition.
Experiment 1: Threshold Detection of Pulsed Digits
To investigate whether character stimuli show the same patterns of temporal inhibition, a single digit was presented for two pulses. This two-pulse stimulus could appear in either of two temporal intervals delimited by low tones. Observers made a two-temporal-interval-forced choice (2TIFC) response of the interval that contained the stimulus. Four types of conditions were used: ++, +-, -+ and --, where + indicates a positive-contrast pulse and - indicates a negative-contrast pulse. The second pulse was delayed for one of 6 intervals ranging from 0 to 120 ms. The pulses were presented in one of two temporal intervals, and the subject had to identify the correct interval; this made the identity of the digit irrelevant. The contrast of the digits was manipulated to determine the contrast that yielded 82% correct detection of any digit. We then modeled these contrast thresholds with the theory to see if the linear filter model of Loftus and Busey could account for the obtained same-contrast and different-contrast thresholds.
Method
Stimulus presentation and response collection was carried out on a Macintosh II computer.
Observers
Three observers, the author (TB), and two male graduate students (SD and MB) participated in Experiments 1 and 2. All observers had participated in a minimum of 1000 trials prior to participating in Experiment 1.
Stimuli and apparatus
The Experiment was controlled by a Macintosh II computer and stimuli were presented on an Apple Monochrome monitor with a Video Attenuator (Pelli & Zhang, 1991) to allow a greater range of luminance values. The screen was calibrated using a Minolta Spotmeter LS110 and the calibration techniques that are part of the VideoToolbox library (Pelli, 1997). Observers sat approximately 57 cm away from the screen in a dimly lit room, and used the computer keypad to respond. The refresh rate of the monitor was 67 Hz.
The background luminance was set to 7.2 cd/m2, and the fixation point had a luminance of 3.7 cd/m2. Contrast was defined as (Pulse Luminance - Background Luminance)/(Pulse Luminance + Background Luminance).
The digits were either a 2 or a 5, drawn in Times-Roman 14 point font. The digits were each 0.50° high by 0.40° wide and always appeared centered vertically on the screen. The top portion of the digit was 0.27° below the fixation point.
Design
In all trials, the duration of each of two pulses was 30 ms (two screen refreshes). The two pulses were separated by one of 6 delays, ranging from 0 ms to 120 ms, during which the screen was the background luminance. The two pulses were always of the same magnitude, although may have been in different directions (positive or negative contrast) depending on the stimulus condition.
Observers completed 25, 96-trial blocks, which provided 100 observations per condition per observer.
Procedure
A trial consisted of two temporal intervals separated by low tones, each of which contained the stimulus with 50% probability. After the second interval, the observer indicated which of the two intervals they thought contained the digit, guessing if necessary. He then received feedback in the form of a tone.
An extension of this experiment was conducted with Observer TB, in which the identity of the digit, which was either 2 or 5, was the relevant stimulus attribute.
An adaptive threshold finding procedure, Quest (Watson and Pelli, 1983), was used to adjust the contrast of the pulses to find the contrast that provided an 82% detection rate. This value optimizes the efficiency of the Quest threshold-finding procedures. The Quest procedure presents the stimulus at various contrasts and uses the resulting detection performance to estimate the contrast threshold. Detection performance at all contrasts is used to estimate the contrast threshold.
Results and Discussion
Figure 4 shows the results of three observers for Experiment 1; the smooth curves through the data points are explained below. For an interstimulus interval (ISI) of zero, observers were more sensitive to the same-contrast condition than the opposite-contrast condition, which is indicated by the lower threshold for the former condition. However, when a small ISI of 15 to 30 ms was introduced, this pattern reversed and all three observers became more sensitive to the opposite-contrast condition than to the same-contrast condition. These differences between the two conditions grew largest at 30-45 ms ISI. At the empirical level, these results show that digit stimuli show the same kinds of detection curves that others classically report for simpler stimuli such as disks or gratings. Thus it appears that the detection of digit stimuli is similar to the detection if disks and gratings.
At the theoretical level, these findings clearly implicate temporal inhibition, and the results are quite similar to Ohtani and Ejima (1988). For comparison with their results, our background level of 7.272 cd/m2 translates to about 90 td, which is in the mesopic range.
Insert Figure 4 About Here
Temporal inhibition as revealed by this task begins at delay intervals of 15-30 ms, and extends to 45-70 ms for the three observers. This is consistent with the ranges given in Table 1, which show temporal inhibition for delay intervals of 30-70 ms for a variety of display conditions.
Differences Across Conditions
The results in Figure 4 are averaged across the sign of the contrast of the stimuli (++ and -- were averaged together, as were +- and -+). Consistent with previous findings (Blackwell, 1963; Boynton, 1972; Rashbass, 1970) no systematic differences were found between the individual conditions. The exception was Observer SD, who had small but systematic deviations between the ++ and -- contrast thresholds. He was more sensitive to the -- condition, which had a dark character on a gray background. However, these deviations were consistent across ISI, and thus the average does not distort the overall pattern of the contrast thresholds.
Identification Task
For direct comparison with the detection task in Experiment 1, identification data was collected for Observer TB. The stimuli were either a 2 or a 5, and the contrast of the digits was systematically adjusted until identification performance reached 81%. The stimuli were presented in a single temporal interval; display conditions were otherwise identical to those of Experiment 1.
The obtained identification contrast sensitivity values are shown in the lower-right panel of Figure 4, along with the corresponding model fit that includes a biphasic impulse-response function. The effects of temporal inhibition for Observer TB in the identification task (see Figure 4, lower-right panel) appear to exist for slightly longer display intervals (45-70 ms) than compared to the detection task for the same observer (30-70 ms). In addition, the size of the temporal inhibition appears slightly smaller for the identification task. In general these data are not much different than the detection data, although the overall sensitivity values differ (note the different ordinate scales).
Model Predictions
The Linear-Filter Model of Character Identification was applied to the data in Figure 4. Following Busey & Loftus (1994, Experiment 6), the theory may be extended to detection tasks by assuming that detection performance is proportional to the above-threshold area under the sensory response function. This theory, which assumes a monophasic impulse response function and therefore no temporal inhibition, is clearly disconfirmed by the evidence for temporal inhibition seen at 45-60 ms ISI ranges in all three observers. However, the model may be extended to account for these data by adopting Watson's working model's characterization of the impulse response function (see Eq. 2). The best-fitting predictions for the revised model are shown in Figure 4. These fits are all quite acceptable, except perhaps for Observer TB at a 120 ms ISI. The parameter values for each fit are given in Table 3, and are all in reasonable ranges.
Experiment 1 clearly demonstrates evidence for temporal inhibition in a detection task in which characters are used as stimuli. This is true even when the identity of the digit becomes important, as with Observer TB. These results indicate the need for a model that assumes temporal inhibition, and when the character identification theory is modified to include this component it provides good predictions for the detection data.
Insert Table 3 About Here
Experiment 2: Identification of Different-Contrast Pulsed Digits
Experiment 1 verified that digit stimuli in a threshold detection task exhibit the same pattern of temporal summation under the same background luminance levels as more simple stimuli such as disks or gratings. For the most part, however, the actual identity of the digit was not perceptible; observers merely had to indicate which temporal interval contained any contrast change. Therefore Experiment 1 did not call upon the cognitive mechanisms that are involved in recognizing and remembering the identity of the digits. Experiment 2 was designed to introduce these cognitive mechanisms, to see how temporal inhibitory processes might influence these processes at supra-threshold contrast levels. This experiment also provides a direct test of the linear-filter model of character identification of Busey and Loftus (1994), which did not assume temporal inhibition.
The task in Experiment 2 is similar to other information processing tasks. In this task, four digits are presented to an observer whose job is to report as many of them as possible, in their correct order, guessing if necessary. The basic performance measure, p, is the proportion of correctly-reported digits, in their correct locations, adjusted for the guessing probability.
Method
Stimulus presentation and response collection were identical to Experiment 1. In Experiment 2 observers viewed two-pulse presentations of four digits. Their task was to perceive the digits and type them into a computer keypad. The same durations and delays used in Experiment 1 were used.
Observers
Three observers, the author (TB), and two male graduate students (SD and MB) participated in both Experiments 1 and 2. All observers had participated in a minimum of 1000 trials prior to participating in Experiment 1.
Stimuli and apparatus
The background luminance was again set to 7.272 cd/m2, and the fixation point had a luminance of 3.7 cd/m2. For two observers (SD and TB) the contrast of the digits was either 6.5% or -6.5% depending on the nature of the condition. A third observer (MB) had contrasts of 7% and -7%. Contrast was defined as (Foreground Luminance - Background Luminance)/(Foreground Luminance + Background Luminance).
The stimuli were adapted from those used in related information processing tasks (e.g. Sperling 1960), and consisted of 12 digits arranged in 3 rows. Observers reported the contents of one row of 4 digits in order, guessing if necessary. Unlike partial report tasks, the observer knew in advance which row to attend to, which was the same for all trials in a block. The use of 3 rows was maintained for consistency with previous information-processing experiments (e.g. Loftus et al, 1993; Busey & Loftus, 1994). To avoid masking effects from the fixation point, only the top and bottom rows were used. The digits were each 0.50° high by 0.40° wide, and the edges of each digit were separated by 0.75° vertically and 0.40° horizontally. This stimulus configuration implies eccentricity differences between the stimuli in Experiment 1 and 2. However, in both experiments the observer was free to fixate on any portion of the display deemed most productive. The observers reported looking at a point in between positions 2 and 3. The entire row of digits spanned 0.50° vertically and 2.8° horizontally.
Design
In all trials, the duration of each of two pulses was 30 ms (two screen refreshes). The two pulses were separated by one of 6 delays, ranging from 0 ms to 120 ms, in which the screen was the background luminance. Observers completed 24, 72-trial blocks, which provided 72 observations per condition per observer.
Procedure
A trial began with a 250-ms warning tone, that reminded observers of the to-be-reported row (either the top or bottom row) for the current block of trials. Following the tone, the digit array was displayed in its appropriate temporal configuration, followed 375 ms later by a response tone. The observer then tried to type the digits in their proper order, guessing if necessary. The observers then received feedback in the form of four tones.
Results and Discussion
Figure 5 shows the results for three observers in Experiment 2, averaged across conditions. For a zero ISI condition performance is much worse in the opposite-contrast conditions, but gradually increases as ISI increases. At a 45 ms ISI, opposite-contrast performance exceeds same-contrast performance for two observers (TB and SD), which is consistent with the results reported in the two-pulse detection literature and Experiment 1. The third observer (MB) does not show this reversal.
At a theoretical level, the data from Observers TB and SD clearly indicate effects of temporal inhibition.
Insert Figure 5 About Here
Temporal inhibition as revealed by this digit-recall task begins at delay intervals of 30-45 ms for the two observers who demonstrate temporal inhibition. The size of these inhibitory effects is reduced compared to the detection data. This is consistent with the identification data for Observer TB in Experiment 1, whose data also showed a reduced inhibitory effect compared to the detection data. However, as in Experiment 1, the range of empirically determined temporal inhibitory effects are consistent with those reported in Table 1. This suggests that the temporal processing of character information and simpler stimuli such as disks and gratings follow similar mechanisms.
Differences Across Conditions
As in Experiment 1, no systematic differences between the ++ and -- or -+ and +- conditions were observed. However, as in Experiment 1 the sole exception was Observer SD, who had small but systematic deviations between the ++ and -- contrast thresholds. Consistent with his Experiment 1 contrast thresholds, he performed better in the -- condition than the ++ condition. However, these deviations were again consistent across ISI, and thus the average does not distort the overall pattern of the obtained performance levels.
Observer TB shows consistent deviations from the model predictions at the 120 ms ISI for both Experiment 1 and 2. The reasons for these are not clear. The long ISI could have provided an opportunity for eye movements to intervene. This would cause the second presentation to appear on a different portion of the retina and therefore not sum completely with the first pulse.
Model Predictions
Model predictions for Experiment 2 were generated using the LST model, modified to include temporal inhibition. These predictions are shown as curves in Figure 6. The need for an impulse response function with an inhibitory lobe is clearly indicated by superior performance at the 45 ms delay for the opposite-signed condition than the same-signed condition. Observers TB and SD demonstrate this pattern, although observer MB does not. Figure 4 demonstrates how temporal inhibition accounts for this data pattern, where more above-threshold area produces a prediction of better performance.
The best-fitting model parameters for these fits are given in Table 3. In general the fits are quite good, with the exception of slight underprediction for Observer TB at short ISI for the opposite-contrast condition, and slight overprediction for the long ISI condition for the same observer. The slight performance decrement below the probability summation prediction for the 120 ms ISI for Observer TB is consistent with the slightly higher contrast thresholds at the same ISI for this observer in Experiment 1.
Observer MB did not show any temporal inhibition at any ISI delay as indicated by the criteria of opposite-contrast performance greater than same-contrast performance. Therefore a version of the character identification theory that does not assume temporal inhibition might fit these data. Such a fit was attempted and parameter values are given in Table 3, but the fit was quite poor, as evidenced by the graph in the lower-right panel of Figure 5 and larger root-mean-squared error given in Table 3 for this fit relative to the version that includes temporal inhibition.
The model fits shown in Figure 5 are fairly close to the data, but for two of the three observers they were obtained using model parameters that are, on the surface, perplexing. For observers MB and SD the ratio of the inhibition t to the excitatory t was less than 1.0. This implies an impulse-response function that has an initial small negative lobe and then a much larger positive lobe. Although most models of Two-Pulse data have an initial excitatory lobe, both Roufs and Blommaert (1981) as well as den Brinker (1988) consider impulse response functions with initial inhibitory lobes. However, two-pulse thresholds do not contain phase information, and therefore will be equally well-fit by a time-reversed impulse response function (Andrew Watson, personal communication, 10/30/95). In this case a small negative lobe followed by a large positive lobe is equivalent to a large positive lobe followed by a small negative lobe. Thus exactly equivalent fits could be obtained by restricting r to be greater that 1.0 in the case of observers MB and SD.
Effect of Background Luminance
One issue not addressed by the Experiment 2 data is the dependence of temporal inhibition on the background luminance seen in Table 1. This is particularly important for characters, since they are often viewed in high luminance conditions, which may affect the amount of temporal inhibition. Ikeda (1986) and others have pointed out the dependence of temporal inhibition on the background luminance level, and this dependence was verified with Observers MB and TB in several small experiments. These two observers were re-tested using the Experiment 2 paradigm at the 45 ms ISI delay, which was chosen because it gives the largest evidence for temporal inhibition. The background luminance was raised by 0.5 log units, to 11.7 cd/m2 and the contrast was decreased to 5% for observer MB and 4.5% for observer TB to keep performance off the ceiling.
Observer MB, who did not show evidence of temporal inhibition at 7.272 cd/m2 background level, shows strong evidence for temporal inhibition at this higher background level. Opposite-contrast performance was 0.660 while same-contrast performance was 0.501. Observer TB showed the same pattern; opposite-contrast performance was 0.437 while same-contrast performance was 0.309. The standard error for each mean was about 0.029. Two-tailed t-tests are significant for both observers at a = 0.05, demonstrating a pattern that is consistent with the detection literature and which implies temporal inhibition at a theoretical level.
According to the results summarized in Table 1, lower background luminance levels do not produce temporal inhibition. To verify this for the character identification task, data was collected from observer TB at the 45 ms ISI delay condition for a background level that was 0.5 log units lower in luminance (1.17 cd/m2) than Experiment 2's background luminance. Contrast was set at 9% to raise performance above chance.
As anticipated, this condition produced no evidence for temporal inhibition; indeed the same-contrast performance (0.379) was actually better than the opposite-contrast performance (0.306). The standard error for each mean was about 0.030. This pattern is consistent with the detection literature at lower background luminance levels in which same-contrast and opposite-contrast performances do not differ. At a theoretical level this does not imply temporal inhibition.
These results confirm the dependence of temporal inhibition on the background luminance level. Although a complete parametric study was not attempted, these data confirm the finding that background luminance levels around 1 cd/m2 do not produce temporal inhibition, while luminance levels above 7 cd/m2 do produce temporal inhibition in most observers.
Experiment 3
Experiments 1 and 2 demonstrate evidence for temporal inhibition in both detection and character identification tasks when digits are used as stimuli. Several authors have proposed mechanisms that underlie temporal inhibition, and the goal of Experiment 3 is to discriminate between these explanations.
Ikeda (1986) noted the dependence of temporal inhibition on stimulus size, and echoed the suggestion of Purcel and Stewart (1971), who suggested that lateral inhibitory mechanisms in the retina might underlie temporal inhibition. This mechanism was further elaborated in the context of the lateral inhibitory mechanisms associated with center-surround cells in the retina by Enroth-Cugell, Robson & Scheitzer-Tong (1983). Under this model, temporal inhibition results from interactions within a channel, in which the surround of a center-surround cell inhibits the excitatory response from the center of the receptive field.
A second mechanism involves transient-on-sustained inhibition, in which the onset or offset of the second pulse generates a rapid transient response that inhibits the slow excitatory processing of the first pulse. This form of inhibition has been used to explain a variety of masking processes (e.g. Breitmeyer, 1984). Breitmeyer (1984) also suggests that the abrupt onsets associated with the second pulse of a two-pulse stimulus could generate a fast inhibitory response in the transient channel that inhibits the processing of the first pulse. Thus the across-channel inhibitory mechanisms might produce inhibitory effects that are consistent with temporal inhibition. Under this model, the temporal inhibition results from activation of the transient system by the abrupt onset of the second pulse.
Experiment 3 addresses the role of abrupt onsets in temporal inhibition by removing all abrupt onsets from the stimulus and determining whether evidence for temporal inhibition persists. The tasks involves flickering a digit at different temporal frequencies and adjusting the contrast of the digit until it is just barely identifiable. Although the task is to identify the digit, the intrinsic temporal properties of the system mediating character identification will place limits on the sensitivity of character identification at different flicker rates. Evidence for temporal inhibition comes from the very slowest flicker rates. Temporal inhibition acts to sharpen the response in the visual system, but for a very slowly flickering stimulus, the system tends to inhibit itself. This results in a decrease in sensitivity at very slow temporal frequencies (e.g. 2 Hz) relative to moderate frequencies (e.g. 8 Hz), and this characteristic fall-off in sensitivity is a signature of temporal inhibition. When sensitivity measured at different temporal frequencies is plotted against the flicker rate, a temporal contrast sensitivity function (TCSF) plot can be used to derive the impulse-response function and demonstrate evidence for temporal inhibition. A TCSF plot that includes this fall-off at low temporal frequencies is directly associated with a bi-phasic impulse response function, as demonstrated by Figure 2.
Alexander, Xie & Derlacki (1994) flickered letters at 2 and 16 Hz, and measured spatial contrast sensitivity functions (SCSF). At the 2 Hz flicker rate they found that the SCSF differed systematically from a function generated from gratings. Such differences did not exist at the 16 Hz flicker rate. The differences at 2 Hz demonstrate that temporal frequency has a systematic effect on the relation between contrast sensitivity and stimulus type. Alexander et al (1994) did not construct a full temporal contrast sensitivity function for their letters.
The TCSF paradigm used in Experiment 3 has several disadvantages compared with the Two-Pulse paradigm used in Experiments 1 and 2. The primary advantage of the Two-Pulse stimulus is that it is temporally compact, such that the longest stimulus durations are in the range of 120-150 ms. Evidence for temporal inhibition in the TCSF paradigm comes from very slow temporal frequencies, necessitating very long (500 ms) stimulus durations. These long durations may allow eye movements which can improve sensitivity at slow temporal frequencies, and obscure evidence for temporal inhibition. I return to this issue in the discussion of Experiment 3. However, this issue does imply that the TCSF paradigm is perhaps less powerful that the Two-Pulse paradigm when used to determine evidence for temporal inhibition. Despite this, a finding of any evidence for temporal inhibition in the TCSF paradigm does rule out transient-on-sustained inhibition as the sole mediator of temporal inhibition.
Method
Contrast sensitivities at seven temporal frequencies ranging from 2 to 24 Hz were obtained by flickering a digit around a gray background according to a sine-wave weighted by a gaussian envelope with standard deviation of 83 ms. Example temporal functions are shown in Figure 6. The stimuli were presented on a Tektronix 604 oscilloscope with a fast P15 phosphor at a 4 ms (250 Hz) refresh rate. To maintain a constant mean luminance, the formula for contrast was changed to contrast = (Lmax - Lmin)/(Lmax + Lmin).
Insert Figure 6 About Here
Participants
Three participants, the author and two graduate students, completed 80 trials at each temporal frequency.
Stimuli and Apparatus
Observers viewed an illuminated patch subtending 1.40° vertically and 1.57° horizontally on the face of a Tektronix 604 oscilloscope. The background luminance was fixed at 20 cd/m2, about 3 times higher than in Experiments 1-2. Stimuli were a 2 and a 5, rendered in the same Times font used in Experiments 1-2. Observers viewed these stimuli from a distance of 86 cm, which resulted in the digits subtending 0.50° vertically and 0.37° horizontally.
Design and Procedure
On each trial the contrast of the pulses was determined by the same adaptive search technique (Quest, Watson & Pelli, 1983) used in Experiment 1, which determines the stimulus contrast that affords 82% correct identification over trials. The resulting contrast threshold is converted to sensitivity by taking the logarithm of the inverse of the contrast threshold.
Results and Discussion
Figure 7 shows TCSF data from Experiment 3 for three observers. Observer TB's data show a decrease in sensitivity at low temporal frequencies, which demonstrates clear evidence for temporal inhibition. Data from Observer NB demonstrates a flattening of the TCS function at low temporal frequencies, which requires a model-based interpretation to determine whether temporal inhibition is required to account for the data. Observer AA shows no sign of a fall-off at low temporal frequencies, and therefore does not require temporal inhibition to account for her data.
Insert Figure 7 About Here
The Experiment 3 data are modeled by computing the Fourier transform of the impulse response function (Eq. 2) and fitting a model that consists only of the impulse response function parameters (t, r and s) along with a sensitivity parameter g that scales the TCSF vertically. This is the Transfer function G(w):
Eq 7
where w is the temporal frequency flicker rate for a condition, g is a sensitivity parameter and t, r and s are the impulse-response function parameters from Eq. 2 that determine the range of temporal frequencies that underlie a given task. The model fits are shown in Figure 7, along with the recovered impulse response functions. In some cases the data did not require temporal inhibition, which was accomplished by setting s to zero, eliminating the second part of Eq. 7. Table 3 shows the best-fitting model parameters for each observer.
Observer TB's data are not well-fit by a monophasic impulse response function (dashed curve) and clearly demonstrates evidence for Temporal Inhibition. Observer NB's data deviate from a monophasic prediction at the 2 Hz flicker rate. One technique that can be used to determine whether temporal inhibition is required to account for observer NB's data is to compute confidence intervals around the impulse response function parameters. If the confidence interval around the s parameter do not include zero, temporal inhibition appears necessary to account for the character identification data.
Confidence intervals around the impulse response function parameters are determined as follows. A distribution of impulse response function parameters was created by using the variability estimates of the thresholds that come from the standard error of the mean (SEM) for each threshold. From the threshold estimates and the SEM for each threshold, a new distribution of threshold estimates was constructed by selecting from a normal distribution with mean equal to each threshold and standard deviation equal to the SEM for that threshold. This created a new set of 7 thresholds. The parameter fitting procedures were then applied to this new set of thresholds and values of t, r, s and g were recorded. This procedure was repeated 200 times, and the resulting 200 values of the parameters construct a sampling distribution with some mean and standard deviation. These distributions can be used to construct confidence intervals around the parameter estimates for Observer NB. The 95% confidence intervals for t, r, s and g were 6.29±0.397, 1.02±0.0265, 0.681±0.0471, and 57.6±5.14 respectively. Since the confidence interval for the s parameter does not include zero, we conclude that Observer NB requires a model that includes temporal inhibition.
The data from Observer AA shows no evidence of a fall-off in sensitivity at low temporal frequencies, and her data is well-fit by a model that does not include temporal inhibition. Observer AA was not as well practiced as Observers NB and TB. To verify that this difference did not influence her data, Observer AA replicated her results in a second round of data collection. The results are shown as Round 2 in Figure 7, and demonstrate that practice slightly improved her overall sensitivity, by an average of 0.075 log units. However, the overall data pattern is the same. For a model that assumes temporal inhibition, the 95% confidence interval around the s parameter includes 0 (0.11±0.22). This indicates that the estimated value for the temporal inhibitory model component is not reliably different from zero and confirms the findings from Round 1: Observer AA does not show evidence of temporal inhibition in the character identification task. Observer AA has no evidence of any reading disorder, and her optical correction is quite small (a small, 1 diopter myopia correction) that corrects her to 20/20 vision.
The Experiment 3 data demonstrate that, at least for some observers, temporal inhibition exists in tasks where there are no abrupt onsets. This suggests that the inhibitory mechanisms are not driven solely by abrupt onsets or offsets, but instead represent the intrinsic temporal properties of the pathways responsible for character identification. The evidence for temporal inhibition is not as strong as in Experiments 1 and 2, and perhaps better evidence for a fall-off at slow temporal frequencies would come from 1 Hz or even .5 Hz flicker rates. However, these stimuli were not used since they leave open the possibility that an eye movement will occur during the stimulus presentation. An eye movement has the tendency to introduce higher temporal frequencies into the stimulus as the eye moves across a fixed spatial pattern. These higher temporal frequencies are compatible with the temporal inhibitory responses and are therefore not reduced by the inhibitory portion as are slower temporal frequencies. The result is an increase in sensitivity at very low temporal frequencies, diminishing the evidence for temporal inhibition. These mechanisms may have been at work in even the 500 ms (2 Hz) presentations of Experiment 3, which may explain the relatively weak evidence for temporal inhibition in the TCSF data.
General Discussion
Four general conclusions are possible from Experiments 1-3. First, together these experiments provide clear evidence for temporal inhibitory processes in the mechanisms that subsume character detection and identification. The data in both the detection and identification tasks is consistent with the two-pulse detection literature findings, and the range of temporal inhibition delay intervals corresponds to the range reported in the literature (see Table 1). Thus it appears that digits are similar to disks and gratings in both detection and identification tasks.
Second, these results disconfirm a linear-filter model with a monophasic impulse response function (Loftus, Busey & Senders, 1993; Busey & Loftus, 1994) for three different tasks involving characters.
Third, when the previous character identification model was altered to allow for temporal inhibition, overall it provided a good account of the data in both experiments.
Fourth, the Experiment 3 data found evidence for temporal inhibition in stimuli that did not contain abrupt onsets, suggesting that the transient-on-sustained inhibition proposed by Breitmeyer & Ganz (1977) cannot be the sole mediator of temporal inhibition. In addition, transient-on-sustained inhibition cannot explain the facilitory processes, since the transient inhibition appears to affect both on- and off-center sustained neurons in the cat (Breitmeyer 1984) and is thus indifferent to contrast polarity. An explanation based on interactions between the magnocellular and parvocellar parallel pathways that extend through the LGN cannot account for the temporal inhibition, since the temporal response properties of the neurons in each pathway are quite similar, and both demonstrate evidence for temporal inhibition (Hawken, et al, 1996). The Experiment 3 data demonstrate that some form of within-channel inhibition is sufficient to generate temporal inhibition. One possible mechanism is the proposal by Purcel and Stewart (1971), who suggested that lateral inhibitory mechanisms in the retina might underlie temporal inhibition. However, this does not rule out the contributions of cortical mechanisms to the temporal properties of the visual pathway responsible for character identification.
Previous Linear-Filter Modeling
There are several reasons why a biphasic impulse response function might not have been necessary to successfully model previous character identification experiments. First, the background luminance might have been low enough such that an inhibitory response was not generated. The background luminances of previous experiments (e.g. Loftus, Busey & Senders, 1993; Busey & Loftus, 1994) were somewhat lower, 3.5 cd/m2, which is below the approximate 5 cd/m2 background given by Sperling and Sondhi as necessary for the appearance of biphasic response functions. Second, the experiments used to test the linear-filter model may not have been powerful enough to disconfirm a monophasic response function.
The version of the theory that includes a monophasic response function is a subset of all biphasic models; thus any data that is well-fit by a monophasic version of the theory will also be fit by a biphasic version of the theory.
Despite the good fits of the linear filter version of the theory to the character identification data, the current conceptualization of the model is limited by the fact that there is no mechanism within the model to control parametrically the inhibitory component of the impulse response function with changes in the background luminance level. This is one advantage of the Sperling and Sondhi (1968) model. Perhaps the character identification model can be adapted to include their conceptualization of the impulse response model based on gain control mechanisms. As with most models of two-pulse performance, the amount of temporal inhibition is represented by a delayed inhibitory signal, although the physiological analog of this signal remain to be characterized.
The results of these experiments demonstrate the need to consider the possibility of temporal inhibition in information processing tasks that are normally placed outside the domain of simple detection experiments. The present studies used near-threshold stimuli on relatively dim backgrounds, and still found evidence for temporal inhibition. The detection literature demonstrates that as the background luminance increases, the amount of temporal inhibition increases as well. In addition, experiments by Bowen (1989) demonstrate that biphasic or even multi-phasic impulse response functions are necessary to account for a phenomenon in which two supra-threshold pulses produce three apparent flashes. This suggests that inhibitory mechanisms are at work in luminance conditions normally associated with reading and other information processing tasks.
Implications For Developmental Dyslexia and Transient/Sustained Inhibition
The present work incorporates temporal inhibition into an information processing model, which is a first step toward linking the lower-level visual and higher-level cognitive mechanisms. Such links are particularly important for disorders such as developmental dyslexia, in which a deficit in high temporal frequencies has been found (Lovegrove, Garzia & Nicholson, 1990; Williams & Lecluyse, 1990). Previously it was unknown whether reading performance could be linked to the flicker-fusion experiments used to test temporal acuity and reading performance. Several authors (Breitmeyer, 1993; Lovegrove, 1993) have suggested that the detection and identification of letters and words is carried out by the parvocellular/sustained system during eye fixations. The magnocellular/transient system is thought to inhibit the parvocellular/sustained system during each saccade, preventing the information from a previous fixation from overlapping with the new information from a new fixation. A deficit in the magnocellular system, as suggested by a loss in high temporal frequency sensitivity, suggests that this necessary inhibition is not taking place. However, Burr et al (1994) found evidence only for suppression of the magnocellular/transient system during eye movements, casting doubt on this theory. In addition, if the parvocellular/sustained system processes only higher spatial frequencies, these frequencies are effectively blurred during a rapid eye movement and need not be suppressed.
More recent studies have failed to find a transient deficit in adults and children with dyslexia. Walther-Müller (1995) tested dyslexic children on temporal contrast sensitivity, line-spread, visual persistence and short-range motion tasks. He found no systematic differences between control and dyslexic groups in any of the four tasks. Hayduk, Bruck and Cavanagh (1996) report no differences between control and dyslexic children and adults on temporal contrast sensitivity and visual search tasks. Unfortunately, in both these studies the temporal contrast sensitivity measurements were conducted with a 2 cycle-per-degree grating. In two-pulse tasks this spatial frequency shows evidence of temporal inhibition only at very high luminance levels (150 cd/m2) (Ohtani and Ejima, 1988). In Ohtani and Ejima's two-pulse experiment, no temporal inhibition was found for 1.5 cpd stimuli on background levels of 8 cd/m2, which is close to the stimulus value of 14 cd/m2 used by Hayduk et al. and 11 cd/m2 used by Walther-Muller. It is possible that the stimuli used to test dyslexic children did not produce temporal inhibition in either controls or dyslexics and therefore could not show a transience deficit. A grating with spatial frequencies larger than .75 cpd produced temporal inhibition at almost all adaptation levels in the Ohtani & Ejima (1988) study. The stimuli used in the search tasks of Hayduk et al were 1° flickering disks, and probably large enough to create temporal inhibition. However, this was a visual search task with supra-threshold stimuli, and is therefore not directly comparable to the temporal contrast sensitivity task.
Despite the apparent inadequacy of the magnocellular/transient explanation of dyslexia, a link between a high temporal acuity loss and dyslexia might still be possible. Rayner and Pollatsek (1989), for example, have argued for a deficit in the eye movements of dyslexics, which may be driven by a mechanism that includes temporal inhibition. The current experiments demonstrate that information processing tasks rely on temporal inhibition, which is a mechanism that improves temporal acuity. If temporal inhibition plays a central role in reading mechanisms, then this would account for the observed correlation between dyslexia and a loss in temporal acuity. Temporal inhibition may inhibit processing in a single pathway and prevent information from one eye fixation from persisting into the next. One possible test of this hypothesis is to determine whether persons with dyslexia produce evidence for temporal inhibition in a two-pulse or TCSF experiment. The entire temporal contrast sensitivity function should be mapped out for both control and dyslexic subjects using a .75 cpd grating, and the control subjects should show a bandpass TCSF while the dyslexic subjects should show a low-pass TCSF if a transience deficit is to be demonstrated. The results from Observer AA in Experiment 3 suggest that such a link may not exist: she shows no evidence of transience in her TCSF, yet she does not suffer from dyslexia. Clearly more evidence from a wider range of control and dyslexic observers is necessary to fully answer this question.
REFERENCES
Bergen, J. R., & Wilson, H. R. (1985). Prediction of flicker sensitivities from temporal three-pulse data. Vision Research, 25, 577-582.
Blackwell, H. R. (1963). Neural theories of simple visual discriminations. Journal of the Optical Society of America, 53, 129-160.
Bouman, M. A., & van den Brink, G. (1952). On the integrate capacity in time and space of the human peripheral retina. Journal of the Optical Society of America, 42(9), 617-620.
Bowen, R. W. (1989). Two pulses seen as three flashes: A superposition analysis. Vision Research, 29, 409-417.
Boynton, R. M. (1972). Discrimination of homogeneous double pulses of light. In Hurvich, L. M., &. Jameson, D. (Eds.), Handbook of Sensory Physiology (pp. 202-232).
Breitmeyer, B. G. & Ganz, L. (1977). Temporal studies with flashed gratings: Inferences about human transient and sustained channels. Vision Research, 17, 861-865.
Breitmeyer, B. G. (1984). Visual masking: an integrative approach . New York: Oxford University Press.
Breitmeyer, B. G. (1993). The roles of sustained (P) and transient (M) channels in reading and reading disability. In S. F. Wright & Groner, R. (Ed.), Facets of dyslexia and its remediation (pp. 13-31). Amsterdam: Elsevier.
Burr, D. C., & Morrone, M. C. (1993). Impulse-response functions for chromatic and achromatic stimuli. Journal of the Optical Society of America: A, 10, 1706-1713.
Burr, D., Morrone, M. & Ross, J. (1994). Selective suppression of the magnocellular visual pathway during saccadic eye movements. Nature, 371, 511-513.
Busey, T.A. & Loftus, G.R. (1994). Sensory and cognitive components of visual information acquisition. Psychological Review , 101, 446-469.
Cogan, A. (1987). Human binocular interaction: Towards a neural model. Vision Research, 27, 2125-2139.
Cogan, A., Clarke, M., Chan, H. & Rossi, A. (1991). Two-pulse monocular and binocular interactions at the differential luminance threshold. Vision Research, 30, 1617-1630.
Coltheart, M. (1980). Iconic memory and visible persistence. Perception & Psychophysics, 27(3), 183-228.
de Lange, H. (1958). Research into the dynamic nature of the human foveal-cortex systems with intermittent and modulated light. I. Attenuation characteristics with white and colored light. Journal of the Optical Society of America, 48, 777-784.
den Brinker, A. C. (1989). A comparison of results from parameter estimations of impulse responses of the transient visual system. Biological Cybernetics, 61, 139-151.
den Brinker, A. C. (1990). Changes with background in the linear model of the transient visual system. Biological Cybernetics, 62, 441-451.
Derrington, A. M., & Henning, G. B. (1980). Pattern discrimination with flickering stimuli. Vision Research, 21, 597-602.
Enroth-Cugell, C., Robson, J.G., Schweitzer-Tong, D.E., and Watson, A. B. (1983). Spatiotemporal interactions in cat retinal gangion cells showing linear spatial summation. Journal of Physiology, 341, 279-301.
Eriksen, C. W., & Schultz, D. W. (1979). Information processing in visual search: a continuous flow conception and experimental results. Perception and Psychophysics, 25, 249-263.
Gegenfurtner, K., & Sperling, G. (1993). Information transfer in iconic memory experiments. Journal of Experimental Psychology: Human Perception and Performance, 19, 845-866.
Georgeson, M. A. (1987). Temporal properties of spatial contrast vision. Vision Research, 27, 765-780.
Hayduk, S., Bruck, M. & Cavanagh, P. (1996). Low-level visual processing skills of adults and children with dyslexia. Cognitive Neuropsychology, 13, 975-1015.
Hawken, M. J., Shapley, R.M., & Grosof, D. H. (1996). Temporal-frequency selectivity in monkey visual cortex. Visual Neuroscience, 13, 477-492.
Hogben, J. H., & Di Lollo, V. (1974). Perceptual integration and perceptual segregation of brief visual stimuli. Vision Research, 14, 1059-1069.
Ikeda, M. (1965). Temporal summation of positive and negative flashes in the visual system. Journal of the Optical Society of America, 55(11), 1527-1534.
Ikeda, M. (1986). Temporal impulse response. Vision Research, 26, 1431-1440.
Krauskopf, J. (1980). Discrimination and detection of changes in luminance. Vision Research, 20, 671-677.
Kulikowski, J.J., & Tolhurst, D.J. (1973). Psychophysical evidence for sustained and transient mechanisms in human vision. Journal of Physiology, 232, 149-163.
Landis, T., Regard, M. Bliestle, A., Kleihues, P. (1988). Prosopagnosia and agnosia for non-canonical views. An autopsied case. Brain, 111, 1287-97.
Lennie, P. (1980). Perceptual signs of parallel pathways. Phil. Trans. R. Soc. Lond. B, 290(23-37), 23-36.
Loftus, G. R., Busey, T.A. & Senders, J. (1993). Providing a sensory basis for models of visual information acquisition. Perception & Psychophysics, 54, 535-554.
Loftus, G.R. & Irwin, D. (submitted) Visible and informational persistence: Different tasks measure different things.
Loftus, G.R. & Ruthruff, E. (1994). A theory of visual information acquisition and visual memory with special application to intensity-duration tradeoffs. Journal of Experimental Psychology: Human Perception and Performance, 20, 33-49.
Loftus, G.R. & Ruthruff, E.R. (1994). A theory of visual information acquisition and visual memory with special application to intensity-duration tradeoffs. Journal of Experimental Psychology: Human Perception and Performance, 20, 33-50.
Lovegrove, W. (1993). Do dyslexics have a visual deficit? In S. F. Wright & Groner, R. (Ed.), Facets of dyslexia and its remediation (pp. 33-49). Amsterdam: Elsevier.
Lovegrove, W., Garzia, R. & Nicholson, S. (1990). Experimental evidence for a transient system deficit in specific reading disability. Journal of the American Optometric Association, 16, 137-146.
Matin, E. (1975). The two-transient (masking) paradigm. Psychological Review, 82, 451-461.
Matin, L. & Bowen, R. W. (1976). Measuring the duration of perception. Perception & Psychophysics, 20, 66-76.
Meijer, J. G., van der Wildt, G., & van den Brink, G. (1978). Twin-flash response as a function of flash diameter. Vision Research, 18, 1111-1116.
Merigan, W. H., & Maunsell, J. H. R. (1993). How parallel are the primate visual pathways? Annual Review of Neuroscience, 16, 369-402.
Merigan, W., Katz, L., & Maunsell, J. (1991). The effects of parvocellular lateral geniculate lesions on the acuity and contrast sensitivity of macaque monkeys. Journal of Neuroscience, 11, 994-1001.
Musselwhite, M. J., & Jeffreys, D. A. (1983). Visual evoked potentials to double-pulse pattern presentation. Vision Research, 23, 135-143.
Ohtani, Y., & Ejima, Y. (1988). Relation between flicker and two-pulse sensitivities for sinusoidal gratings. Vision Research, 28, 145-156.
Ohzawa, I., & Freeman, R. D. (1986). The binocular organization of simple cells in the cat's visual cortex. Journal of Neurophysiology, 56, 221-242.
Pelli, D. G. (1997). The VideoToolbox software for visual psychophysics: Transforming numbers into movies. Spatial Vision, 10, 437-442.
Pelli, D. G. & Zhang, L (1991). Accurate control of contrast on microcomputer displays. Vision Research, 30, 1033-1048.
Purcell, D. G., & Stewart, A. L. (1971). The two-flash threshold: An evaluation of critical-duration and visual-persistence hypotheses. Perception & Psychophysics, 9, 61-64.
Purcell, D., & Stewart, A. (1971). The two-flash threshold: An evaluation of critical-duration and visual-persistence hypotheses. Perception and Psychophysics, 9, 61-64.
Rashbass, C. (1970). The visibility of transient changes of luminance. Journal of Physiology, 210, 165-186.
Rayner, K. &. Pollatsek, A. (1989). The Psychology of Reading . Englewood Cliffs, NJ: Prentice-Hall.
Roufs, J. A. J. (1973). Dynamic properties of vision-III. Twin Flashes, single flashes and flicker fusion. Vision Research, 13, 309-323.
Roufs, J.A.J. & Blommaert, F.J.J. (1981). Temporal impulse and step responses of the human eye obtained psychophysically by means of a drift-correcting perturbation technique. Vision Research, 21, 1203-1221.
Rumelhart, D.E. (1970). A multicomponent theory of the perception of briefly exposed visual displays. Journal of Mathematical Psychology, 7, 191-218.
Smith, P. L. (1995). Psychophysically principled models of visual simple reaction time. Psychological Review, 102, 567-593.
Solomon, J. A. & Pelli, D. G. (1994). The visual filter mediating letter identification. Nature, 369, 395- 397.
Sperling, G. (1960). The information available in brief visual presentations. Psychological Monographs, 74, 1-29.
Sperling, G., & Sondhi, M.M. (1968). Model for visual luminance discrimination and flicker detection. Journal of the Optical Society of America, 58, 1133-1145.
Theodor, L. H. (1972). The detectability of a brief gap in a pulse of light as a function of its temporal location within the pulse. Perception and Psychophysics, 12, 168-170.
Townsend, J. T. (1981). Some characteristics of visual whole report behavior. Acta Psychologica, 47, 149-173.
Uchikawa, K., & Ikeda, M. (1986). Temporal integration of chromatic double pulses for detection of equal-luminance wavelength changes. Journal of the Optical Society of America A, 3, 2109-2115.
Walther-Müller, P. U. (1995). Is there a deficit of early vision in dyslexia? Perception, 24, 919-936.
Watson, A. B. (1978). Probability summation over time. Vision Research, 19, 515-522.
Watson, A. B. (1986). Temporal sensitivity. In K. R. Boff, L. Kaufman, and J.P. Thomas (Eds.), Handbook of Perception and Human Performance (Vol I) New York: Wiley.
Watson, A. B., & Nachmias, J. (1977). Patterns of temporal interaction in the detection of gratings. Vision Research, 17, 893-902.
Watson, A.B., & Pelli, D.G. (1983). QUEST: A Bayesian adaptive psychometric method. Perception & Psychophysics, 33, 113-120.
Williams, M. &. Lecluyse, K. (1990). Perceptual consequences of a temporal processing deficit in reading disabled children. Journal of the American Optometric Association, 61, 111-121.
Author Notes
This research was funded in part by an NIMH pre-doctoral grant to Tom Busey, and NINH grant #MH41637 to Geoffrey R. Loftus. Portions of this research were in partial fulfillment of a doctoral dissertation at the University of Washington.
Tables
|
Table 1. Summary of Stimulus Conditions and Results for Various Authors. Adapted from Ikeda (1986) and individual articles. |
|||||||||||
|
Authors |
Test size |
Adapting Level |
Inhibition |
t = temporal delay of inhibition |
Conditions |
||||||
|
Ikeda (1965) |
30' |
61.2 td 328 td |
yes yes |
t = 70 ms t = 53 ms |
++, +-, -+, - - |
||||||
|
Rashbass (1970) |
17° |
700 td |
yes |
t = 66 ms |
++, +-, -+, - - |
||||||
|
Purcell & Stewart (1971) |
38' |
25 ft-l |
yes |
-- |
|||||||
|
Theodor (1972) |
10' |
25 ft-l |
yes |
-- |
++ |
||||||
|
Roufs (1973) |
1° |
1 td 42 td 1200 td |
no yes yes |
-- t = 56 ms t = 32 ms |
++ |
||||||
|
Breitmeyer & Ganz (1977) |
1.0 c/deg 10 c/deg |
5.0 ft-l |
yes no |
t = 80 ms |
Gratings: ++ |
||||||
|
Watson & Nachmias (1977) |
2° x 1.5° 1.8 c/deg 3.5 c/deg 7.0 c/deg 10 c/deg |
15 cd/m2 |
yes yes no no |
t = 60 ms t = 60 ms |
Gratings: ++, +-, -+, - - |
||||||
|
Meijer, van den Wildt & van den Brink (1978) |
5.5' 11' 11' - - 13.7' 66' - - 96' |
120 td |
no no no yes yes yes |
-- -- -- t = 70 ms t = 70 ms t = 70 ms |
++ and some - - |
||||||
|
Roufs & Blommaert (1981) |
0.8' 1° |
1200 td |
no yes |
-- t = 60 ms |
|||||||
|
Musselwhite & Jeffreys (1983) |
grid of dots 16' |
50 cd/m2 2.5 cd/m2 |
no no |
recorded VEP's ++, +-, -+, - - |
|||||||
|
Bergen & Wilson (1985) |
1° 0.125° |
17.2 cd/m2 |
yes yes |
t = 50 ms t = 50 ms |
Difference of gaussian: +++, -+- |
||||||
|
Uchikawa & Ikeda (1986) |
45' chrom. achrom. |
0 cd/m2 |
no no yes |
-- -- t = 40 ms |
++, +-, -+, - - for chromatic & achromatic stimuli |
||||||
|
Georgeson (1987) |
1.5 c/deg 3 c/deg 6 c/deg 12 c/deg |
16 cd/m2 |
yes no no no |
t = 80 ms |
++, +- for above-threshold and near-threshold stimuli. Found TI at all gratings for above-threshold task. |
||||||
|
Bowen (1989) |
0.5 c/deg |
10 td |
yes |
++ supra-threshold |
|||||||
|
Burr & Morrone (1993) |
1 c/deg chrom. achrom. |
16 cd/m2 |
no yes |
t=80 ms |
++, +-, -+, - - for isoluminant & achromatic stimuli |
||||||
|
Table 2. Characteristics of Models of Two-Pulse Performance. Adapted from the original source materials. |
||||||||||
|
Authors |
Stimuli |
Model Specifics |
Nature of the Impulse Response Function |
Predicts Inhibition |
Conditions |
|||||
|
Ikeda (1965) |
30' disks |
Derived a summation index, from which the temporal response functions for positive and negative stimuli may be inferred. |
unknown, but has inhibitory lobe |
yes |
++, +-, -+, - - |
|||||
|
Rashbass (1970) |
17° disks |
Assumed a low-pass linear filter, that is squared and integrated. Predicts elliptical relationship for contrast thresholds. |
unspecified |
yes |
++, +-, -+, - - |
|||||
|
Sperling & Sondhi (1968) |
Gratings |
The nature of the temporal inhibition changes with background luminance level. |
biphasic, varies with background level |
yes |
||||||
|
Watson & Nachmias (1977) |
Gratings |
Extends Rashbass model to include probability summation in time (PST). |
biphasic |
yes |
++, +-, -+, - - |
|||||
|
Roufs (1974) |
1 ° Disks |
Peak detection model conjoined with low-pass linear filter |
biphasic |
yes |
++ |
|||||
|
Uchikawa & Ikeda (1986) |
Iso-Luminant disks |
Adopts Watson's PST model to predict iso-luminant thresholds. No inhibition found in iso-luminant stimuli. |
monophasic |
no |
++ |
|||||
|
Roufs & Blommaert (1981) |
1° foveal disk |
Derives form of impulse response function by looking at interaction of two responses at peak height. Criticized by Watson for ignoring effects of probability summation in time. |
triphasic, with an initial inhibitory lobe |
yes |
++ |
|||||
|
den Brinker (1989, 1990) |
1° disk |
Adopts Roufs & Blommaert model, but accounts for addition of temporal inhibition for higher background levels. |
fourth-order multiphasic with an initial inhibitory lobe |
yes |
++ |
|||||
|
Bowen (1989) |
high-contrast 25' disk |
Looked at ratio of peaks in sensory response function to predict the number of perceived flashes. |
multiphasic, damped sinusoid |
yes |
++ |
|||||
|
Ohtani & Ejima (1988) |
Gratings |
Adopt Watson's PST model. Infer the characteristics of the low-pass filter under different stimulus characteristics. |
biphasic |
yes |
++, +-, -+, - - |
|||||
|
Busey & Loftus (1994) |
Digits |
Assumes a threshold such that information is extracted at some rate once the sensory response function exceeds this threshold. |
monophasic |
no |
++ |
|||||
|
Smith (1995) |
Squares presented in static noise |
Fits reaction time data with a model assuming contributions from transient and sustained mechanisms. |
biphasic |
yes |
+ |
|||||
|
TABLE 3. Summary of Best-Fitting Model Parameters for Experiments 1-3. RMSE's are in units of log(1/contrast) for Experiments 1 and 3, and units of proportion correct for Experiment 2 |
||||||||||
|
Experiment 1 (n=9) |
||||||||||
|
Observer |
t |
c s |
Q |
r |
s |
RMSE |
||||
|
TB |
5.69 |
0.410 |
0.0201 |
2.12 |
0.678 |
0.0364 |
||||
|
SD |
6.43 |
0.310 |
0.0059 |
1.35 |
1.080 |
0.0288 |
||||
|
MB |
7.04 |
0.389 |
0.0138 |
1.64 |
0.666 |
0.0240 |
||||
|
TB Identification |
4.42 |
0.576 |
0.0383 |
3.49 |
0.477 |
0.0387 |
||||
|
Experiment 2 (n=9) |
||||||||||
|
Observer |
t |
c s |
Q |
r |
s |
RMSE |
||||
|
TB |
6.67 |
0.846 |
0.0245 |
1.53 |
0.229 |
0.0269 |
||||
|
SD |
8.17 |
0.494 |
0.0180 |
0.40 |
0.366 |
0.0247 |
||||
|
MB |
10.01 |
1.250 |
0.0198 |
0.42 |
0.272 |
0.0181 |
||||
|
MB- No Inhibition |
5.47 |
1.379 |
0.0317 |
-- |
-- |
0.0217 |
||||
|
Experiment 3 (n=9) |
||||||||||
|
Observer |
t |
r |
s |
g |
RMSE |
No Inhibition RMSE |
||||
|
TB |
4.55 |
1.68 |
0.30 |
25.9 |
0.0419 |
0.0686 |
||||
|
NB |
6.29 |
1.02 |
0.68 |
57.6 |
0.0643 |
0.0713 |
||||
|
AA- Round 1 |
5.21 |
-- |
-- |
22.7 |
-- |
0.0358 |
||||
|
AA- Round 2 |
5.24 |
-- |
-- |
27.1 |
-- |
0.0267 |
||||
|
AA-Round 2 w/Inhibition |
5.32 |
1.75 |
0.11 |
27.8 |
0.0253 |
-- |
||||
Figures
|
|
|
Figure 1. Theoretical components of the linear filter model of character identification. Top Panel: A stimulus is characterized as its contrast over time, in this case a digit appearing for 30 ms at 15% contrast, disappearing for 5 ms, and then re-appearing for 30 ms in the same location and at the same contrast. Middle Panel: The temporal response properties of the visual system are characterized by an impulse response function that determines how long a brief stimulus persists in time. Temporal inhibition produces a biphasic impulse response function (dashed curve). Bottom Panel: The stimulus engenders a response in the visual system that is a function of the stimulus input function f(t) and the impulse response function g(t). The dashed function demonstrates the response produced in a system that includes temporal inhibition. One consequence of the information extraction components of the theory is that performance (or contrast sensitivity) is proportional to the above-threshold area under the sensory response function. |
|
|
|
|
Figure 2. The temporal frequencies underlying a task may be characterized by an impulse response function (left panel) that characterizes the time course of the perceptual response engendered by a stimulus, or by the temporal contrast sensitivity function (right panel), that characterizes the fidelity by which the pathways subserving a given task pass different temporal frequencies. A monophasic impulse response function has no inhibitory lobe, no dip in the TCSF at low temporal frequencies, and by definition no temporal inhibition. A biphasic impulse response function includes all three. Parameters used: Monophasic:{t=4.4, s=0} Biphasic:{t=3.58, r= 2.0,s= 0.39} |
|
|
|
Figure 3. Two-Pulse detection data adapted from Rashbass (1970). The LST model (Busey & Loftus, 1994) can account for temporal inhibition seen in these data when extended with a biphasic impulse response function via Eq 2. Contrast sensitivity is directly related to the above-threshold area. As ISI increases, sensitivity decreases in the same-contrast condition (Panels A and B) as the inhibitory lobe from the first pulse inhibits the excitatory portion of the response to the second pulse (Panel B). Sensitivity increases in the opposite-contrast condition (Panels C and D) since the inhibitory lobe from the first pulse facilitates the negative-going excitatory lobe from the opposite-contrast second pulse (Panel D). |
|
|
|
|
|
|
Figure 4. Experiment 1 results for three observers. Conditions are averaged across stimulus type, such that the ++ and -- conditions are shown as the Same-Contrast case, and the -+ and +- conditions are shown as the Opposite-Contrast case. Error bars represent the standard deviation of the threshold-finding function. The lower-right panel shows the data and model fit for Observer TB for an identification task, which is described in the text. Error bars represent 1 standard deviation of the Quest function (Watson & Pelli, 1983). |
|
|
|
|
|
|
Figure 5. Experiment 2 results for three observers. Background luminance was identical to Experiment 1. Task was recalling 4 digits. Error bars represent 1 standard error of the mean. The predictions for the modified character identification theory are shown as the curves. The lower-right graph shows the relatively poor fit of the non-inhibition model to Observer MB's data. Error bars represent one standard error of the mean. |
||
|
|
|
|
Figure 6. Example temporal wave forms used in Experiment 3. |


Figure 7. Temporal contrast sensitivity functions (TCSFs) for three observers for Experiment 3. The insets show the recovered impulse response functions. Error bars represent the standard deviation of the Quest threshold estimate (Watson & Pelli, 1983).


Figure 7 Continued.