Pitch Perception and Cochlear Implants
Over 59,000 people all over the world have cochlear implants (NIDCD 2006). The cochlea is a rigid, coiled tube that is divided mechanically into two along its length by the basilar membrane. It is an organ in the ear that aids in hearing by transforming a mechanical vibration (input) into action potentials in the fibers of the auditory nerve (Fearn et al. 1999). When people have severe auditory damage that cannot be relieved by hearing aids, they turn to cochlear implants.
Pitch and its Purpose
Pitch is the ear’s response to frequency and is involved in recognition of melody and tone. Without pitch, one would be able to understand the words being spoken to them, but wouldn’t know if they were being asked a question or told a statement (Anon 2005). Pitch is the fundamental frequency, which is the lowest resonant frequency of a vibrating object. All frequencies higher than this are harmonics (multiples) of the fundamental frequency. For example, the third harmonic is the three times larger than the fundamental frequency, while the fourth harmonic is four times larger than the fundamental frequency. The normal frequency range of the human ear is 20-20,000 Hz, this range is often referred to as “hi-fi†(Anon 2005a). Fine structure is a gradually varying envelope, decomposed from the original signal, which modulates an extraordinarily high frequency carrier. It contains more acoustic cue for pitch recognition than a temporal envelope, therefore it is believed that modifying a cochlear implant processor with fine structure may improve pitch perception of the patient (Chen et al. 2006).
In the normal auditory system, after a sound is captured by the outer ear, the three small bones of the middle ear (incus, malleus, and stapes) and the ear drum input a displacement signal into the window of the cochlea. This signal causes a transverse wave to travel down the basilar membrane, which then encodes pitch tonotopically. This means that higher frequencies cause maximum vibrations at the window end and lower frequencies cause maximum vibrations at the apex of the cochlea (place cue). Additionally, the phase locking of the nervous discharge to the pure-tone affects the pitch (timing cue). Then, hair cells on the basilar membrane bend back and forth according to the amount of vibration produced and release an electrochemical substance. This substance causes neurons to fire and action potentials travel down fibers of the auditory nerve. Once the auditory nerve receives these electrical signals, it will send this information about the acoustic signal to the brain to be interpreted (Fearn 1999, O’Reilly 2006, and Loizou 1998).
Cochlear Implants
A cochlear implant, also known as a bionic ear, is a surgically-implanted electronic device that partially restores hearing of deaf people by electric stimulation of the auditory nerve (Fearn 1999). It is different from a hearing aid that simply amplifies sounds that may be detected by damaged ears because a cochlear implant avoids damaged portions of the ear and directly excites the auditory nerve. Cochlear implants carry out the function of the cochlea by using a very specific system. They use an external microphone that attains sound from the environment and delivers it to a speech processor. The speech processor then separates different frequencies by using band-pass filters and sound intensity; it takes this electrical signal down a wire to the transmitter. The transmitter uses electromagnetic induction to transport the processed electric sound to the internal parts of the implant. The receiver and stimulator convert signals into appropriate electrical impulses and send them down internal cables to electrodes. A group of electrodes, called an electrode array, collects the processed and converted signals and sends them to the appropriate parts of the auditory nerve. This electrode array receives an assigned frequency range from about 100 Hz to 8000 Hz (NIDCD 2006a). Due to the electrode array not being able to be inserted entirely into the apex of the cochlea, frequency information cannot be stored in a cochlear implant. This is because there is a mismatch between the assigned speech frequency and the position of the electrodes.
Restoring Pitch
Current cochlear implant devices are poor in pitch perception because they don’t extract and encode spectral or temporal cues appropriately. They have been manufactured to deliver up to 22 frequency channels, however patients haven’t been able to utilize more than eight of those channels (Chen et al. 2006). Many ideas have been introduced to restore pitch in a cochlear implant. Among these ideas are transplantation of the implant, increasing the radius of the electric field of electrodes, bilateral implants, stem cell regeneration of the cochlea, and many more (Smale 2006). This paper will focus mainly on spectral channel enhancements, temporal fine structure, and deep insertion electrode array.
A big way that frequency information can be reintroduced to a cochlear implant is by spectral channel enhancements. Imagine a spectrum of sound that contains many peaks and valleys, depending on the nature of the sound. Patients with damaged hearing have poor auditory filters that allow little decibel difference between these peaks and valleys, causing their perception of speech to be deprived. Spectral channel enhancement increases the difference in decibel level between peaks and valleys that are next to each other. This is done by inhibiting the valleys and enhancing the peaks in the sound spectrum. After spectral channel enhancement, the decibel difference between the peaks and valleys will be more discrete and the patient will be able to distinguish speech in an environment with background noise (Yang et al. 2006). Spectral valleys are reduced by lateral inhibition, which is the suppression of some neurons in order to increase stimulation of other neurons. Lateral inhibition can sharpen average rate profiles by enhancing its output from spatially steep input regions or suppressing its output from spatially smooth input regions (Shamma 1985). Spectral peaks are enhanced by tow-tone suppression, which is reduction in response to one tone due to the presence of another tone. The brain uses tow-tone suppression in audition, as well as vision, to extract vital cues in a noisy environment (Zeng et al. 2005).
Temporal fine structure cueing is also a way to reintroduce frequency information to a cochlear implant. Temporal fine structure constructs a carrier signal for each frequency band after band-pass filtration of the speech signal into multiple frequency bands. This process is carried out by using high-rate sinusoidal pulses from the peak positions of the fine structure. Then, the signal is amplitude modulated by the temporal envelope in the band to create a decomposed band-specific output signal. This improvement for cochlear implants is beneficial when the patient’s temporal envelope contains between four and 16 frequency bands, like when communicating in tonal languages or listening to music (Chen et al. 2006). This method also allows for the same modulation to be applied to all channels, however it is better if channels are stimulated at the same time rather than alone then alone (Green et al. 2005).
A widely proposed method to reintroduce pitch into cochlear implants is to insert the electrode array deeper into the cochlea. This method is currently being tested at the
Alongside the previous ideas curtail many other ideas everyday to improve cochlear implants. Many models are in line to be available on the market, while some ideas are still years away. Nevertheless, cochlear implants are under improvement for the thousands of patients all around the world that are counting on them.
-Amy Shah


