Sound waves are complex, but our brains make sense of them effortlessly. We'll dive into how we perceive loudness, pitch, and timbre. These qualities help us understand and enjoy music, speech, and environmental sounds.

Our ears and brain work together to process sound in amazing ways. We'll explore how we locate sounds, separate them in noisy environments, and deal with masking effects. Understanding these processes sheds light on our auditory experiences.

Perception of Sound Attributes

Loudness and Pitch Perception

  • Loudness relates to the perceived intensity of sound
    • Measured in decibels (dB)
    • Influenced by sound pressure level and frequency
    • Follows Stevens' power law, describing the relationship between physical stimulus intensity and perceived loudness (a worked sketch follows this list)
  • Fletcher-Munson curves illustrate how perceived loudness varies with frequency
    • Show equal-loudness contours for different sound pressure levels
    • Demonstrate that human hearing is most sensitive to frequencies between 2-5 kHz
  • Pitch perception involves the brain's interpretation of sound frequency
    • Measured in hertz (Hz)
    • Relates to the fundamental frequency of a sound wave
    • Influenced by factors such as harmonics and overtones
  • Place theory explains pitch perception for high-frequency sounds
    • Different frequencies stimulate specific areas along the basilar membrane
    • Higher frequencies activate areas closer to the base of the cochlea
  • Temporal theory accounts for pitch perception of low-frequency sounds
    • Neural firing patterns synchronize with the sound wave's periodicity
    • Effective for frequencies up to about 5000 Hz
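
Stevens' power law can be made concrete with a short calculation. The sketch below is a minimal illustration, assuming the commonly cited loudness exponent of about 0.3 (applied to sound intensity); the function name and default value are illustrative, not a standard API.

```python
def perceived_loudness_ratio(delta_db, exponent=0.3):
    """Relative change in perceived loudness predicted by Stevens' power law
    (perceived magnitude ~ intensity ** exponent). An exponent of about 0.3
    is commonly cited for the loudness of a mid-frequency tone."""
    intensity_ratio = 10 ** (delta_db / 10)  # convert a dB change to an intensity ratio
    return intensity_ratio ** exponent

# A 10 dB increase in level roughly doubles perceived loudness (~2.0)
print(perceived_loudness_ratio(10))
```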

Timbre and Just Noticeable Difference

  • Timbre distinguishes sounds with the same pitch and loudness
    • Determined by the harmonic content and envelope of a sound
    • Allows differentiation between musical instruments or voices
    • Influenced by factors such as attack, decay, sustain, and release (ADSR envelope)
  • Spectral envelope shapes timbre perception
    • Represents the overall distribution of energy across frequencies
    • Unique for different instruments and sound sources
  • Formants contribute to timbre, especially in speech and singing
    • Resonant frequencies of the vocal tract
    • Help distinguish between different vowel sounds
  • Just noticeable difference (JND) measures the smallest detectable change in a stimulus
    • Varies depending on the sound attribute (pitch, loudness, or timbre)
    • For frequency, JND is about 0.3% in the mid-frequency range
    • For intensity, JND is approximately 1 dB for moderate sound levels
  • Weber's law relates the JND to the intensity of the original stimulus (a numeric sketch follows this list)
    • States that the JND is a constant fraction of the original stimulus intensity
    • Applies to various sensory modalities, including auditory perception
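
To put Weber's law in numbers, the short sketch below applies the roughly 0.3% frequency Weber fraction mentioned above; the helper name and default value are illustrative assumptions, and the true fraction varies with frequency and level.

```python
def frequency_jnd_hz(frequency_hz, weber_fraction=0.003):
    """Smallest detectable frequency change under Weber's law:
    delta_f = k * f, with k of about 0.3% in the mid-frequency range."""
    return weber_fraction * frequency_hz

for f in (250, 1000, 4000):
    print(f"At {f} Hz, the frequency JND is about {frequency_jnd_hz(f):.1f} Hz")
```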

Auditory Processing

Auditory Scene Analysis and Streaming

  • Auditory scene analysis involves parsing complex acoustic environments
    • Enables listeners to separate and identify individual sound sources
    • Crucial for understanding speech in noisy environments (cocktail party effect)
  • Gestalt principles guide auditory grouping
    • Similarity: grouping sounds with similar characteristics
    • Proximity: grouping sounds close in time or frequency
    • Continuity: perceiving continuous sounds despite brief interruptions
  • Stream segregation organizes sounds into coherent perceptual units
    • Allows tracking of individual sound sources over time
    • Influenced by factors such as frequency separation and presentation rate (illustrated in the sketch after this list)
  • Primitive segregation occurs automatically based on acoustic cues
    • Relies on innate perceptual mechanisms
    • Includes processes like frequency separation and onset synchrony
  • Schema-based segregation involves learned patterns and expectations
    • Utilizes prior knowledge and experience
    • Enables recognition of familiar sounds or speech patterns
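
As a rough illustration of how frequency separation and presentation rate interact in streaming, here is a toy heuristic loosely inspired by van Noorden's classic tone-sequence experiments; the threshold rule and its constant are illustrative assumptions, not measured perceptual boundaries.

```python
def likely_two_streams(freq_separation_semitones, tones_per_second):
    """Toy rule: an alternating tone sequence splits into two streams when
    its frequency separation is large relative to how slowly it is presented.
    Faster sequences segregate at smaller separations; the constant 12.0 is
    illustrative, not an empirical value."""
    required_separation = 12.0 / tones_per_second
    return freq_separation_semitones > required_separation

# An A-B-A-B pattern 6 semitones apart at 10 tones/s tends to split in two
print(likely_two_streams(6, 10))   # True
# The same pattern 2 semitones apart at 2 tones/s holds together
print(likely_two_streams(2, 2))    # False
```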

Masking and Critical Bands

  • Masking occurs when one sound interferes with the perception of another
    • Temporal masking: interference between sounds separated in time
      • Forward masking: a sound masks subsequent sounds
      • Backward masking: a sound masks preceding sounds
    • Simultaneous masking: interference between concurrent sounds
      • More effective when masker and target are close in frequency
  • Critical bands represent frequency ranges processed by the auditory system
    • Approximately 1/3 octave wide in the mid-frequency range
    • Correspond to regions along the basilar membrane
    • Play a crucial role in frequency selectivity and masking effects
  • The Bark scale divides the auditory spectrum into critical bands (a conversion sketch follows this list)
    • 24 critical bands covering the range of human hearing
    • Used in various psychoacoustic models and audio compression techniques
  • Auditory filters model the frequency selectivity of the auditory system
    • Represent the tuning characteristics of the cochlea
    • Wider bandwidth at higher frequencies
    • Contribute to our ability to separate and analyze complex sounds
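
The Bark scale can be approximated in closed form. The sketch below uses the Zwicker and Terhardt (1980) formula, one commonly cited approximation that maps frequency in hertz to a critical-band number running from roughly 0 to 24 Bark.

```python
import math

def hz_to_bark(f_hz):
    """Approximate Bark (critical-band) number for a frequency in Hz, using
    the Zwicker & Terhardt (1980) formula:
    z = 13 * atan(0.00076 * f) + 3.5 * atan((f / 7500) ** 2)"""
    return 13.0 * math.atan(0.00076 * f_hz) + 3.5 * math.atan((f_hz / 7500.0) ** 2)

for f in (100, 1000, 4000, 16000):
    print(f"{f:5d} Hz -> {hz_to_bark(f):4.1f} Bark")
```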

Spatial Hearing

Binaural Hearing and Sound Localization

  • Binaural hearing utilizes input from both ears to process spatial information
    • Enables accurate sound localization and improved speech intelligibility in noise
    • Contributes to the perception of auditory space and distance
  • Interaural time difference (ITD) cues sound location in the horizontal plane (modeled in the sketch after this list)
    • Based on the difference in arrival time of sound at each ear
    • Most effective for low-frequency sounds (below about 1500 Hz)
    • Typical ITD range: 0 to 700 microseconds
  • Interaural level difference (ILD) provides additional localization cues
    • Results from the head shadow effect, where the head attenuates sound
    • Most effective for high-frequency sounds (above about 1500 Hz)
    • Can reach up to 20 dB difference between ears
  • Head-related transfer functions (HRTFs) describe how the ear receives sound
    • Account for the effects of the head, pinna, and torso on incoming sound
    • Unique to each individual and crucial for accurate sound localization
    • Used in virtual audio systems to create realistic 3D sound
  • The cone of confusion presents challenges in sound localization
    • Occurs when ITD and ILD cues are ambiguous
    • Resolved through head movements and spectral cues from the pinna
  • Distance perception relies on multiple cues
    • Intensity: louder sounds perceived as closer
    • Reverberation: more reverberant sounds perceived as further away
    • Spectral changes: high frequencies attenuate more over distance
  • The precedence effect aids in localizing sounds in reverberant environments
    • Suppresses echoes and reflections arriving shortly after the direct sound
    • Typically operates within a 30-40 millisecond window
    • Enables accurate localization despite multiple sound arrivals
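
The ITD figures above can be reproduced with Woodworth's classic spherical-head approximation, ITD = (r / c)(theta + sin theta); the head radius and speed of sound below are typical assumed values rather than measurements of any particular listener.

```python
import math

def itd_seconds(azimuth_deg, head_radius_m=0.0875, speed_of_sound_ms=343.0):
    """Interaural time difference from Woodworth's spherical-head model,
    ITD = (r / c) * (theta + sin(theta)), for a distant source in the
    horizontal plane at the given azimuth (0 degrees = straight ahead)."""
    theta = math.radians(azimuth_deg)
    return (head_radius_m / speed_of_sound_ms) * (theta + math.sin(theta))

# The maximum, for a source at 90 degrees, comes out near 660 microseconds,
# consistent with the roughly 0-700 microsecond range cited above.
for az in (0, 30, 60, 90):
    print(f"{az:3d} deg -> ITD of about {itd_seconds(az) * 1e6:4.0f} microseconds")
```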

Key Terms to Review (33)

Audio engineering: Audio engineering is the field focused on the technical aspects of recording, mixing, and reproducing sound. It combines knowledge of acoustics, electronics, and music theory to manipulate audio signals and create high-quality sound recordings. Understanding the perception of sound properties is crucial in audio engineering, as it helps engineers make informed decisions about how to capture and reproduce sound that is pleasing and impactful to listeners.
Auditory filters: Auditory filters are mechanisms within the auditory system that allow individuals to process and perceive different sound frequencies by isolating specific frequency bands. This filtering process helps the brain make sense of complex sounds in environments with multiple overlapping auditory signals, enhancing the ability to focus on a particular sound source, such as a voice in a crowded room. Understanding auditory filters is essential for grasping how humans perceive sound properties like pitch, timbre, and loudness.
Auditory scene analysis: Auditory scene analysis is the process by which the auditory system organizes sound signals into distinct sources and events, allowing individuals to perceive and understand complex auditory environments. This process is crucial in identifying sounds in music, speech, and other acoustic scenarios, enabling listeners to differentiate between overlapping sounds and locate their sources. It plays a key role in how we interpret auditory information and is essential for effective communication and musical experiences.
Bark Scale: The Bark Scale is a psychoacoustic frequency scale that divides the range of human hearing into 24 critical bands, with equal steps on the scale corresponding to roughly equal distances along the basilar membrane. Unlike the linear hertz scale, it reflects how the auditory system groups nearby frequencies, which is why it is widely used in psychoacoustic models and audio compression techniques to predict masking between sounds.
Binaural Hearing: Binaural hearing refers to the ability of the auditory system to perceive sound using both ears, allowing for localization, spatial awareness, and improved sound discrimination. This mechanism enhances our ability to interpret the direction and distance of sounds, making it crucial for effective communication and interaction with our environment. The interplay of various auditory cues processed by both ears enables a richer auditory experience, connecting closely with how we understand music and sound properties.
Catherine Carr: Catherine Carr is a neuroscientist known for her research on the neural mechanisms of sound localization, particularly how auditory brainstem circuits encode interaural time differences. Her work on temporal coding in the auditory system helps explain how the brain extracts the precise timing cues that underlie binaural hearing.
Cocktail party effect: The cocktail party effect is the ability of a listener to focus on a single conversation in a noisy environment while filtering out other sounds. This phenomenon highlights how the auditory system processes sound, allowing individuals to prioritize specific stimuli, like a friend’s voice, despite the presence of background noise, which is essential for social interactions.
Cone of Confusion: The cone of confusion refers to a phenomenon in sound localization where a listener has difficulty determining the exact location of a sound source due to the similar time and intensity cues received from different angles. This occurs primarily with sounds coming from directly above or below, as the interaural time difference and interaural level difference become less effective in these positions. Understanding this concept is essential for exploring how humans perceive sound directionality and spatial relationships.
Critical Bands: Critical bands refer to the frequency ranges within which multiple sounds can interact and influence each other's perception. This concept is crucial for understanding how humans perceive sound properties like pitch, loudness, and timbre, as it highlights the limitations of auditory processing in the human ear. The idea of critical bands explains why certain tones can mask others when they fall within the same band, affecting how we interpret sounds in complex auditory environments.
David Huron: David Huron is a prominent music psychologist known for his extensive research on the psychology of music, particularly how we perceive and process musical sounds and emotions. His work has helped bridge the gap between the fields of music theory and cognitive psychology, exploring how our understanding of music influences emotional responses and behavior in various contexts, such as marketing and consumer behavior.
Decibels (dB): Decibels (dB) are a logarithmic unit used to measure the intensity or level of sound. This scale allows us to express large variations in sound pressure levels in a more manageable way, as human perception of sound is not linear but logarithmic. Understanding decibels helps in comprehending how we perceive different volumes of sound and its psychological effects.
Distance perception: Distance perception is the ability to perceive the distance of sounds, which allows individuals to determine how far away a sound source is. This skill relies on various auditory cues, including differences in sound intensity, timing, and frequency, helping listeners locate and understand their environment. It plays a crucial role in how we interact with our surroundings and affects our emotional responses to music and other auditory stimuli.
Fletcher-Munson Curves: Fletcher-Munson curves, also known as equal-loudness contours, illustrate the relationship between sound frequency and perceived loudness. These curves demonstrate that human hearing is more sensitive to certain frequencies, particularly in the mid-range, and less sensitive to very low or very high frequencies. This phenomenon helps in understanding how we perceive sound and its properties, impacting audio engineering, music production, and psychoacoustics.
Gestalt Principles: Gestalt principles refer to a set of theories in psychology that explain how humans perceive and interpret sensory stimuli as organized wholes rather than as separate parts. Although first formulated for vision, these principles describe how the brain groups sensory information, helping us make sense of complex auditory experiences, particularly in the context of sound properties and music perception.
Head-related transfer functions (HRTFs): Head-related transfer functions (HRTFs) are mathematical representations that describe how an ear receives sound from a point in space, taking into account the shape of the head, ears, and torso. They play a crucial role in psychoacoustics by influencing how we perceive the direction and distance of sounds, allowing us to locate them in three-dimensional space. By capturing the unique filtering effects created by our anatomy, HRTFs are fundamental for understanding spatial hearing and are used in various audio technologies, including virtual reality and 3D audio applications.
Hertz (Hz): Hertz (Hz) is the unit of frequency that measures the number of cycles per second of a periodic wave, particularly in the context of sound waves. It plays a critical role in understanding how we perceive sound properties, such as pitch and tone. Higher frequencies correspond to higher pitches, while lower frequencies relate to lower pitches, making hertz an essential concept in psychoacoustics.
Interaural level difference (ILD): Interaural level difference (ILD) is the difference in the sound pressure level of a sound as it reaches each ear, helping to determine the direction of sound sources. This difference arises because sounds coming from one side of the head are louder at the nearer ear than at the farther ear, due to the head casting a sound shadow. This phenomenon plays a crucial role in how humans perceive spatial audio, contributing to our ability to localize sounds in our environment.
Interaural time difference (ITD): Interaural time difference (ITD) refers to the difference in arrival time of a sound at each ear, which plays a crucial role in how we localize sounds in our environment. This phenomenon occurs because sounds coming from one side reach the nearer ear slightly earlier than the farther ear, allowing the brain to process this timing information for spatial awareness. ITD is especially important for low-frequency sounds, whose wavelengths are long relative to the head; at these frequencies the timing (phase) difference between the ears remains unambiguous, making timing a more reliable cue than intensity differences.
Just noticeable difference (jnd): The just noticeable difference (jnd) refers to the smallest change in a stimulus that can be detected by an observer. This concept plays a crucial role in understanding how we perceive sound properties, as it helps to establish the threshold at which changes in sound can be recognized. The jnd is significant in various aspects of psychoacoustics, such as loudness, pitch, and timbre, influencing how we experience and interpret different auditory stimuli.
Loudness: Loudness is the perceptual response to the intensity of sound, often associated with how we perceive volume in our auditory experience. It is influenced by the sound's amplitude, frequency, and duration, as well as psychological factors like context and listener expectations. Understanding loudness is crucial for grasping sound fundamentals, its psychological perception, and how music interacts with other disciplines like acoustics and psychology.
Masking: Masking is a psychoacoustic phenomenon where the perception of one sound is hindered by the presence of another sound. This occurs when a louder sound makes it difficult to hear a softer sound, leading to a change in how we perceive sound properties like pitch and loudness. Masking can help us understand how sounds interact in complex auditory environments and illustrates the limitations of human hearing.
Music therapy: Music therapy is a clinical and evidence-based practice that uses music interventions to accomplish individualized goals within a therapeutic relationship. It connects the power of music to mental and emotional well-being, fostering healing, communication, and cognitive development.
Pitch Perception: Pitch perception refers to the ability to perceive the frequency of a sound, which determines how high or low it sounds to the listener. This ability is crucial in music and communication, as it influences how we recognize melodies, harmonies, and even speech intonation. Understanding pitch perception involves exploring how our auditory system processes sound waves and how various factors, including neurological conditions and cultural influences, shape our experience of music.
Place Theory: Place theory is a concept in auditory perception that explains how different frequencies of sound are processed by specific locations along the basilar membrane in the cochlea. This theory suggests that the perception of pitch is determined by the location of hair cells stimulated by sound waves, with higher frequencies activating hair cells closer to the base of the cochlea and lower frequencies stimulating those nearer to the apex. This connection to frequency localization is fundamental in understanding how we perceive and differentiate various sounds.
Precedence effect: The precedence effect refers to the phenomenon where the perceived direction of a sound is dominated by the first-arriving, direct sound, with later reflections and echoes largely suppressed. This effect is crucial in understanding how we localize sounds in reverberant environments, emphasizing the importance of timing and spatial relationships in auditory perception.
Primitive Segregation: Primitive segregation refers to the basic auditory process by which the brain organizes and separates sounds from different sources based on fundamental auditory features such as frequency, amplitude, and timing. This phenomenon allows listeners to identify distinct sound streams in complex auditory environments, highlighting the importance of these perceptual skills in understanding how we process and interact with sound in our daily lives.
Schema-based segregation: Schema-based segregation refers to the cognitive process through which listeners organize and differentiate sounds in complex auditory environments using pre-existing knowledge and expectations. This process allows individuals to effectively parse auditory scenes, enabling them to separate overlapping sounds based on contextual cues, learned experiences, and familiarity with certain sound patterns.
Sound Localization: Sound localization is the ability of an organism to determine the origin of a sound in space. This skill relies on auditory cues such as the time difference and intensity difference of sounds reaching each ear, allowing individuals to perceive their environment accurately. This perceptual ability is crucial for various activities, including communication, navigation, and recognizing potential threats.
Stevens' Power Law: Stevens' Power Law is a principle that describes the relationship between the physical intensity of a stimulus and the perceived intensity of that stimulus. It suggests that the perceived sensation grows at a specific power of the actual stimulus intensity, allowing for a more nuanced understanding of how we experience various stimuli, particularly in sound. This law plays a crucial role in psychoacoustics by helping to explain how changes in sound properties, such as loudness, are perceived by individuals.
Stream segregation: Stream segregation is the process by which the auditory system separates different sound sources in order to perceive them as distinct streams. This phenomenon allows listeners to focus on one sound while ignoring others, a critical aspect of how we navigate complex auditory environments, such as in music or conversation.
Temporal theory: Temporal theory is a model of pitch perception holding that pitch information is carried in the timing of neural firing: auditory nerve fibers phase-lock to the periodicity of the sound wave, so the brain can read pitch from the temporal pattern of spikes. It complements place theory, accounting especially well for the perception of low-frequency sounds, and informs our understanding of how we identify melodies and rhythms.
Timbre: Timbre, often referred to as the 'color' or 'texture' of sound, is the characteristic quality that distinguishes one sound source from another, even when they produce the same pitch and loudness. It involves the complex interplay of harmonics and overtones that create unique sound profiles for different instruments and voices, making timbre essential in identifying and differentiating sounds.
Weber's Law: Weber's Law states that the smallest change in a stimulus that can be detected is proportional to the original intensity of the stimulus. This principle highlights how perception of sensory input, including sound, operates on a relative basis rather than an absolute one, emphasizing that our ability to perceive differences in sound properties relies on the ratio of change to the original stimulus level.