Designing for Auditory Accessibility: What Your Learners Aren't Telling You
- James Bright
- Apr 7
- 4 min read

When most people think about accessibility in eLearning, vision tends to dominate the conversation. Screen readers, alt text, color contrast. And while all of that matters (we covered it in the last post), auditory accessibility often gets treated as an afterthought - a quick closed caption drop at the end of the build, if it happens at all.
That's a problem. And the numbers make it hard to ignore.
According to the CDC, approximately 12% of all U.S. workers have some degree of hearing difficulty. That's not a fringe statistic. That's roughly one in eight people on your learner roster who may be experiencing your audio-heavy course very differently than you designed it.
So let's talk about what auditory accessibility actually requires, and what it looks like in practice.
Who Are We Designing For?
Auditory disabilities exist on a wide spectrum, just like visual ones. We're talking about people who are deaf or profoundly hard of hearing, people with mild to moderate hearing loss, people with tinnitus, and people who process audio differently due to cognitive or neurological differences. Not to mention those who the course's native language may be their second or third language - having captioning for these learners also helps them ensure better processing.
And then there's the situational piece. A learner taking your course on a noisy factory floor. Someone in an open office who forgot their headphones. A parent taking training at 10pm while a baby sleeps in the next room. Auditory accessibility doesn't just serve people with permanent hearing loss. It serves everyone who can't or won't use audio at any given moment.
That's a much bigger audience than most designers account for.
The Big Four for Auditory Accessibility in eLearning
1. Closed Captions - and Doing Them Right
Closed captions are the baseline. If your eLearning contains any audio - narration, video, dialogue - it needs captions. Full stop.
But captions are only useful if they're accurate, synchronized, and readable. Auto-generated captions are a starting point, not a finish line. They miss proper nouns, technical terminology, and anything your narrator said quickly or quietly. Every caption file needs a human review pass before it goes live - yes, even if caption review is your personal designer kryptonite. It's that important.
Designer Tip: In Articulate Storyline 360, you can import an SRT/VTT file directly or use the built-in caption editor. Camtasia has auto-transcription built in, and it has gotten significantly better in recent versions - but still review everything before you publish. For a faster and more accurate workflow, Rev.com offers AI-assisted and human-reviewed captioning at a reasonable cost. It's worth it for longer or more complex content.
2. Transcripts
Captions and transcripts are not the same thing. Captions are synchronized with the audio in real time. A transcript is a standalone text document of the full audio content, and for many learners, it's actually more useful.
Transcripts allow learners to read at their own pace, search for specific content, and reference material after the course is complete. They're also invaluable for learners who use screen readers or have processing differences that make real-time audio difficult to follow.
Designer Tip: If you already have a narration script, your transcript is mostly written. Clean it up, format it for readability, and make it available as a downloadable resource. This takes almost no additional time and significantly increases accessibility. Many authoring tools allow you to add the transcript to a side section of the player as well. Check your tool for the steps.
3. No Audio-Only Cues
This is one that quietly breaks courses all the time. If your course uses sound to communicate something - a chime when an answer is correct, a buzzer when it's wrong, an audio alert to indicate a new section - and that sound is the only signal, you've lost your hearing-impaired learners entirely.
Every audio cue needs a visual equivalent. Correct answer? Show a checkmark or "Correct" on screen. New section? A visual transition or on-screen indicator. Wrong answer? A visual alert alongside any sound.
Designer Tip: Go through your course with your speakers off. If you can still follow every instruction, understand every piece of feedback, and navigate every interaction without audio - you're in good shape. If anything becomes unclear or unusable, that's your fix list.
4. Video Accessibility Beyond Captions
If your course includes video content, captions alone may not be enough. Audio descriptions - narration that describes what's happening visually for learners who can't see the screen clearly - are required under WCAG 2.1 for meaningful visual content that isn't conveyed through the existing audio track.
This is less common in standard corporate eLearning but worth knowing, particularly if your content includes demonstrations, scenarios, or video where the visual action carries significant meaning.
Designer Tip: When scripting video content, build the description into the narration itself where possible. If a character is demonstrating a process on screen, have the narrator describe what's happening rather than assuming the learner can see it. This serves both auditory and visual accessibility simultaneously.
A Note on Background Music
Background music in eLearning is a design choice worth examining carefully. For some learners, it creates atmosphere and reduces cognitive fatigue. For others (particularly those with auditory processing differences, tinnitus, or ADHD) it's an active barrier to comprehension.
If you use background music, keep it low, keep it instrumental, and wherever possible give learners the ability to turn it off independently of the narration. When in doubt, leave it out.
What Good Looks Like
Auditory accessibility isn't about stripping your courses of sound. It's about making sure that no learner is dependent on audio to access the content, and that every audio element has a text or visual equivalent.
Done well, it doesn't just serve deaf and hard-of-hearing learners. It serves everyone who learns differently, works in challenging environments, or simply prefers to read rather than listen.
The same federal compliance work that taught me visual accessibility taught me this too. You don't realize how audio-dependent most eLearning is until you're required to account for every learner who can't hear it.
What auditory accessibility tips or tools have you found most useful in your builds? Drop them in the comments.



Comments