Enhancing Accessibility: Google Docs Introduces AI-Powered Audio Features
Text-to-speech technology has been around for quite some time. Many computers have had this capability for years, though the results have often been unsatisfying.
To improve on this, Google has rolled out an AI-driven audio feature for Google Docs. It lets users generate “audio versions” of their documents, using the Gemini AI system to provide a more lifelike text-to-speech experience. The results show promise, but there is still room for improvement.
Google announced the feature in a blog post on Wednesday, just before the Made by Google 2025 event. To use it, open a document and select the “Tools” tab from the menu. For those with access, a new “Audio” option will be listed there. Selecting it displays a playback bar at the bottom left of the interface, which can be repositioned as desired. Once the AI has processed the document, it automatically begins reading aloud.
Google’s AI voice delivers mixed results in this update. It often sounds realistic and captures some natural qualities and rhythms, but at times it falls short and reveals typical AI limitations.
Several customization options let users tailor the experience. Playback speed can be adjusted between 0.5x and 2x, and seven voices are available. The default voice, Narrator, is described as “smooth, medium pitch.” The six alternatives are:
- Educator: Friendly, higher pitch
- Teacher: Clear, low pitch
- Persuader: Engaging, low pitch
- Explainer: Lively, low pitch
- Coach: Lively, higher pitch
- Motivator: Energetic, medium pitch
Document authors can also embed an audio button in their files so that all contributors and readers can listen at their convenience.
The feature is available to a broad range of Google Workspace users, including Business Standard and Plus, Enterprise Standard and Plus, and various Gemini Education and Business offerings. Subscribers to Google AI Pro or Google AI Ultra also have access.

