Transforming Documents into Podcasts with Google’s Latest AI Feature
Last year, Google introduced a feature in its NotebookLM platform that quickly gained traction. It lets users upload documents, which the AI then turns into a conversational podcast featuring two voices. These podcasts are meant as auditory learning tools rather than content for public streaming.
The idea is simple: complex ideas are often easier to grasp when presented as a dialogue. You no longer need to navigate the less familiar NotebookLM interface to use this functionality. Google’s Audio Overviews feature is now available for free in the Gemini app and on its website. With Audio Overviews integrated into Gemini, users can also use Gemini’s Deep Research reports as source material for their podcasts.
A good way to get the most out of this feature is to first generate a Deep Research report on a topic in Gemini, then use that report to produce an Audio Overview, so you never have to read the text yourself.
Create and Download Podcasts on Any Subject
To get started, open the Gemini website or mobile app. To upload a document or presentation, click the Plus icon and select your file. Once the document has been processed, a Generate Audio Overview button will appear.

Clicking the button starts the process. Creating the podcast audio typically takes 3 to 5 minutes, depending on the amount of content.
Alternatively, request a report using the Deep Research feature found beneath the text entry box. As with a document upload, expect to wait a few moments while the research is compiled. Once it is complete, open the Deep Research document, click the downward arrow, and select the Generate Audio Overview button, or simply type “Generate Audio Overview” in the input field.

When the audio is ready, Gemini will display a notification, and the audio player will appear in the chat interface. Click the Play button to start listening, and use the seek bar to move through the recording. You can adjust the playback speed, but only up to a maximum of 1.5x.

As few as 12 pages of text can often yield a 10-minute podcast overview, which gives a sense of how much detail the audio covers. If you prefer not to listen immediately or want to share the podcast, you can download the audio file for offline listening. To do this, click the three-dot menu in the audio player and select the Download button.

Additionally, you can opt to share the conversation by generating a link to your Gemini chat and audio recording.
While exploring Gemini, consider creating custom AI bots, referred to as Gems, which are now available for all users at no cost.