Voice AI startup ElevenLabs has unveiled GenFM, a feature that generates multilingual podcasts from user-supplied content. The announcement, made on a Wednesday in late November 2024, puts ElevenLabs in direct competition with Google’s NotebookLM, which also offers AI-driven podcast creation. GenFM is built into the ElevenLabs Reader iOS app, expanding the company’s voice AI offerings.

Users can upload content such as YouTube videos, text, or documents, and the app automatically selects two distinct voices to generate a podcast episode. GenFM currently supports 32 languages, including English, Hindi, Portuguese, Chinese, Spanish, French, German, and Japanese.

One notable aspect of GenFM is that it adds human-like elements, such as ‘ums’ and thoughtful pauses, which other AI tools have typically eliminated. Jack McDermott, ElevenLabs’ mobile growth lead, explained the design philosophy behind these features: “We debated about how much to introduce ‘ums,’ ‘ahs,’ ‘mhmms’/ laughter/breathing similar human dialogue fillers or overlays — we’re aiming to strike the right balance of natural, human conversation and providing utility from the content.” The goal is a listening experience that sounds like natural, insightful conversation. McDermott also noted that, unlike many long-form podcasts that suffer from frequent interruptions, GenFM focuses on seamless narratives that are easily accessible. This blend of technology and human touch may give ElevenLabs a competitive edge in a crowded podcasting market where quality and engagement are key.
Looking ahead, the company plans to roll out further customization options and the ability to combine multiple content sources in a single generated podcast, continuing its steady pace of releases in voice AI.

Earlier in the year, Google introduced similar capabilities in NotebookLM, allowing users to generate AI conversations from uploaded sources. After the initial launch, Google added options for customizing the AI-generated podcast output.

Alongside the launch of GenFM, ElevenLabs has been making strategic business moves, notably a recent $11 million investment in the Polish startup ecosystem. The investment coincides with the opening of a new office in Warsaw, which is set to become the company’s research and development hub and is aimed at attracting local AI talent. ElevenLabs is also expanding into the Indian market, where it has appointed a business head and begun building a team. In addition, the company has started offering conversational AI agents to its customers, further diversifying its product lineup.

With GenFM, ElevenLabs is both broadening its product range and staking a claim in AI-generated audio. As podcasting continues to evolve, tools like GenFM promise to make creating engaging, multilingual content more accessible for diverse global audiences.