Transforming Speech Recognition with Miguel Jetté of Rev.com
Manage episode 424194625 series 3570809
Miguel Jetté, VP of AI at Rev.com, shares insights into the evolution and impact of AI in transcription and speech recognition. Rev started as a platform offering transcription, captions, and subtitles, heavily relying on AI to improve its tools and products. Miguel's journey began eight years ago, focusing on building speech recognition capabilities.
This technology provides a first draft for "Revvers" (transcribers) to polish, enhancing efficiency and accuracy. He highlights the importance of a large and quality dataset, derived from Rev's transcription work, as a key advantage in training their speech recognition model. Challenges in testing and improving these models include dealing with subjective interpretations of transcripts. Miguel also discusses future directions, including tackling more complex use cases, multilingual support, and expanding technology to other languages and fields like translation and generative audio.
His career path, from a mathematics background to leading AI advancements at Rev, underscores the interdisciplinary nature and rapid evolution of speech technology and AI applications.
Notable Quotes from Miguel Jetté:
🎙️ Rev started as a platform for work at home jobs, diving deep into transcription, captions, and subtitles, powered by AI innovations.
🌍 Expanding our technology to embrace multilingual support and translation is not just exciting; it's a path towards global accessibility.
💡 The interaction between machines and humans at Rev showcases the real magic in improving productivity and accuracy through AI.
Resources Mentioned
Book: Computer speech technology by Rodman, Robert 1999
Miguel Jetté is Vice President of Artificial Intelligence at Rev. He leads Rev’s speech research and development team with over 20 years’ experience in speech recognition and machine learning. Before joining Rev, Migüel was a speech scientist at VoiceBox and Nuance Communications where he created state-of-the-art speech models across major industries in multiple languages. Migüel has a Master’s in mathematics and statistics from McGill University.
Connect with Miguel on LinkedIn: https://www.linkedin.com/in/migueljette/
Let's Connect
YouTube @YourAIRoadmap
LinkedIn Let Joan know you listen!
Pre-order Joan's Book! ✨📘🔜 Your AI Roadmap: Actions to Expand Your Career, Money, and Joy Jan 9, 2025, Wiley
Who is Joan? Ranked the #4 in Voice AI Influencer, Dr. Joan Palmiter Bajorek is the CEO of Clarity AI, Founder of Women in Voice, & Host of Your AI Roadmap. With a decade in software & AI, she has worked at Nuance, VERSA Agency, & OneReach.ai in data & analysis, product, & digital transformation. She's an investor & technical advisor to startup & enterprise. A CES & VentureBeat speaker & Harvard Business Review published author, she has a PhD & is based in Seattle.
Clarity AI builds AI that makes businesses run better. Our mission is to help SMB + enterprise leverage the power of AI. Whether your budget is 5-8 figures, we can build effective AI solutions. Book a 15min
♥️ Love it? Rate, Review, Subscribe. Send to a friend 😊
Kapitel
1. Introduction & Background (00:00:00)
2. Rev's AI Innovations: Leveraging AI to enhance transcription, captions, and subtitles. (00:02:29)
3. Speech Recognition at Rev: Miguel's journey in building speech recognition tools. (00:03:42)
4. Developing Speech Recognition: Challenges in creating effective models with unique datasets. (00:05:20)
5. Field Surprises: The nuanced difficulties in transcription and speech recognition. (00:07:24)
6. Complexity in the Field: Addressing the intricate challenges of accurate transcription. (00:08:25)
7. Future Directions: Exploring multilingual support and advancements in speech technologies (00:09:05)
8. Upcoming Projects: Enhancing tools for subtitlers and expanding language capabilities. (00:10:25)
9. Customization for Users: Tailoring solutions to meet diverse customer needs (00:11:12)
10. Advancements in Translation and Text-to-Speech (00:15:19)
11. Speech Technology Perceptions: Debunking the notion that speech recognition is fully "solved." (00:15:33)
12. Career Advice: Insights on entering the field and the importance of team fit in hiring. (00:17:24)
13. Hiring Focus: Looking for relevant experience and a good fit for the team's culture. (00:30:35)
14. Final Thoughts: Encouragement for those aspiring to join the AI and speech technology field. (00:35:16)
15. Closing: How to connect with Miguel and learn more about Rev's work (00:35:51)
26 episoder