#06 Exploring Large multimodal models in healthcare - GPT-4V, Google PaLI-3 explained
Content provided by Dev and Doc. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Dev and Doc or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process described here: https://sv.player.fm/legal.
🤖Dev and doc👨🏻⚕️ introduces large multimodal models. ✨ The potential of LMMs combining text and images seems limitless, but what's the catch?

Dev and Doc is a podcast where developers and doctors join forces to deep dive into AI in healthcare. Together, we can build models that matter.

👨🏻⚕️Doc - Dr. Joshua Au Yeung - https://www.linkedin.com/in/dr-joshua-auyeung/
🤖Dev - Zeljko Kraljevic - https://twitter.com/zeljkokr

00:00 start
00:32 intro
02:20 what is multimodality? And what is the potential?
09:43 Large multimodal models paper deep dive (radiology)
18:43 paper deep dive 2 (pathology)
20:40 large multimodal models technical overview, exploration of other LMMs
31:40 Foundational models explanation
35:18 the model transparency index
36:20 Google PaLI-3, lightweight models vs large Foundational models
43:04 Summary
44:15 the problems and work to be done for LMMs - hallucinations, inconsistencies, biases, security
49:20 A call for better evidence generation and trials with LMMs
53:00 final points - improving visual spatial recognition, thoughts for future

The podcast 🎙️
🔊Spotify: https://open.spotify.com/show/3QO5Lr3w4Rd6lqwlfKDaB7?si=e7915d844994403e
📙Substack: https://aiforhealthcare.substack.com/

🎞️ Editor - Dragan Kraljević https://www.instagram.com/dragan_kraljevic/
🎨Brand design and art direction - Ana Grigorovici https://www.behance.net/anagrigorovici027d