AI Red Teaming and AI Safety - Sounil Yu, Amanda Minnich - ESW #371

Security Weekly Podcast Network (Audio)

Innehåll tillhandahållet av Security Weekly Productions. Allt poddinnehåll inklusive avsnitt, grafik och podcastbeskrivningar laddas upp och tillhandahålls direkt av Security Weekly Productions eller deras podcastplattformspartner. Om du tror att någon använder ditt upphovsrättsskyddade verk utan din tillåtelse kan du följa processen som beskrivs här https://sv.player.fm/legal.

1M ago 2:18:23

MP3•Episod hem

In this interview we explore the new and sometimes strange world of redteaming AI. I have SO many questions, like what is AI safety?

We'll discuss her presence at Black Hat, where she delivered two days of training and participated on an AI safety panel.

We'll also discuss the process of pentesting an AI. Will pentesters just have giant cheatsheets or text files full of adversarial prompts? How can we automate this? Will an AI generate adversarial prompts you can use against another AI? And finally, what do we do with the results?

Resources:

We chat with Sounil Yu, co-founder of LLM access control startup, Knostic. We discuss both the experience of participating in Black Hat's startup competition, and what his company, Knostic, is all about. Knostic was one of four finalists for Black Hat's Startup Spotlight competition and was announced as the winner on August 6th.

References

, in the enterprise security news,

AI is still getting a ton of funding!
Netwrix acquires PingCastle
Tenable looks for a buyer
SentinelOne hires Alex Stamos as their new CISO
Crowdstrike doesn’t appreciate satire when it’s at their expense
Intel begins one of the biggest layoffs we’ve ever seen in tech
Windows Downdate
RAG poisoning
GPT yourself
The Xerox Hypothesis

All that and more, on this episode of Enterprise Security Weekly.

Visit https://www.securityweekly.com/esw for all the latest episodes!

Show Notes: https://securityweekly.com/esw-371

2902 episoder

#Security #Software Development #Tech News #Hacking #News #Tech #Security Weekly Productions