Beyond Guardrails: Defending LLMs Against Sophisticated Attacks Podcast Por  capa

Beyond Guardrails: Defending LLMs Against Sophisticated Attacks

Beyond Guardrails: Defending LLMs Against Sophisticated Attacks

Ouça grátis

Ver detalhes do programa

Sobre este áudio

Jason Martin is an AI Security Researcher at HiddenLayer. This episode explores “policy puppetry,” a universal attack technique bypassing safety features in all major language models using structured formats like XML or JSON.

Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/

Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS.

Detailed show notes - with links to many references - can be found on The Data Exchange web site.

O que os ouvintes dizem sobre Beyond Guardrails: Defending LLMs Against Sophisticated Attacks

Nota média dos ouvintes. Apenas ouvintes que tiverem escutado o título podem escrever avaliações.

Avaliações - Selecione as abas abaixo para mudar a fonte das avaliações.