Back to Main Conference 2024
LREC-COLING 2024main

Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

DOI:10.63317/2ivv5y3ufu9y

Abstract

No abstract available.

Details

Paper ID
lrec2024-main-1462
Pages
pp. 16802-16830
BibKey
rao-etal-2024-tricking
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
2522-2686
ISBN
979-10-95546-34-4
Conference
Joint International Conference on Computational Linguistics, Language Resources and Evaluation
Location
Turin, Italy
Date
20 May 2024 25 May 2024

Authors

  • AR

    Abhinav Rao

  • SV

    Sachin Vashistha

  • AN

    Atharva Naik

  • SA

    Somak Aditya

  • MC

    Monojit Choudhury

Links