Efficiently generating sentence-level textual adversarial examples with Seq2seq Stacked Auto-Encoder

Ang Li, Fangyuan Zhang, Shuangjiao Li, Tianhua Chen, Pan Su, Hongtao Wang

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)


Although deep learning has achieved numerous successes, recent research has raised increasing concern about its vulnerability to adversarial attacks. In Natural Language Processing, crafting high-quality adversarial text examples is particularly challenging due to the discrete nature of text. Recent studies perform transformations on characters or words, which are generally formulated as combinatorial optimization problems. However, these approaches suffer from inefficiency due to the high-dimensional search space. To address this issue, in this paper we propose an end-to-end Seq2seq Stacked Auto-Encoder (SSAE) neural network, which generates adversarial text examples efficiently via direct network inference. SSAE has two salient features. The outer auto-encoder preserves the syntactic and semantic information of the original examples. The inner auto-encoder projects the sentence embedding into a high-level semantic representation, on which constrained perturbations are superimposed to increase adversarial strength. Experimental results suggest that SSAE achieves a higher attack success rate than existing word-level attack methods and is 100x to 700x faster at attack generation on the IMDB dataset. We further find that the adversarial examples generated by SSAE transfer strongly across different victim models.
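The stacked design described above can be illustrated with a minimal sketch: an outer encoder maps the input to a sentence embedding, an inner encoder projects it to a compact semantic code, a bounded perturbation is superimposed on that code, and the decoders map it back. All names, dimensions, and the linear/tanh layers below are hypothetical simplifications; the paper's SSAE uses seq2seq networks and learned perturbations, not the random dense-layer stand-ins shown here.

```python
import numpy as np

rng = np.random.default_rng(0)

def perturb(z, epsilon=0.1):
    """Superimpose an L-inf-constrained perturbation on the inner semantic
    code (a random stand-in for SSAE's learned adversarial perturbation)."""
    delta = rng.uniform(-epsilon, epsilon, size=z.shape)
    return z + delta

# Hypothetical dimensions: token-embedding space, sentence embedding,
# and high-level semantic representation.
d_embed, d_sent, d_sem = 16, 8, 4
W_out_enc = rng.normal(size=(d_embed, d_sent))  # outer encoder
W_in_enc = rng.normal(size=(d_sent, d_sem))     # inner encoder
W_in_dec = rng.normal(size=(d_sem, d_sent))     # inner decoder
W_out_dec = rng.normal(size=(d_sent, d_embed))  # outer decoder

x = rng.normal(size=(d_embed,))                 # a (fake) pooled input representation
z_sent = np.tanh(x @ W_out_enc)                 # outer encoding: sentence embedding
z_sem = np.tanh(z_sent @ W_in_enc)              # inner encoding: semantic code
z_adv = perturb(z_sem, epsilon=0.1)             # constrained perturbation on the code
x_adv = np.tanh(z_adv @ W_in_dec) @ W_out_dec   # decode back toward the input space

# The perturbation stays inside the epsilon ball, which is what keeps the
# adversarial example close to the original in the semantic space.
assert np.max(np.abs(z_adv - z_sem)) <= 0.1
```

Because the perturbation is applied in the low-dimensional semantic space rather than over discrete word substitutions, one forward pass suffices to produce an adversarial candidate, which is the source of the speedup the abstract reports.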
Original language: English
Article number: 119170
Number of pages: 9
Journal: Expert Systems with Applications
Issue number: Part C
Early online date: 16 Nov 2022
Publication status: Published - 1 Mar 2023

