Attack

Weight Poisoning Attacks on Pre-trained Models [ACL 2020] Keita Kurita, Paul Michel, Graham Neubig.
- change PLMs weight to attack classification result even if after finetune, and not need know the downstream knowledge.
- one is base on two part optimization.
- one is base on mean pooling the co-occurrence token embeddings.

Provide feedback