https://openreview.net/forum?id=ORKhJBFepG
Winter Soldier: Backdooring Language Models at Pre-Training with Indirect Data Poisoning |...
The pre-training of large language models (LLMs) relies on massive text datasets sourced from diverse and difficult-to-curate origins. Although membership...
indirect data poisoningwinter soldierlanguage modelspre training