RepIt: Steering Language Models with Concept-Specific Refusal Vectors

The Annual Conference on Neural Information Processing Systems (NeurIPS 2025 Workshop on Socially Responsible and Trustworthy Foundation Models), 2025-04-23