masked tokens. We can use such task, mask tokens — the attributes of original style — and prediction the substitution for them in our target style.
Also, there was trained T5 detoxification model on pseudo-parallel corpus, you can try it via HuggingFace interface 🤗.
The proposed models achieve today SOTA in style transfer for detoxification task!
You are welcome to test the models and write github issues 🙂