PyOD is a nice Python library for outlier detection, which can simplify data preparation: https://pyod.readthedocs.io/en/latest/