Обложка канала

Towards NLP

NLP: все n-граммы про анализ текстов. По всем дополнительным вопросам:

Towards NLP

3 года назад
Открыть в
IFAN: An Explainability-Focused Interaction Framework for Humans and NLP Models We talked before about different techniques how to explain ML and NLP models. Ok, we have explained some model for a specific output, highlighted some tokens there. What should happen next? 📌You can use humans to debug and improve your model! What your steps can be: 1. 🔍You identify misclassified samples (for instance, during hate speech detection, you have noticed that the model is biased against some target words). 2. 📊You explain model's decisions and see that the models puts too much or too less weight/attention to some words. 3. 📝You edit the explanation, i.e. corresponding weights of the words spans that should contribute to the correct label. 4. 🔄You do this for several samples and retrain Adapter layer of your model based on new samples. 5. ✅Now your model's behavior is fixed, i.e. it is debiased! All this can be done with our platform: https://ifan.ml/ This is the first solid version, we are still developing many-many new features for it (as, for instance, the report page where you can control model's performance change). But already now, we believe that the platform can be a solid step to human-in-the-loop debugging of NLP models🤖. 📜The corresponding paper about this first version [link]
Towards NLP🇺🇦

Explainability for NLP With the raise of LLMs from ClosedAI, the research in explainability for NLP is important as never before. Still, a lot of work should be done in the field. However, you already can experiment and try explain your fine-tuned LLMs on a specific task. For now, the majority of methods are explored for texts classification tasks and are adjusted from tabular data. How it can be done? 1. Baseline approach: Leave-one-out explanations. For instance, you have a regression layer as one of the last layers in your model. You can check the tokens with major weights. Then, exclude them from the text and check if the model's answer has changed. If the tokens were indeed important, the answer should change dramatically as the model cannot orient on this words to make a correct decision. 2. Local Surrogate (LIME). Modification of the previous idea. Now, you delete each word from the sentence and check the results. The "importance" of the word will be estimated based on how the model's answer differ…

Telegram