INDEX
Explanations
this is a sensitive question
New Auto-Interp
Negative Logits
this
0.67
هذا
0.67
this
0.66
এই
0.64
этого
0.64
этом
0.63
この
0.63
questo
0.61
този
0.61
цьому
0.60
POSITIVE LOGITS
topic
0.65
question
0.51
موضوع
0.51
answer
0.50
assunto
0.50
phenomenon
0.49
Answer
0.49
topik
0.48
답변
0.47
onderwerp
0.45
Activations Density 0.004%