INDEX
Explanations
phrases that emphasize particular significance or importance
New Auto-Interp
Negative Logits
noDo
-0.73
mika
-0.62
Salim
-0.61
ogly
-0.61
vestibular
-0.59
Vah
-0.58
Poz
-0.58
Bim
-0.58
mourut
-0.58
Vri
-0.57
POSITIVE LOGITS
Especially
1.89
Especially
1.83
especially
1.83
especially
1.79
Particularly
1.75
particularly
1.69
particularly
1.67
pecially
1.59
ticularly
1.57
особенно
1.46
Activations Density 0.114%