Explanations
the word "only" in various contexts
Negative Logits
externi     -0.58
oppure      -0.53
meurt       -0.52
magari      -0.50
fuoco       -0.50
véhic       -0.49
bower       -0.49
多人        -0.49
souhaite    -0.49
Nähe        -0.49
Positive Logits
satunya     1.32
jedin       0.99
唯一的      0.93
唯一        0.87
enige       0.86
eneste      0.81
einzigen    0.78
sole        0.77
Signalez    0.76
един        0.76
Activations Density 0.160%