Explanations
quoted speech and dialogue
NEGATIVE LOGITS
ften      -0.15
ember     -0.15
fen       -0.14
_FWD      -0.14
åŁ        -0.14
Wander    -0.14
PMC       -0.14
że        -0.14
lander    -0.14
izza      -0.13
POSITIVE LOGITS
Moreno    0.17
comings   0.16
ardin     0.16
metav     0.15
redi      0.15
overt     0.15
-bind     0.15
ross      0.14
ilip      0.14
ToProps   0.14
Activations Density: 0.171%