INDEX
Explanations
references to pockets in various contexts
New Auto-Interp
Negative Logits
sen
-0.18
zeitig
-0.17
eng
-0.17
sm
-0.16
si
-0.16
sb
-0.15
san
-0.15
yg
-0.15
Uy
-0.15
sz
-0.15
POSITIVE LOGITS
rung
0.19
omial
0.17
itan
0.16
pit
0.16
ized
0.16
ization
0.16
laus
0.16
atrix
0.16
ishment
0.15
loub
0.15
Activations Density 0.063%