INDEX
Explanations
instances of the word "get" in various contexts
Negative Logits
oje      -0.16
ction    -0.15
alama    -0.15
net      -0.15
nez      -0.15
бач      -0.14
icken    -0.14
олом     -0.14
nable    -0.14
859      -0.14
Positive Logits
rid      0.16
vro      0.16
ters     0.16
-dr      0.14
GMEM     0.14
ONES     0.14
keh      0.14
vr       0.14
ewis     0.13
apy      0.13
Activations Density 0.060%