INDEX
Explanations
first-person questions and statements about personal experiences or actions
Negative Logits
meld      -0.16
lobs      -0.15
ÑĢÑı      -0.15
iston     -0.14
acock     -0.14
OND       -0.14
ilde      -0.14
odash     -0.14
ourg      -0.14
Gibson    -0.13
Positive Logits
ee        0.18
ees       0.15
UTE       0.15
roit      0.14
veh       0.14
ree       0.14
uble      0.13
λÏħ       0.13
PointF    0.13
doll      0.13
Activations Density 0.055%