INDEX
Explanations
pronouns and their usage in relation to agency and actions within a context
New Auto-Interp
Negative Logits
ocker
-0.16
apper
-0.16
Evet
-0.15
upe
-0.15
asty
-0.15
nees
-0.15
484
-0.14
iki
-0.14
[assembly
-0.14
bans
-0.14
POSITIVE LOGITS
rogram
0.15
sıras
0.14
ocr
0.14
.scalablytyped
0.14
.volley
0.14
auce
0.14
è´¢
0.14
osemite
0.13
irts
0.13
íħĶ
0.13
Activations Density 0.477%