Explanations
pronouns and their associated actions in a narrative context
Negative Logits
cour          -0.16
enler         -0.14
createDate    -0.14
ubat          -0.14
iated         -0.14
asted         -0.14
alsex         -0.14
ius           -0.13
lete          -0.13
icas          -0.13
Positive Logits
ottom         0.16
eyJ           0.15
iver          0.14
ž             0.14
atri          0.14
izik          0.14
rary          0.13
ãĥĥãĥĪ        0.13
Huff          0.13
Doll          0.13
Activations Density 0.066%