INDEX
Explanations
instances of pronouns and their associated actions or states
New Auto-Interp
Negative Logits
oppel
-0.15
opsis
-0.15
Semester
-0.14
uzzi
-0.14
во
-0.14
KF
-0.14
û
-0.14
bf
-0.13
θεν
-0.13
stvo
-0.13
POSITIVE LOGITS
amen
0.16
atham
0.14
inding
0.14
à¹Ģà¸Ĺ
0.14
cite
0.14
anted
0.14
dik
0.14
Miss
0.14
REATE
0.14
Scoped
0.13
Activations Density 0.037%