Explanations
expressions of identity and existential questions
Negative Logits
weren       -0.17
suddenly    -0.16
imore       -0.16
uctor       -0.15
;element    -0.15
wasn        -0.15
doesn       -0.14
inia        -0.14
didn        -0.14
declining   -0.14
Positive Logits
exist       0.30
exists      0.29
existing    0.26
existence   0.26
existed     0.24
exists      0.24
存在        0.22
Exists      0.22
operates    0.22
operate     0.21
Activations Density 0.031%
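
The density figure above can be read as the share of token positions on which the feature fires. The sketch below illustrates that reading only; it assumes "fires" means an activation strictly above zero, and the function name `activation_density`, the `threshold` parameter, and the sample counts are hypothetical, not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: a feature counts as active when its activation is strictly
    greater than the threshold (zero by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 31 of them active.
sample = [0.0] * 99_969 + [1.0] * 31
density = activation_density(sample)
print(f"{density:.3%}")
```

With these made-up counts the ratio is 31 / 100,000, which is the same order of magnitude as the density reported above; a density this low indicates a feature that fires on only a small fraction of tokens.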