INDEX
Explanations
occurrences of verbs and phrases indicating actions and processes
New Auto-Interp
Negative Logits
_CAT
-0.16
Kew
-0.15
/
-0.15
MAV
-0.14
Interpret
-0.14
yon
-0.14
è«ĸ
-0.14
fo
-0.14
Gew
-0.14
Slee
-0.14
POSITIVE LOGITS
_MACRO
0.15
.Listener
0.15
oyer
0.15
ogui
0.15
-alist
0.15
ODB
0.15
.jpa
0.14
ÏĥÏĦά
0.14
Absent
0.14
_AUX
0.14
Activations Density 0.002%