Explanations
phrases that express expectations or surprises related to outcomes
Negative Logits
ħĮ                  -0.07
dio                 -0.07
ipse                -0.07
ableViewController  -0.06
IPS                 -0.06
ESCO                -0.06
cef                 -0.06
mev                 -0.06
inois               -0.06
уков                -0.06
Positive Logits
surprising    0.18
surprises     0.17
surprisingly  0.16
surprise      0.16
surpr         0.14
surprised     0.14
Surprise      0.14
překvap       0.13
unexpected    0.12
unexpected    0.11
Activations Density 0.034%