INDEX
Explanations
verbs indicating actions being taken or events happening
present tense verbs indicating actions or events taking place
New Auto-Interp
Negative Logits
underest
-0.66
/-
-0.63
incorrectly
-0.63
ministic
-0.63
overest
-0.62
lihood
-0.62
anon
-0.61
hes
-0.61
mistake
-0.60
wrong
-0.60
POSITIVE LOGITS
ãĤ©
0.71
ãĥ¥
0.63
redes
0.60
FORM
0.60
festive
0.60
icipated
0.58
advoc
0.58
veland
0.57
ersed
0.57
çīĪ
0.57
Activations Density 0.457%