INDEX
Explanations
adverbs and descriptive words
expressions of disbelief or criticism
New Auto-Interp
Negative Logits
successfully
-0.71
interrupted
-0.62
usalem
-0.62
secured
-0.61
respectively
-0.61
colon
-0.60
benefic
-0.59
predetermined
-0.59
kamp
-0.59
grievance
-0.58
POSITIVE LOGITS
atches
0.80
ÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤ
0.73
eps
0.70
Ĥİ
0.69
compared
0.67
than
0.67
deals
0.66
til
0.65
notations
0.65
heres
0.64
Activations Density 0.405%