INDEX
Explanations
words related to difficulty or intensity
New Auto-Interp
Negative Logits
hig
-0.77
Interstitial
-0.69
umbn
-0.66
edy
-0.66
nikov
-0.65
ATOR
-0.64
iae
-0.63
ioxide
-0.63
ãĤ´ãĥ³
-0.63
comings
-0.62
POSITIVE LOGITS
they
0.95
soever
0.84
we
0.83
he
0.81
THEY
0.81
you
0.77
she
0.75
constitutes
0.73
thou
0.73
awaits
0.72
Activations Density 0.926%