INDEX
Explanations
words that indicate computer code syntax
references to traits of dishonesty or treachery
Negative Logits
intosh  -0.83
dos     -0.78
ertodd  -0.74
elaide  -0.73
daq     -0.72
rodu    -0.72
yles    -0.68
bons    -0.64
prus    -0.64
tel     -0.63
Positive Logits
aries   1.04
tru     1.01
ances   0.94
arily   0.93
ary     0.91
shire   0.89
icity   0.86
arial   0.83
ariat   0.79
aires   0.76
Activations Density: 0.031%