INDEX
Explanations
proper names or terms prefixed with "El"
New Auto-Interp
Negative Logits
hire
-0.65
olicy
-0.63
raft
-0.63
charge
-0.61
Eliot
-0.60
henko
-0.60
nant
-0.59
LESS
-0.59
ratulations
-0.59
ELS
-0.58
POSITIVE LOGITS
Paso
1.19
abor
1.14
igible
1.08
Salvador
1.01
usive
1.00
iza
0.99
Niño
0.99
azer
0.96
apsed
0.93
isa
0.90
Activations Density 0.017%