INDEX
Explanations
names of political figures
New Auto-Interp
Negative Logits
ï¸ı
-0.80
Skydragon
-0.75
catentry
-0.64
REF
-0.63
Pwr
-0.62
perse
-0.62
fml
-0.61
territorial
-0.61
hormonal
-0.60
RET
-0.57
POSITIVE LOGITS
hof
1.04
pas
0.73
iae
0.71
haus
0.71
nder
0.69
ault
0.68
acht
0.68
eport
0.67
eeks
0.67
ieth
0.67
Activations Density 0.050%