INDEX
Negative Logits
Bris
-0.75
Dalla
-0.73
ционной
-0.72
︎
-0.67
ali
-0.65
expre
-0.64
egy
-0.63
Bris
-0.62
Cleopatra
-0.62
പ്
-0.61
POSITIVE LOGITS
Lords
1.40
LORD
1.27
lords
1.25
Lord
1.23
lord
1.21
LORD
1.19
Lord
1.10
lordship
1.08
lord
0.95
Lordship
0.95
Activations Density 0.008%