INDEX
Negative Logits
+w
-0.07
οργ
-0.07
delta
-0.07
user
-0.06
Kir
-0.06
eder
-0.06
Método
-0.06
real
-0.06
Yer
-0.06
rotated
-0.06
POSITIVE LOGITS
sc
0.11
Sc
0.11
SC
0.09
scout
0.09
scam
0.08
(sc
0.07
Scalia
0.07
(student
0.07
Sc
0.07
scram
0.07
Activations Density 0.071%