INDEX
Negative Logits
ãģıãĤĵ
-0.18
aldi
-0.17
/she
-0.17
himself
-0.17
stesso
-0.15
妻
-0.15
اÙĨات
-0.15
LEC
-0.15
erral
-0.15
ilip
-0.15
POSITIVE LOGITS
hood
0.20
herself
0.20
iser
0.18
etics
0.18
izer
0.18
athed
0.17
empowerment
0.16
/people
0.15
.gwt
0.15
folk
0.15
Activations Density 0.064%