INDEX
Negative Logits
Rodrig
-0.08
idar
-0.08
948
-0.08
аем
-0.07
cop
-0.07
Circular
-0.07
dime
-0.07
8
-0.07
48
-0.07
cyclic
-0.07
POSITIVE LOGITS
Health
0.22
health
0.20
Health
0.16
HEALTH
0.15
health
0.14
-health
0.12
.Health
0.12
_HEALTH
0.11
.health
0.09
Healthy
0.09
Activations Density 0.042%