INDEX
Negative Logits
{(--0.73
icylic
-0.61
={`-0.60
заве
-0.58
{(-0.58
bit
-0.58
ailleurs
-0.58
deed
-0.58
hit
-0.58
episode
-0.57
POSITIVE LOGITS
br
1.92
br
1.40
Br
1.40
Br
1.33
BR
1.10
BR
1.04
Brind
0.98
brz
0.89
Brü
0.88
Brink
0.85
Activations Density 0.025%