INDEX
Negative Logits
ãĥ¥
-0.88
fut
-0.68
shorth
-0.67
answ
-0.67
buck
-0.65
sing
-0.65
artif
-0.65
commer
-0.64
footing
-0.63
matical
-0.62
POSITIVE LOGITS
âķ
0.83
srfAttach
0.77
//[
0.75
=>
0.74
>]
0.71
Parables
0.70
*.
0.70
^{0.68
Committees
0.66
ONSORED
0.66
Activations Density 11.067%