INDEX
Negative Logits
uzzle
-0.29
compl
-0.28
ocop
-0.28
usta
-0.27
Gast
-0.27
occup
-0.27
cmp
-0.27
ä¸ĵ访
-0.26
ActionType
-0.26
åµĮ
-0.25
POSITIVE LOGITS
ili
0.30
ä¸Ĭä¸ĭ游
0.26
rium
0.26
Kil
0.26
ability
0.25
orado
0.25
åĬĽè¿ĺæĺ¯
0.25
mind
0.25
ç±³ç²ī
0.25
(low
0.25
Activations Density 0.656%