INDEX
Negative Logits
graphic
-0.31
åĽ¾çĶ»
-0.29
Baum
-0.27
åĽ¾å½¢
-0.26
image
-0.26
erk
-0.26
ateral
-0.26
arrant
-0.26
portrait
-0.26
ä¸Ģå¦Ĥ
-0.25
POSITIVE LOGITS
ricks
0.31
bour
0.30
RID
0.28
rid
0.28
åıŁ
0.28
Touches
0.26
稳
0.25
æ®ĸæ°ij
0.24
RCT
0.24
roid
0.24
Activations Density 0.007%