INDEX
Negative Logits
_mb
-0.06
tur
-0.06
amma
-0.06
vortex
-0.06
ерк
-0.06
hovering
-0.06
뉴스
-0.06
.dropout
-0.06
LTE
-0.06
enums
-0.06
POSITIVE LOGITS
poisoned
0.08
soothing
0.07
scorer
0.07
damage
0.07
GK
0.07
DateTime
0.07
Fighting
0.06
defending
0.06
abuse
0.06
。',↵
0.06
Activations Density 0.002%