INDEX
Negative Logits
connected
0.48
oft
0.46
collective
0.46
inhibitors
0.46
knows
0.44
ong
0.44
showed
0.44
bs
0.43
handlebar
0.43
kau
0.43
POSITIVE LOGITS
쐞
0.59
cárcel
0.54
рода
0.53
חס
0.53
commentaires
0.49
_{+}^{0.49
COMMENT
0.49
ファイ
0.49
тексто
0.49
שׁ
0.49
Activations Density 0.006%