INDEX
Negative Logits
Businesses
-0.08
�
-0.08
veggie
-0.07
silently
-0.07
елен
-0.07
'l
-0.07
มาณ
-0.07
hij
-0.07
covert
-0.07
��
-0.07
POSITIVE LOGITS
poeta
0.10
poema
0.10
Defender
0.09
defender
0.09
poet
0.09
诗
0.08
nationalism
0.08
poem
0.08
condemnation
0.08
赏
0.08
Activations Density 0.010%