INDEX
Negative Logits
certainly
-0.09
Peterson
-0.08
cess
-0.08
sincere
-0.08
Tuy
-0.07
诚
-0.07
Santiago
-0.07
exhilarating
-0.07
surely
-0.07
-known
-0.07
POSITIVE LOGITS
делать
0.09
Neither
0.08
Neither
0.08
/null
0.08
비롯
0.08
clip
0.08
hace
0.08
Wa
0.08
neither
0.07
Wa
0.07
Activations Density 0.004%