NEGATIVE LOGITS
things          0.50
很多人          0.48
mogelijkheden   0.46
很多            0.46
什麼            0.45
चीजों           0.45
Things          0.45
things          0.44
什么            0.43
многи           0.43
POSITIVE LOGITS
samples         0.70
representative  0.70
pairs           0.68
randomly        0.64
simulated       0.63
selected        0.63
sampled         0.59
representative  0.59
samples         0.58
appropriately   0.57
Activations Density 0.044%