Negative Logits
downside        0.42
PIPE            0.39
mistaken        0.38
образовањем     0.38
ஆன்             0.38
mistake         0.37
suitably        0.36
Anim            0.36
perm            0.36
出し            0.36
Positive Logits
William         0.54
William         0.52
gathers         0.48
默默            0.48
gathering       0.46
Gathering       0.46
gather          0.46
Gather          0.46
silently        0.45
WILLIAM         0.44
Activations Density 0.001%