INDEX
Negative Logits
}&
0.44
neş
0.43
FormParams
0.41
جزء
0.40
vivi
0.40
lau
0.39
టెస్
0.39
متش
0.39
방법을
0.39
Uid
0.39
POSITIVE LOGITS
signals
0.46
agendas
0.44
opacity
0.43
watercolor
0.43
signal
0.43
opposition
0.43
narratives
0.41
\{-\0.41
broadly
0.40
audiences
0.40
Activations Density 0.000%