INDEX
Negative Logits
pays
0.37
arthed
0.37
arden
0.37
monetization
0.36
dieting
0.35
thous
0.35
puppet
0.35
hikers
0.35
monet
0.34
u
0.34
POSITIVE LOGITS
Prog
0.44
Alcan
0.43
onClose
0.41
Bande
0.40
PROBLE
0.38
เนื้อ
0.38
Reach
0.38
графи
0.38
진
0.37
eloquence
0.37
Activations Density 0.000%