INDEX
NEGATIVE LOGITS
backstory       0.57
कथित            0.55
isoform         0.55
cognit          0.55
hypothesized    0.54
homophobic      0.53
ഷേധ             0.52
deont           0.52
underwhelming   0.51
ადამიან         0.50
POSITIVE LOGITS
NEW             0.55
ALL             0.54
unbeatable      0.54
FREE            0.52
SPECIAL         0.52
new             0.51
stets           0.51
prices          0.50
All             0.49
!               0.49
Activations Density 0.001%