INDEX
NEGATIVE LOGITS
rieg      -0.76
brim      -0.76
ongs      -0.70
76561     -0.69
quit      -0.68
mx        -0.67
atana     -0.65
blogs     -0.65
lyss      -0.64
fing      -0.63
POSITIVE LOGITS
incial    0.66
crow      0.63
itto      0.63
Krish     0.62
Irving    0.61
Autism    0.60
ersen     0.60
Aren      0.58
deficits  0.58
vaccine   0.57
Activations Density: 0.000%