Negative Logits
Token               Logit
natureconservancy   -0.64
ked                 -0.63
zag                 -0.62
stration            -0.62
[|                  -0.61
stice               -0.61
str                 -0.60
agus                -0.60
Institution         -0.59
idad                -0.58
Positive Logits
Token               Logit
adays               2.06
here                1.41
heres               0.87
herer               0.83
Playing             0.82
suppose             0.81
imagine             0.79
Playing             0.74
Comes               0.73
THAT                0.72
Activations Density: 0.038%