INDEX
Negative Logits
ATOR
-0.84
ate
-0.77
fect
-0.73
egal
-0.72
åĬ
-0.72
aeda
-0.71
onne
-0.70
ators
-0.69
respons
-0.69
agos
-0.68
POSITIVE LOGITS
afternoon
1.58
morning
1.55
mornings
1.50
night
1.49
evening
1.42
Night
1.32
nights
1.29
evenings
1.11
Evening
1.07
morning
1.06
Activations Density 0.030%