INDEX
Explanations
words related to nighttime activities or settings
references to nighttime or nightlife
New Auto-Interp
Negative Logits
achev
-0.81
xon
-0.80
pta
-0.79
cules
-0.74
qqa
-0.72
emort
-0.71
berman
-0.69
qua
-0.69
Canaver
-0.68
rompt
-0.68
POSITIVE LOGITS
cap
1.03
fall
1.01
mares
1.00
life
0.98
mar
0.94
club
0.89
mare
0.86
light
0.86
sky
0.80
ow
0.78
Activations Density 0.045%