INDEX
Explanations
phrases related to nighttime activities
references to the concept of night
New Auto-Interp
Negative Logits
achev
-0.81
Canaver
-0.79
pta
-0.75
essors
-0.69
elsen
-0.68
cules
-0.67
berman
-0.67
xon
-0.67
ignty
-0.66
ttle
-0.65
POSITIVE LOGITS
cap
1.04
mar
1.01
fall
0.99
life
0.95
light
0.91
night
0.88
mares
0.81
urnal
0.81
stand
0.75
sky
0.75
Activations Density 0.033%