Explanations
references to nighttime events or themes
Negative Logits
afternoon     -0.70
morning       -0.65
evening       -0.63
afternoons    -0.61
mornings      -0.60
下午          -0.60
daytime       -0.59
Afternoon     -0.59
evenings      -0.57
lunchtime     -0.57
Positive Logits
shift         0.64
gown          0.63
ingale        0.61
shift         0.55
mar           0.54
Shift         0.52
cap           0.52
sky           0.49
Shift         0.46
caps          0.45
Activations Density: 0.112%