INDEX
Explanations
mentions of weather conditions
New Auto-Interp
Head Attr Weights
0:0.41
1:0.02
2:0.12
3:0.08
4:0.03
5:0.05
6:0.02
7:0.04
8:0.03
9:0.03
10:0.11
11:0.02
Negative Logits
filmmaking
-2.64
founders
-2.59
premise
-2.57
filmmakers
-2.53
exhib
-2.50
premises
-2.46
Osw
-2.43
Prometheus
-2.43
shipment
-2.34
Cinem
-2.32
POSITIVE LOGITS
guards
2.42
worsened
2.42
worsen
2.38
favourable
2.35
unpredictable
2.32
worsening
2.28
unpredict
2.27
黒
2.24
iven
2.24
Patch
2.19
Activations Density 0.076%