INDEX
Explanations
concepts related to environmental issues and responsibility
New Auto-Interp
Head Attr Weights
0:0.08
1:0.02
2:0.12
3:0.13
4:0.23
5:0.04
6:0.05
7:0.02
8:0.10
9:0.05
10:0.05
11:0.07
Negative Logits
GOODMAN
-1.81
ixties
-1.44
urbed
-1.42
orie
-1.41
}}
-1.39
¶
-1.37
士
-1.36
oren
-1.35
}}}
-1.31
Yesterday
-1.29
POSITIVE LOGITS
interstitial
1.57
rather
1.49
Interstitial
1.37
nonetheless
1.36
fund
1.35
Burr
1.32
Incarn
1.31
rather
1.29
Actual
1.26
selves
1.25
Activations Density 0.079%