INDEX
Explanations
elements related to recognition or awards
New Auto-Interp
Head Attr Weights
0:0.06
1:0.11
2:0.03
3:0.05
4:0.04
5:0.31
6:0.04
7:0.03
8:0.07
9:0.09
10:0.07
11:0.04
Negative Logits
Olympia
-1.43
injuring
-1.42
responding
-1.41
Bulls
-1.38
shelter
-1.38
Meadows
-1.37
shelters
-1.36
believing
-1.34
electronics
-1.34
safest
-1.34
POSITIVE LOGITS
leader
2.14
reb
1.99
leaders
1.93
moderate
1.92
yet
1.91
recogn
1.86
basic
1.83
average
1.82
tro
1.82
bleacher
1.80
Activations Density 0.001%