INDEX
Explanations
instances of the word "down"
New Auto-Interp
Head Attr Weights
0:0.08
1:0.09
2:0.08
3:0.08
4:0.08
5:0.07
6:0.07
7:0.08
8:0.07
9:0.08
10:0.07
11:0.10
Negative Logits
respons
-2.88
actionGroup
-2.86
�
-2.83
htt
-2.79
submission
-2.77
Reply
-2.71
Donation
-2.71
)</
-2.68
lia
-2.62
EntityItem
-2.60
POSITIVE LOGITS
sych
3.53
juven
3.37
izo
2.76
Jaguar
2.75
Tigers
2.73
Bigfoot
2.70
Bowie
2.64
Negro
2.63
assault
2.61
odox
2.60
Activations Density 0.000%