Explanations
instances of the word "in"
Head Attr Weights
0:0.09
1:0.11
2:0.05
3:0.05
4:0.04
5:0.09
6:0.13
7:0.07
8:0.10
9:0.05
10:0.08
11:0.08
Negative Logits
Ballard     -1.74
Acc         -1.71
Acc         -1.62
Colleges    -1.61
Pric        -1.55
Mot         -1.54
Coat        -1.52
Ap          -1.51
Atlantic    -1.51
Aeg         -1.49
Positive Logits
teasp       1.74
bleed       1.64
�           1.61
diff        1.61
ÃÂ          1.61
destro      1.60
waning      1.60
millenn     1.58
rall        1.55
batter      1.55
Activations Density 0.000%