INDEX
Explanations
various forms of the word "for" and its associated phrases
New Auto-Interp
Head Attr Weights
0:0.02
1:0.03
2:0.05
3:0.06
4:0.05
5:0.04
6:0.42
7:0.03
8:0.06
9:0.07
10:0.07
11:0.05
Negative Logits
Channel
-1.38
Cly
-1.36
ktop
-1.30
orphans
-1.18
Picks
-1.10
lil
-1.09
Brill
-1.08
FTWARE
-1.04
knot
-1.04
Horde
-1.04
POSITIVE LOGITS
��
1.79
acus
1.68
oux
1.45
hene
1.43
opa
1.42
ón
1.39
cedented
1.38
inous
1.38
assault
1.37
utical
1.36
Activations Density 0.005%