INDEX
Explanations
the word "that" and other general connectors or qualifying terms in the text
New Auto-Interp
Head Attr Weights
0:0.03
1:0.01
2:0.13
3:0.05
4:0.15
5:0.03
6:0.04
7:0.30
8:0.06
9:0.03
10:0.05
11:0.07
Negative Logits
esters
-2.29
ester
-1.96
forts
-1.78
uel
-1.75
eger
-1.65
plets
-1.60
Lans
-1.60
users
-1.53
ographers
-1.50
uries
-1.48
POSITIVE LOGITS
direction
1.86
allegiance
1.76
posture
1.55
belief
1.55
regard
1.51
stance
1.51
knots
1.50
outlook
1.50
affirmation
1.50
refusal
1.49
Activations Density 0.001%