INDEX
Explanations
pronouns and verbs regarding capability or possibility
New Auto-Interp
Head Attr Weights
0:0.03
1:0.01
2:0.05
3:0.06
4:0.26
5:0.04
6:0.20
7:0.08
8:0.06
9:0.06
10:0.05
11:0.05
Negative Logits
constitu
-1.66
undrum
-1.50
setback
-1.50
Trin
-1.35
ordeal
-1.35
ゴン
-1.35
predicament
-1.33
trajectory
-1.33
Triple
-1.32
significance
-1.31
POSITIVE LOGITS
sake
1.68
lest
1.52
undo
1.43
don
1.42
ranged
1.37
iates
1.36
ptives
1.31
instead
1.29
docs
1.26
urat
1.26
Activations Density 0.028%