INDEX
Explanations
phrases indicating obligations or restrictions regarding actions
New Auto-Interp
Head Attr Weights
0:0.01
1:0.01
2:0.06
3:0.07
4:0.04
5:0.03
6:0.16
7:0.37
8:0.02
9:0.03
10:0.05
11:0.09
Negative Logits
orr
-1.38
chord
-1.36
irregularities
-1.36
semantic
-1.32
languages
-1.31
76561
-1.29
MpServer
-1.28
labyrinth
-1.26
resemblance
-1.26
charm
-1.25
POSITIVE LOGITS
retire
1.45
deterrence
1.37
outweigh
1.36
Mechdragon
1.34
olate
1.33
ichita
1.31
ortality
1.30
inance
1.30
Mount
1.30
utenant
1.28
Activations Density 0.069%