INDEX
Explanations
active verbs that indicate significant emotional or physical actions
New Auto-Interp
Head Attr Weights
0:0.05
1:0.02
2:0.10
3:0.09
4:0.06
5:0.02
6:0.28
7:0.12
8:0.05
9:0.02
10:0.07
11:0.06
Negative Logits
Jav
-1.55
Hemp
-1.38
designation
-1.31
filing
-1.31
Cham
-1.31
sake
-1.30
Sham
-1.25
Mandatory
-1.24
Jah
-1.24
Xavier
-1.22
POSITIVE LOGITS
etheless
1.92
grav
1.89
ench
1.73
uncontroll
1.60
ulty
1.55
downwards
1.50
ynes
1.50
blems
1.49
kered
1.49
ract
1.47
Activations Density 0.016%