INDEX
Explanations
imperative verbs indicating action or movement
New Auto-Interp
Head Attr Weights
0:0.09
1:0.07
2:0.06
3:0.08
4:0.07
5:0.09
6:0.09
7:0.07
8:0.09
9:0.07
10:0.08
11:0.08
Negative Logits
resignation
-2.10
mourning
-2.10
lawsuits
-2.07
litigation
-2.06
weddings
-2.01
segregation
-2.01
cance
-2.00
tightening
-1.95
shutting
-1.95
discontent
-1.95
POSITIVE LOGITS
pei
2.48
aho
2.31
ichen
2.27
ulhu
2.25
inois
2.24
igree
2.14
guessed
2.14
obi
2.11
agu
2.11
jiang
2.09
Activations Density 0.000%