INDEX
Explanations
imperative verbs indicating actions or commands
New Auto-Interp
Head Attr Weights
0:0.07
1:0.08
2:0.10
3:0.09
4:0.07
5:0.08
6:0.07
7:0.07
8:0.06
9:0.08
10:0.09
11:0.09
Negative Logits
synergy
-2.68
HER
-2.61
synerg
-2.61
requ
-2.55
��
-2.54
glim
-2.43
venge
-2.38
accompan
-2.37
streng
-2.36
oldown
-2.34
POSITIVE LOGITS
Livingston
2.17
Retrieved
2.15
Released
2.06
pg
2.02
Writing
2.02
pp
1.98
Explain
1.96
EDT
1.95
Locked
1.95
Fact
1.93
Activations Density 0.000%