INDEX
Explanations
the imperative form of verbs
New Auto-Interp
Head Attr Weights
0:0.07
1:0.08
2:0.08
3:0.08
4:0.08
5:0.08
6:0.08
7:0.09
8:0.09
9:0.08
10:0.07
11:0.07
Negative Logits
ominated
-2.19
slate
-1.95
Franchise
-1.93
/.
-1.80
crowdfunding
-1.79
calendar
-1.79
phalt
-1.75
hairst
-1.73
benches
-1.72
budget
-1.72
POSITIVE LOGITS
Ctrl
2.35
Takeru
2.18
Dalai
2.10
otonin
2.06
NRS
2.02
ulner
2.02
Leban
2.00
select
1.99
concess
1.99
ctrl
1.98
Activations Density 0.000%