INDEX
Explanations
occurrences of verbs or phrases indicating future intentions or confirmations
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.10
3:0.13
4:0.13
5:0.02
6:0.20
7:0.15
8:0.03
9:0.03
10:0.07
11:0.06
Negative Logits
instead
-1.44
instead
-1.38
Wouldn
-1.35
izontal
-1.31
rather
-1.29
simplicity
-1.27
sake
-1.22
ordinary
-1.21
icons
-1.19
rather
-1.19
POSITIVE LOGITS
yet
2.03
satisf
1.69
yet
1.65
confir
1.62
nor
1.51
acknow
1.47
officially
1.44
agre
1.40
laun
1.40
formally
1.39
Activations Density 0.006%