INDEX
Explanations
instructions and steps for navigating user interfaces
New Auto-Interp
Head Attr Weights
0:0.11
1:0.03
2:0.08
3:0.18
4:0.04
5:0.25
6:0.02
7:0.07
8:0.04
9:0.03
10:0.05
11:0.03
Negative Logits
mockery
-1.71
snipers
-1.58
hindsight
-1.55
DoS
-1.55
clerks
-1.52
confusion
-1.51
pedestrians
-1.50
fools
-1.49
motorists
-1.48
bystanders
-1.48
POSITIVE LOGITS
Selected
2.42
ategories
1.86
Categories
1.81
Schedule
1.81
Contents
1.78
ibliography
1.72
................................................................
1.72
Sources
1.71
tab
1.66
selected
1.65
Activations Density 0.546%