Explanations
instances of questioning or inquiry in dialogue
Head Attr Weights
0:0.03
1:0.03
2:0.05
3:0.37
4:0.05
5:0.03
6:0.08
7:0.08
8:0.03
9:0.09
10:0.04
11:0.05
Negative Logits
oise:-2.25
answered:-2.21
answered:-1.95
cean:-1.88
wikipedia:-1.85
anos:-1.84
censored:-1.83
cons:-1.82
sett:-1.81
Wend:-1.81
Positive Logits
Quarterly:2.28
Bulletin:2.25
Roll:2.19
Appearances:2.14
ACTION:2.09
Miscellaneous:2.07
Charge:2.06
TAG:2.01
utenberg:2.00
Monthly:1.99
Activations Density 0.020%