INDEX
Explanations
references to dates and numerical values related to events
Head Attr Weights
0:0.20
1:0.15
2:0.08
3:0.05
4:0.05
5:0.09
6:0.03
7:0.02
8:0.07
9:0.07
10:0.07
11:0.07
Negative Logits
yrinth    -2.00
chieve    -1.99
ibaba     -1.99
eatures   -1.94
inki      -1.92
matical   -1.87
Reviewer  -1.84
uilt      -1.84
isoft     -1.84
avorite   -1.84
Positive Logits
:        1.79
Usually  1.76
@        1.75
,        1.69
;        1.68
Colin    1.64
,"       1.63
Joel     1.63
article  1.62
:(       1.56
Activations Density 0.001%