INDEX
Explanations
references to specific years
New Auto-Interp
Negative Logits
969
-0.18
COVID
-0.18
COVID
-0.17
ima
-0.16
569
-0.16
itary
-0.15
Covid
-0.15
59
-0.15
03
-0.15
04
-0.15
POSITIVE LOGITS
Sevent
0.20
ä¸ĥ
0.19
Eight
0.18
Û·
0.18
eight
0.17
à¥Ń
0.17
Eight
0.17
sevent
0.17
ä¸ĥ
0.17
7
0.17
Activations Density 0.048%