INDEX
Explanations
dates written in the format "Month day" (e.g., "Aug 9")
dates, particularly those in August
New Auto-Interp
Negative Logits
unfor
-0.71
bottleneck
-0.69
OGR
-0.67
NIC
-0.65
charged
-0.64
unpre
-0.63
éĹĺ
-0.60
deaf
-0.59
merce
-0.59
isolate
-0.58
POSITIVE LOGITS
mented
1.19
mentation
1.03
enture
0.92
aret
0.89
Aug
0.89
sburg
0.88
iors
0.83
ital
0.80
Aug
0.80
rals
0.80
Activations Density 0.006%