INDEX
Explanations
dates in the format of year followed by numerical month with the highest activation on 2011
specific years or dates
New Auto-Interp
Negative Logits
Rath
-0.80
furious
-0.72
VERTISEMENT
-0.71
staggering
-0.68
imb
-0.67
lifelong
-0.67
paddle
-0.65
cheek
-0.65
fer
-0.65
quarter
-0.65
POSITIVE LOGITS
theless
1.34
terday
1.19
DragonMagazine
1.15
odore
0.97
tenance
0.95
1984
0.91
2014
0.90
2001
0.90
romeda
0.87
1981
0.86
Activations Density 0.050%