INDEX
Explanations
dates, numbers, locations, and rankings within texts
New Auto-Interp
Negative Logits
glim
-0.79
othe
-0.73
needle
-0.66
shar
-0.66
polic
-0.63
ople
-0.63
serv
-0.63
corrid
-0.62
ranc
-0.62
omorph
-0.60
POSITIVE LOGITS
2008
1.75
1995
1.73
2009
1.73
2010
1.73
2009
1.72
2010
1.72
1997
1.72
2008
1.71
1998
1.71
2005
1.71
Activations Density 0.084%