INDEX
Explanations
dates, particularly the year "2001"
occurrences of the year 2001
New Auto-Interp
Negative Logits
accompan
-0.76
anch
-0.69
Nation
-0.69
ony
-0.68
rom
-0.68
romancer
-0.66
alling
-0.66
bon
-0.65
lying
-0.64
tics
-0.64
POSITIVE LOGITS
ĸļ
0.96
ãĥĥãĥī
0.87
-'
0.86
etics
0.74
sov
0.70
aru
0.69
ÙĨ
0.69
å¹
0.69
ãĥ³ãĤ¸
0.68
UFC
0.67
Activations Density 0.019%