Explanations
references to the year 2001
Negative Logits
accompan    -0.75
Nation      -0.74
ony         -0.69
atted       -0.69
attled      -0.67
shaw        -0.66
rom         -0.66
agues       -0.66
asts        -0.64
alling      -0.63
Positive Logits
ĸļ          0.98
ãĥĥãĥī      0.89
-'          0.84
ãĥ³ãĤ¸      0.78
sov         0.73
etics       0.73
ocide       0.72
aru         0.68
ÙĨ          0.68
ãĥ¼ãĥĨãĤ£    0.67
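
Several of the positive-logit tokens (for example ãĥĥãĥī and ÙĨ) look like the printable rendering that GPT-2-style byte-level BPE vocabularies use for raw bytes. The page does not say which tokenizer produced them, so the following is only a sketch under that assumption: it rebuilds the standard byte-to-character table and maps a displayed token back to its bytes. The names `bytes_to_unicode` and `decode_token` are illustrative helpers, not part of the dashboard.

```python
def bytes_to_unicode():
    """Rebuild the byte -> printable-character table used by GPT-2-style
    byte-level BPE (assumption: the tokens above use this scheme)."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to an unused code point at 256 + n.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token):
    """Map a displayed token back to bytes, then decode as UTF-8.
    Tokens that are only a fragment of a multi-byte character cannot be
    decoded alone and come back as U+FFFD replacement characters."""
    raw = bytearray()
    for ch in token:
        if ch in UNICODE_TO_BYTE:
            raw.append(UNICODE_TO_BYTE[ch])
        else:
            # Character outside the table: keep its own UTF-8 bytes.
            raw.extend(ch.encode("utf-8"))
    return bytes(raw).decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĥĥãĥī", "ãĥ³ãĤ¸", "ãĥ¼ãĥĨãĤ£", "ÙĨ", "ĸļ", "sov"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under this assumption, ãĥĥãĥī, ãĥ³ãĤ¸ and ãĥ¼ãĥĨãĤ£ decode to katakana fragments and ÙĨ to an Arabic letter, while ĸļ consists only of UTF-8 continuation bytes and has no standalone reading. Plain ASCII tokens such as sov pass through unchanged.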
Activations Density 0.012%