INDEX
Explanations
phrases related to time and events
specific numerals, particularly related to quantities or measurements
New Auto-Interp
Negative Logits
Jur
-0.54
SPACE
-0.52
Auth
-0.51
ONEY
-0.50
Present
-0.50
wolves
-0.50
Kills
-0.49
hess
-0.49
zynski
-0.49
Hannity
-0.48
POSITIVE LOGITS
terday
0.72
-+-+
0.66
iste
0.63
newcom
0.63
tenance
0.60
ãĥ¼ãĥĨ
0.57
velength
0.56
osterone
0.56
mares
0.56
iterator
0.56
Activations Density 0.851%