Explanations
numbers and words related to events or controversial matters
Negative Logits
Ĥª            -0.67
Tycoon        -0.62
«ĺ            -0.59
raids         -0.59
Annotations   -0.59
Controlled    -0.57
Rally         -0.57
Siege         -0.57
Bounce        -0.56
Elections     -0.56
Positive Logits
own           1.02
ancer         0.97
escription    0.90
inal          0.88
nesday        0.84
paren         0.84
icular        0.83
uc            0.82
alf           0.81
ational       0.81
Activations Density: 0.170%