INDEX
Explanations
mentions of United States history from the 1700-1800s along with place names and political terms related to the same eras.
place names
New Auto-Interp
Negative Logits
R
-0.53
L
-0.52
M
-0.51
ing
-0.50
İY
-0.49
g
-0.48
l
-0.48
r
-0.46
class
-0.46
R
-0.45
POSITIVE LOGITS
contentLoaded
0.87
ViewImports
0.82
MessageTagHelper
0.77
/−
0.77
الرياضيه
0.75
")));
0.75
nakalista
0.75
Roskov
0.75
]
0.73
'}>
0.72
Activations Density 32.577%