Explanations
references to various wars and military conflicts
Negative Logits
essen         -0.16
ing           -0.14
ities         -0.14
eman          -0.14
ife           -0.14
ions          -0.14
leine         -0.13
/view         -0.13
erson         -0.13
umberland     -0.13
Positive Logits
-era          0.44
era           0.35
Era           0.33
era           0.29
-period       0.22
ERA           0.22
period        0.19
bler          0.19
میلادی        0.18
dönemde       0.17
Activations Density 0.033%