INDEX
Explanations
political and military-related terms and entities
New Auto-Interp
Negative Logits
ashtra
-0.71
IPM
-0.67
sung
-0.62
ealous
-0.61
equival
-0.60
Hodg
-0.59
ouk
-0.59
Brach
-0.59
axter
-0.58
DEBUG
-0.58
POSITIVE LOGITS
nings
0.85
stadt
0.75
papers
0.69
Rampage
0.67
Cinema
0.66
pac
0.66
fleet
0.65
models
0.65
Geo
0.64
izons
0.62
Activations Density 18.496%