INDEX
Explanations
words related to political figures and entities
abbreviations and acronyms in the text
New Auto-Interp
Negative Logits
rency
-0.85
ãĥ¼ãĥĨ
-0.72
ciating
-0.68
staking
-0.68
fastball
-0.66
neutrality
-0.62
enza
-0.62
compr
-0.62
aturdays
-0.61
CPI
-0.60
POSITIVE LOGITS
opter
0.73
throp
0.72
ovych
0.68
oslav
0.65
################
0.64
scope
0.63
akov
0.63
Harbor
0.63
ng
0.62
ania
0.62
Activations Density 0.222%