INDEX
Explanations
terms related to political news and government actions
Negative Logits
Token               Logit
Ded                 -0.63
natureconservancy   -0.62
Balt                -0.61
partName            -0.59
nces                -0.58
arij                -0.58
journal             -0.58
horm                -0.57
obser               -0.56
ascript             -0.55
Positive Logits
Token               Logit
*.                  0.81
hegemony            0.78
'.                  0.77
.''.                0.75
ãĢĤ                 0.74
'."                 0.73
.                   0.71
amid                0.69
.*                  0.68
.'                  0.68
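The token ãĢĤ in the positive logits list looks like a raw vocabulary string, not mis-encoded text. The sketch below decodes it on the assumption that the tokens are shown in GPT-2-style byte-level BPE form, which the page does not state. The helper names `bytes_to_unicode` and `decode_token` are illustrative and are not part of the dashboard.

```python
def bytes_to_unicode():
    """Build the GPT-2-style table mapping each byte value to a printable character."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Remaining bytes are shifted to code points starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: displayed character -> original byte value.
BYTE_DECODER = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Convert a byte-level vocabulary string back to the UTF-8 text it encodes."""
    raw = bytes(BYTE_DECODER[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    print(decode_token("ãĢĤ"))   # ideographic full stop, U+3002
    print(decode_token("amid"))  # plain ASCII tokens are unchanged
```

Under that assumption, ãĢĤ is the byte sequence E3 80 82, the UTF-8 encoding of the CJK full stop "。". That would fit with the other top positive tokens, most of which are sentence-final punctuation.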
Activations Density: 0.439%