INDEX
Explanations
information related to news articles or reports
phrases related to military actions and societal unrest
Negative Logits
Token      Logit
wonderful  -0.60
partName   -0.59
NEVER      -0.58
LOT        -0.56
estern     -0.56
ONLY       -0.53
theless    -0.53
VERY       -0.53
HUGE       -0.52
doesnt     -0.51
Positive Logits
Token  Logit
.''.   1.14
.[     1.08
.      1.02
.).    1.00
.</    0.97
.''    0.96
.'     0.95
'.     0.92
.]     0.91
.ãĢį   0.89
Activations Density: 1.293%