INDEX
Explanations
phrases related to military operations and conflict
references to significant military or legal events
New Auto-Interp
Negative Logits
minist
-0.63
Sym
-0.62
Sund
-0.59
IRC
-0.59
administ
-0.57
ERG
-0.56
Var
-0.56
Sit
-0.54
Rot
-0.53
ACTIONS
-0.52
POSITIVE LOGITS
.''.
0.99
etc
0.91
respectively
0.90
)).
0.80
.).
0.76
.''
0.74
.[
0.74
."[
0.73
''.
0.70
.'"
0.70
Activations Density 1.207%