INDEX
Explanations
phrases related to legal and political matters
terms related to legal and political contexts
Negative Logits
+.        -0.68
$.        -0.65
*.        -0.60
_.        -0.58
!.        -0.57
ecause    -0.55
rather    -0.54
;;;;      -0.54
*.        -0.53
instead   -0.53
Positive Logits
consists    0.76
consisted   0.75
comprises   0.68
is          0.68
seemed      0.68
has         0.68
appears     0.65
continues   0.64
isn         0.63
seems       0.63
Activations Density 0.706%