INDEX
Explanations
phrases related to legal and political analysis
Negative Logits
Token      Logit
Catalog    -0.80
igue       -0.75
istg       -0.73
/"         -0.73
.-         -0.69
Edge       -0.67
far        -0.66
forge      -0.65
.''.       -0.64
](         -0.64
Positive Logits
Token      Logit
these      0.88
those      0.84
mankind    0.82
humankind  0.80
each       0.77
our        0.77
course     0.75
the        0.73
sorts      0.73
oneself    0.72
Activations Density: 1.968%