INDEX
Explanations
entities related to legal cases and proceedings
New Auto-Interp
Negative Logits
),
-0.83
,
-0.76
which
-0.75
",
-0.74
”,
-0.72
}$,
-0.66
-
-0.64
whereas
-0.63
,
-0.60
but
-0.60
POSITIVE LOGITS
."""
1.10
》.
1.05
).
0.99
.}
0.96
.}}
0.95
.
0.92
.]
0.91
."]
0.91
дописавши
0.90
}.
0.90
Activations Density 0.997%