INDEX
Explanations
instances of legal reasoning
Negative Logits
Token        Logit
Above        -2.14
above        -2.13
above        -2.08
Above        -2.03
ABOVE        -1.84
mentioned    -1.73
below        -1.69
below        -1.69
acima        -1.59
mentioned    -1.58
Positive Logits
Token        Logit
previously   1.05
Previously   1.02
previously   0.96
previous     0.94
Previously   0.94
auparavant   0.87
previamente  0.84
previous     0.81
Previous     0.79
PREVIOUS     0.79
Activations Density: 1.744%