INDEX
Explanations
mentions of legal citations or references
references to legal citations
New Auto-Interp
Negative Logits
yss
-0.73
rox
-0.73
yles
-0.71
nut
-0.71
hm
-0.70
independ
-0.69
wagen
-0.69
hma
-0.68
tty
-0.68
ebus
-0.67
POSITIVE LOGITS
citation
1.51
citations
1.41
Citation
0.93
cited
0.85
Clicker
0.81
footnote
0.79
cite
0.78
evid
0.75
ibli
0.74
Forbidden
0.71
Activations Density 0.014%