INDEX
Explanations
statements related to defendants’ arguments and legal justifications
Negative Logits
idot     -0.17
LEC      -0.17
bourg    -0.17
JC       -0.16
REP      -0.15
yth      -0.15
.mb      -0.15
εξ       -0.14
ephy     -0.14
endas    -0.14
Positive Logits
olumbia  0.16
äre      0.14
Laure    0.14
洋       0.14
antic    0.14
-tools   0.13
onte     0.13
ابل      0.13
reste    0.13
tik      0.13
Activations Density 0.613%
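
The density figure above can be read as the share of token positions on which the feature fires. The following is a minimal sketch under that assumption; the function name `activation_density`, the zero threshold, and the sample data are all hypothetical, and the dashboard that produced 0.613% may count activations differently.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means (active positions) / (all positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1,000 token positions, 6 of which activate the feature.
sample = [0.0] * 994 + [0.4, 1.2, 0.1, 2.5, 0.7, 0.3]
density = activation_density(sample)
print(f"{density:.3%}")
```

With a density below 1%, the feature is sparse: it is silent on the vast majority of tokens, which is consistent with a narrow explanation such as the one listed above.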