INDEX
Explanations
words related to legal terms and offenses
references to criminal charges and legal terminology
New Auto-Interp
Negative Logits
nar
-0.67
sonic
-0.59
scrap
-0.57
scorp
-0.57
wel
-0.57
Weasley
-0.56
sy
-0.56
vre
-0.56
spider
-0.55
pal
-0.55
POSITIVE LOGITS
?????-?????-
0.92
Same
0.85
ccording
0.84
Orderable
0.84
window
0.84
Repl
0.82
?????-
0.82
OVA
0.81
Located
0.80
Allows
0.80
Activations Density 0.149%