INDEX
Explanations
references to legal principles or terminology
Negative Logits
rlen       -0.20
icolon     -0.16
achsen     -0.15
_python    -0.15
Journal    -0.14
mic        -0.14
Tribe      -0.14
ouri       -0.14
dump       -0.14
journal    -0.14
Positive Logits
Aqu        0.27
Scot       0.26
Dominican  0.25
gloss      0.24
Aqu        0.23
sch        0.21
Dante      0.21
Fri        0.21
Gros       0.21
medieval   0.20
Activations Density: 0.039%