INDEX
Explanations
phrases related to regulations, laws, and controls
instances of punctuation in various forms, particularly periods
Negative Logits
wcs      -0.72
liner    -0.66
nen      -0.63
onite    -0.63
stadt    -0.59
abies    -0.59
ozy      -0.57
cil      -0.56
favour   -0.56
iera     -0.55
Positive Logits
.        1.68
."       1.67
.)       1.40
..       1.07
._       1.07
_.       1.04
,"       0.99
-.       0.98
(.       0.82
`.       0.81
Activations Density 0.018%
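
As a rough illustration of the density figure above, the sketch below assumes that activation density means the fraction of sampled token positions on which the feature fires (activation above zero). The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" is the share of sampled tokens on which the
    feature is active; the dashboard's exact definition may differ.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, only 2 with nonzero activation.
sample = [0.0] * 9998 + [1.3, 0.4]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.018% would correspond to the feature firing on roughly 18 of every 100,000 sampled tokens.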