INDEX
Explanations
references to legal documents or regulatory agencies
occurrences of brackets or bracket-like characters
New Auto-Interp
Negative Logits
hog
-0.81
lemon
-0.71
consumption
-0.69
uncertain
-0.65
maximum
-0.65
seams
-0.65
stag
-0.63
beet
-0.63
Orn
-0.62
raspberry
-0.62
POSITIVE LOGITS
Pg
1.34
â̦]
1.26
...]
1.23
sic
1.08
paragraph
1.04
Footnote
0.99
interstitial
0.97
src
0.96
?]
0.92
](
0.88
Activations Density 0.019%