INDEX
Explanations
punctuation marks, specifically parentheses and colons
New Auto-Interp
Negative Logits
BibitemShut
-0.89
$_"
-0.82
bibfield
-0.79
pleaſure
-0.78
déput
-0.76
feroit
-0.75
vérit
-0.74
protoimpl
-0.74
mariée
-0.73
kaarangay
-0.71
POSITIVE LOGITS
his
0.78
↵
0.75
>{@0.65
0.64
him
0.62
"
0.59
↵↵
0.58
'
0.57
-
0.56
himself
0.56
Activations Density 0.329%