INDEX
Explanations
instances of the word "The" at the beginning of sentences
New Auto-Interp
Negative Logits
<eos>
-0.47
TextBoxColumn
-0.36
ể
-0.34
cèse
-0.34
(
-0.33
</h1>
-0.33
rep
-0.32
table
-0.32
count
-0.31
cho
-0.31
POSITIVE LOGITS
houſe
0.88
0.81
ſte
0.79
niſſe
0.79
Houſe
0.78
pleaſure
0.77
ſche
0.75
fashiola
0.74
ſta
0.74
Monfieur
0.74
Activations Density 0.427%