INDEX
Explanations
instances of the word "didn't" and variations, particularly in questions and negatives
"did" and its capitalization variations
past tense questions and statements
New Auto-Interp
Negative Logits
rungsseite
-0.72
Houſe
-0.66
houſe
-0.63
pleaſure
-0.60
featureID
-0.60
estekak
-0.60
itſelf
-0.59
fillType
-0.59
protoimpl
-0.57
ientôt
-0.57
POSITIVE LOGITS
Did
0.65
Did
0.59
did
0.57
DID
0.56
did
0.55
DID
0.51
originally
0.50
previously
0.47
recently
0.44
volna
0.41
Activations Density 0.096%