INDEX
Explanations
proper nouns and associated organizations or titles
instances of punctuation, particularly periods in the text
New Auto-Interp
Negative Logits
stray
-0.72
dips
-0.69
indul
-0.68
achievable
-0.67
riet
-0.67
urance
-0.66
izoph
-0.65
fleeting
-0.65
ballpark
-0.65
pees
-0.64
POSITIVE LOGITS
His
1.18
Together
1.18
Both
1.15
He
1.15
She
1.14
Likewise
1.14
Similarly
1.10
Her
1.07
Previously
1.05
Others
1.03
Activations Density 0.637%