INDEX
Explanations
proper nouns preceded by an apostrophe
instances of the letter 'O' followed by an apostrophe
New Auto-Interp
Negative Logits
eclipse
-0.78
etheless
-0.74
vg
-0.65
inhibited
-0.65
anwhile
-0.63
»Ĵ
-0.62
cius
-0.62
urers
-0.62
thrill
-0.62
asers
-0.61
POSITIVE LOGITS
Brien
1.00
Connor
0.97
clock
0.89
Neill
0.89
Malley
0.87
Donnell
0.86
Leary
0.82
Rah
0.79
kay
0.77
Reilly
0.77
Activations Density 0.015%