INDEX
Explanations
proper names with an apostrophe
instances of the letter 'O' followed by a single quote
New Auto-Interp
Negative Logits
vg
-0.79
eclipse
-0.74
incent
-0.68
thrill
-0.67
academics
-0.67
anwhile
-0.66
»Ĵ
-0.63
iae
-0.62
asers
-0.62
OPLE
-0.62
POSITIVE LOGITS
Brien
1.08
Neill
1.01
Connor
1.00
Malley
0.97
Donnell
0.94
clock
0.92
Leary
0.91
Reilly
0.85
Sullivan
0.82
Tel
0.79
Activations Density 0.015%