INDEX
Explanations
proper nouns, particularly names of people and institutions
Negative Logits
Outlet           -0.15
ninger           -0.14
jang             -0.14
AccessException  -0.14
yg               -0.14
sob              -0.14
outlets          -0.14
arness           -0.14
arken            -0.13
.utilities       -0.13
Positive Logits
lessly   0.17
ovna     0.16
Stamp    0.15
ently    0.15
mænd     0.14
Lesser   0.14
ifton    0.14
AGE      0.13
enderit  0.13
Esc      0.13
Activations Density: 0.045%