INDEX
Explanations
references to New York or associated abbreviations
New Auto-Interp
Negative Logits
lem
-0.17
yi
-0.15
na
-0.15
cân
-0.15
célib
-0.14
overs
-0.14
DataURL
-0.14
ocator
-0.14
zÃŃ
-0.14
paren
-0.14
POSITIVE LOGITS
quist
0.25
bble
0.24
QUI
0.21
times
0.20
gren
0.19
NÃį
0.18
daily
0.18
borg
0.18
lon
0.18
togroup
0.17
Activations Density 0.017%