INDEX
Explanations
proper nouns, particularly names of places and institutions
New Auto-Interp
Negative Logits
ंदीखरीदारी
-0.88
تانيه
-0.82
ambut
-0.70
يكب
-0.69
orcid
-0.68
Демографія
-0.67
auffi
-0.64
autorest
-0.63
iſt
-0.61
Monfieur
-0.61
POSITIVE LOGITS
Philadelphia
1.00
Philadelphia
1.00
Philly
0.97
Pennsylvania
0.96
Pennsylvania
0.89
Philly
0.86
Connecticut
0.83
ADELPHIA
0.78
Connecticut
0.77
Rhode
0.77
Activations Density 0.708%