INDEX
Explanations
proper nouns, particularly names and places
New Auto-Interp
Negative Logits
Lauder
-0.66
hadur
-0.65
บาย
-0.63
backer
-0.60
réaliste
-0.60
twimg
-0.60
Hig
-0.59
Ľ
-0.59
tranquille
-0.59
isateur
-0.59
POSITIVE LOGITS
Memphis
0.77
Tamil
0.73
Tenn
0.72
RegressionTest
0.72
Memphis
0.69
NESSEE
0.68
Tennessee
0.68
Tamil
0.67
Nashville
0.66
Tennessee
0.65
Activations Density 0.681%