INDEX
    Explanations

    proper nouns, particularly names and places

    Negative Logits
    Token               Logit
    " Lauder"           -0.66
    "hadur"             -0.65
    "บาย"               -0.63
    "backer"            -0.60
    " réaliste"         -0.60
    "twimg"             -0.60
    " Hig"              -0.59
    "Ľ"                 -0.59
    " tranquille"       -0.59
    "isateur"           -0.59
    Positive Logits
    Token               Logit
    " Memphis"          0.77
    " Tamil"            0.73
    " Tenn"             0.72
    "RegressionTest"    0.72
    "Memphis"           0.69
    "NESSEE"            0.68
    "Tennessee"         0.68
    "Tamil"             0.67
    " Nashville"        0.66
    " Tennessee"        0.65
    Activation Density: 0.681%

    No Known Activations