INDEX
    Explanations

    mentions of the city "Detroit."

    Negative Logits
    Token           Logit
    "<bos>"         -0.87
    " intersper"    -0.82
    " Lorsqu"       -0.76
    " encomp"       -0.72
    " reconno"      -0.69
    " apprehen"     -0.66
    " gild"         -0.66
    " Jusqu"        -0.65
    " Pense"        -0.65
    " unve"         -0.64
    Positive Logits
    Token           Logit
    " Detroit"      1.02
    "Detroit"       0.97
    " DETROIT"      0.90
    "detroit"       0.90
    " Det"          0.87
    "ROIT"          0.75
    " PLWABN"       0.75
    "Det"           0.72
    " det"          0.71
    " DET"          0.67
    Activation Density: 0.619%

    No Known Activations