INDEX
    Explanations

    information about where people live

    phrases that indicate where people reside

    New Auto-Interp
    Negative Logits
    umbs
    -0.79
    elight
    -0.79
    ery
    -0.76
    oug
    -0.75
    xual
    -0.73
    anche
    -0.71
    ion
    -0.70
    aptic
    -0.66
    ional
    -0.65
    sonian
    -0.65
    POSITIVE LOGITS
     upstairs
    0.92
     vic
    0.87
     downstairs
    0.85
    lihood
    0.81
    chool
    0.78
    stead
    0.77
     indoors
    0.75
     abroad
    0.74
     paycheck
    0.74
     peacefully
    0.71
    Act Density 0.036%

    No Known Activations