INDEX
    Explanations

    mentions of people's residences or living situations

    instances of the word "lives" in various contexts

    New Auto-Interp
    Negative Logits
    sonian
    -0.71
    ociated
    -0.68
    xual
    -0.66
    phabet
    -0.66
    ractive
    -0.65
    roe
    -0.64
    orically
    -0.64
    ession
    -0.64
    Applic
    -0.64
    essee
    -0.63
    POSITIVE LOGITS
    lihood
    0.84
    chool
    0.79
    stead
    0.74
    blog
    0.74
     Forever
    0.73
     abroad
    0.71
    rio
    0.71
     Juliet
    0.70
     vic
    0.69
     ashore
    0.69
    Act Density 0.019%

    No Known Activations