INDEX
    Explanations

    mentions of being physically located or situated at a specific place

    New Auto-Interp
    Negative Logits
    formance
    -0.75
    selves
    -0.72
    aceutical
    -0.70
    vous
    -0.68
    PH
    -0.66
    manship
    -0.65
    iframe
    -0.65
    м
    -0.64
    gans
    -0.63
    alth
    -0.63
    POSITIVE LOGITS
     home
    1.01
     least
    0.97
    onement
    0.89
     logger
    0.87
     liberty
    0.81
     Disneyland
    0.79
     Costco
    0.77
     boarding
    0.75
     sea
    0.74
     Hogwarts
    0.74
    Act Density 0.150%

    No Known Activations