INDEX
    Explanations

    specific entities such as institutions, locations, and organizations

    names of institutions, places, or significant entities

    New Auto-Interp
    Negative Logits
     oneself
    -0.96
     extrad
    -0.71
    theless
    -0.65
    etheless
    -0.65
     sender
    -0.63
     yourself
    -0.63
     deaf
    -0.63
     muted
    -0.63
     Hond
    -0.63
     USPS
    -0.62
    POSITIVE LOGITS
    ometown
    0.83
     Address
    0.79
     Advisory
    0.79
    Wallet
    0.77
     Crimes
    0.75
     Series
    0.75
    Series
    0.73
    ertation
    0.72
     Strategy
    0.72
    irlfriend
    0.72
    Act Density 0.323%

    No Known Activations