INDEX
    Explanations

    expressions related to fear for personal safety

    concerns or fears regarding safety and well-being

    New Auto-Interp
    Negative Logits
    hess
    -0.73
    edin
    -0.70
    ratulations
    -0.68
    oller
    -0.68
    orbit
    -0.64
    ownt
    -0.63
    FORE
    -0.63
    BuyableInstoreAndOnline
    -0.63
    ohn
    -0.62
    loo
    -0.62
    POSITIVE LOGITS
    gotten
    1.16
    bidden
    1.06
     example
    0.91
    Ĥª
    0.83
     sake
    0.79
     instance
    0.75
     them
    0.75
    geries
    0.74
     reasons
    0.73
    ties
    0.72
    Act Density 0.149%

    No Known Activations