INDEX
    Explanations

    phrases that describe actions and situations involving oneself in various contexts

    prepositions indicating position or situation

    New Auto-Interp
    Negative Logits
    rar
    -0.70
    heny
    -0.68
     Feature
    -0.67
    liament
    -0.67
     sqor
    -0.66
    ï¸ı
    -0.66
    gap
    -0.65
    ctory
    -0.64
    illary
    -0.62
    payer
    -0.61
    POSITIVE LOGITS
    isner
    0.65
     nomine
    0.65
     pant
    0.63
    swer
    0.63
     toile
    0.60
    peror
    0.59
     hunted
    0.59
     Valiant
    0.59
     Diet
    0.58
     selves
    0.58
    Act Density 0.191%

    No Known Activations