INDEX
    Explanations

    phrases related to granting permission or enabling actions

    phrases indicating the ability or capacity to enable actions or functions

    New Auto-Interp
    Negative Logits
    worn
    -0.66
    ko
    -0.64
    email
    -0.63
    boy
    -0.63
    ta
    -0.62
    xon
    -0.60
    ohm
    -0.59
    kind
    -0.58
    wa
    -0.57
     Horton
    -0.57
    POSITIVE LOGITS
    geries
    0.94
     us
    0.89
    Reviewer
    0.88
    hift
    0.84
     users
    0.83
    ories
    0.80
     seamless
    0.80
    iences
    0.78
    awaru
    0.78
    icial
    0.77
    Act Density 0.092%

    No Known Activations