INDEX
    Explanations

    phrases related to lifting or removing restrictions

    terms related to lifting restrictions or bans

    Negative Logits
    Token       Logit
    "present"   -0.71
    "lished"    -0.70
    "errilla"   -0.70
    "ãĥ£"       -0.68
    " Palin"    -0.64
    "915"       -0.63
    " Analy"    -0.62
    "RAL"       -0.61
    "TAG"       -0.60
    "clave"     -0.60
    Positive Logits
    Token       Logit
    " lift"     1.07
    " weights"  1.01
    " lifted"   1.00
    " lifting"  1.00
    " lifts"    0.92
    "lift"      0.88
    "hens"      0.84
    "ĸļ"        0.79
    " tremend"  0.79
    "weight"    0.78
    Act Density 0.017%
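
    The page does not say how the density figure is computed. A minimal sketch, assuming it is the fraction of sampled token positions on which the feature's activation is above zero; the function name, the threshold, and the sample data are all hypothetical and chosen only to reproduce a value of the same size:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires at all. The source page does not confirm this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 17 of 100,000 token positions.
sample = [0.0] * 99_983 + [1.5] * 17
density = activation_density(sample)
print(f"Act Density {density:.3%}")  # prints "Act Density 0.017%"
```

    Under that assumption, a density of 0.017% corresponds to roughly 17 firing positions per 100,000 tokens, which is typical of a sparse, narrowly scoped feature.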

    No Known Activations