INDEX
    Explanations

    words indicating presence in challenging or critical situations

    New Auto-Interp
    Negative Logits
    ilage
    -0.16
    寿
    -0.16
    iber
    -0.16
    /GPL
    -0.15
    ادÙħ
    -0.15
    ibern
    -0.15
     devant
    -0.14
    -regexp
    -0.13
    ederland
    -0.13
    ноз
    -0.13
    POSITIVE LOGITS
    ror
    0.15
    abus
    0.15
    ness
    0.14
    otto
    0.14
    foot
    0.14
    aben
    0.14
    rene
    0.14
    agh
    0.14
    agate
    0.13
    abella
    0.13
    Act Density 0.009%

    No Known Activations