INDEX
    Explanations

    numerical values presented in a fraction format

    New Auto-Interp
    Negative Logits
     
    -0.15
     Johnny
    -0.14
    FB
    -0.14
     gains
    -0.14
     tortured
    -0.14
     Yet
    -0.14
    ola
    -0.13
    ên
    -0.13
    æĭ
    -0.13
     mu
    -0.13
    POSITIVE LOGITS
    .infinity
    0.16
    ofday
    0.16
    .vaadin
    0.16
    reich
    0.15
    âĹıâĹı
    0.15
    айÑĤ
    0.15
    _quant
    0.15
    iban
    0.15
     اÙĦÙĨÙĩ
    0.15
    noinspection
    0.15
    Act Density 0.010%

    No Known Activations