INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     необхідно
    -0.07
    .nc
    -0.06
     Novel
    -0.06
     Zo
    -0.06
    _NAV
    -0.06
     bp
    -0.06
     microscopic
    -0.06
     فرو
    -0.06
     Zar
    -0.06
     Sala
    -0.06
    POSITIVE LOGITS
     Off
    0.08
    Out
    0.07
    0.07
    OFF
    0.07
    off
    0.07
    :^(
    0.07
     coeffs
    0.07
    Off
    0.07
    Away
    0.07
     AIR
    0.07
    Act Density 0.014%

    No Known Activations