    Negative Logits
    Token               Logit
    "psons"             -0.69
    "fought"            -0.65
    "adeloupe"          -0.64
    "/***/"             -0.63
    "ɵɵelementEnd"      -0.60
    "jutnya"            -0.59
    "authier"           -0.59
    " různ"             -0.58
    "strips"            -0.57
    " Beatty"           -0.57
    Positive Logits
    Token               Logit
    " unless"           2.49
    "unless"            2.35
    " Unless"           2.34
    "Unless"            2.33
    "除非"              1.51
    " kecuali"          0.89
    " necessariamente"  0.86
    "LESS"              0.85
    "InputBorder"       0.81
    " without"          0.79
    Activation Density: 0.039%

    No Known Activations