INDEX
    Explanations
    Negative Logits
     V    -0.98
          -0.95
     VC   -0.89
     VF   -0.87
    Vl    -0.86
    VB    -0.85
     VT   -0.84
     VV   -0.83
     VB   -0.82
    Vp    -0.82
    Positive Logits
     Ponta       0.54
     Nerven      0.52
     assoluto    0.52
    Q            0.52
     kirke       0.51
     häls        0.50
    umumkan      0.50
     falschen    0.50
    getBounding  0.49
     amélior     0.49
    Act Density 0.148%

    No Known Activations