INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    mittel
    -0.08
    orrelation
    -0.08
    ibor
    -0.08
    viso
    -0.08
    orpen
    -0.08
    ployee
    -0.08
    ochemical
    -0.07
    or
    -0.07
    vector
    -0.07
    ployees
    -0.07
    POSITIVE LOGITS
     Ergeb
    0.08
     chicken
    0.08
     кроме
    0.08
    _Save
    0.08
     wata
    0.08
    0.07
     þar
    0.07
    God
    0.07
     kam
    0.07
     collapsed
    0.07
    Act Density 0.002%

    No Known Activations