INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    code
    0.45
    iscid
    0.44
    charge
    0.44
    urar
    0.43
    ďaka
    0.43
     noirâtre
    0.43
    if
    0.43
    urai
    0.43
    ytail
    0.42
    abon
    0.42
    POSITIVE LOGITS
     predominance
    0.60
     bevorzug
    0.57
     преимущества
    0.54
     alternatives
    0.54
     advantages
    0.53
     для
    0.52
     предпоч
    0.50
     Advantages
    0.50
     predomin
    0.49
     بیشتر
    0.49
    Act Density 0.120%

    No Known Activations