INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ://"
    -0.66
     sekä
    -0.53
     past
    -0.53
     newly
    -0.50
     +
    
    -0.50
    "));
    
    -0.49
    cmath
    -0.49
    ждую
    -0.48
     ✓
    -0.48
     bislang
    -0.48
    POSITIVE LOGITS
     anyway
    3.56
    anyway
    3.28
     Anyway
    3.26
    Anyway
    3.14
     anyways
    3.01
     Anyways
    2.63
     anyhow
    2.62
    Anyways
    2.60
    Anyhow
    2.09
     comunque
    1.45
    Act Density 0.091%

    No Known Activations