    Negative Logits
    Token               Logit
                        -0.62
     it                 -0.55
     offers             -0.54
     in                 -0.52
     its                -0.52
    '                   -0.52
     shook              -0.50
     offer              -0.50
     made               -0.48
     done               -0.48
    Positive Logits
    Token               Logit
     виправивши         0.91
    LookAnd             0.80
    findpost            0.80
     lenker             0.77
    IntoConstraints     0.73
     nahilalakip        0.72
     autorytatywna      0.65
     Normdatei          0.65
    fjspx               0.64
     externi            0.63
    Activation Density: 0.016%

    No Known Activations