INDEX
    Explanations

    game strategies

    New Auto-Interp
    Negative Logits
    žit
    -0.09
    Importance
    -0.08
    opio
    -0.08
    pect
    -0.08
     Importance
    -0.08
    importance
    -0.07
    \d
    -0.07
     transc
    -0.07
     mentorship
    -0.07
    ņem
    -0.07
    POSITIVE LOGITS
     forced
    0.09
     مجبور
    0.09
     хав
    0.08
     compelled
    0.08
     "="
    0.08
    ('.
    0.08
    628
    0.08
    _press
    0.08
    ('.');↵
    0.08
     منا
    0.08
    Act Density 0.033%

    No Known Activations