INDEX
    Explanations

    references to decision-making or choice in a context involving gameplay or tasks

    New Auto-Interp
    Negative Logits
    oux
    -0.15
    esen
    -0.13
    -FIRST
    -0.13
     Passing
    -0.13
    ouv
    -0.13
    orney
    -0.13
    še
    -0.13
    .Constraint
    -0.13
    yb
    -0.13
     rizik
    -0.13
    POSITIVE LOGITS
     mid
    0.55
    mid
    0.43
     during
    0.36
     Mid
    0.36
    Mid
    0.35
    éĢĶ
    0.33
     midterm
    0.33
     middle
    0.33
    _mid
    0.33
    during
    0.33
    Act Density 0.224%

    No Known Activations