INDEX
    Explanations

    Statements and questions about correctness and understanding.

    Assertions in which someone claims to be correct or right about something.

    Negative Logits
    "positories"       -0.35
    " gea"             -0.33
    " circ"            -0.30
    "casus"            -0.30
    "AxisAlignment"    -0.29
    "kär"              -0.29
    " Asbury"          -0.29
    "trauma"           -0.28
    " ComVisible"      -0.28
    " inkább"          -0.28
    Positive Logits
    " correct"         2.63
    " wrong"           2.41
    " Correct"         2.39
    "Correct"          2.39
    "correct"          2.36
    " incorrect"       2.22
    " CORRECT"         2.17
    "wrong"            2.09
    " Wrong"           2.08
    " WRONG"           2.00
    Act Density 0.729%

    No Known Activations