INDEX
    Explanations

    entries related to correct and incorrect responses or answers

    New Auto-Interp
    Negative Logits
     Administrativna
    -0.68
    :
    
    -0.66
    .
    
    -0.62
    хьтан
    -0.61
    -0.61
    ):
    
    -0.60
    ();
    
    -0.60
    ());
    
    -0.57
     يتيمه
    -0.57
    ():
    
    -0.57
    POSITIVE LOGITS
    1.75
    0.60
     $\}$
    0.52
     $)$
    0.49
    AndEndTag
    0.48
     '}
    0.48
     Besøkt
    0.47
     }
    0.47
    MessageOf
    0.45
     finnas
    0.44
    Act Density 0.112%

    No Known Activations