INDEX
    Explanations

    indicators of false responses or inaccuracies in information

    New Auto-Interp
    Negative Logits
     Wikimedijinoj
    -0.58
    DebuggerNonUser
    -0.56
     或
    -0.51
     تعدى
    -0.50
     definitely
    -0.49
     or
    -0.48
     bete
    -0.47
    Якщо
    -0.46
    Chham
    -0.45
    Према
    -0.45
    POSITIVE LOGITS
    BeginInit
    0.84
     שוליים
    0.64
    ]');
    0.64
    ]='\
    0.64
    >';
    
    0.62
    ](#
    0.62
     فريبيس
    0.62
    новништво
    0.61
     discusses
    0.61
    ISupport
    0.60
    Act Density 0.058%

    No Known Activations