INDEX
    Explanations

    Ambiguous reasoning scenarios

    New Auto-Interp
    Negative Logits
     Lu
    -0.08
    ############################################################################
    -0.08
    #if
    -0.08
    -0.07
     Không
    -0.07
    _requests
    -0.07
    .Servlet
    -0.07
    ----------------------------------------------------------------
    -0.07
     unfor
    -0.07
     reto
    -0.07
    POSITIVE LOGITS
    wab
    0.08
    Zend
    0.08
    cas
    0.07
     pertains
    0.07
    istri
    0.07
    pera
    0.07
     Damascus
    0.07
    Che
    0.07
     Cardiff
    0.07
     Bosnia
    0.07
    Act Density 0.198%

    No Known Activations