INDEX
    Explanations

    expressing or indicating

    Negative Logits
    " in"           -0.57
    " as"           -0.50
    " fully"        -0.46
    " *)("          -0.46
    " for"          -0.45
    " auténtica"    -0.45
    "cend"          -0.45
    "ver"           -0.44
    "ặn"            -0.43
    "完全に"        -0.43
    Positive Logits
    "StoryboardSegue"     0.84
    "Enllaces"            0.69
    " geslacht"           0.66
    "Autoritní"           0.63
    " nakalista"          0.63
    " Italijanski"        0.63
    "rinfo"               0.62
    "tvguidetime"         0.61
    " ModelExpression"    0.61
    ""                    0.60
    Activation Density: 0.003%

    No Known Activations