INDEX
    Explanations

    phrases indicating contradictions or clarifications

    New Auto-Interp
    Negative Logits
    openg
    -0.40
     legale
    -0.40
    FIS
    -0.38
    OCCURRED
    -0.38
    ต่อ
    -0.38
    nościo
    -0.38
     ainda
    -0.37
    AsUp
    -0.37
    usul
    -0.36
    nabla
    -0.36
    POSITIVE LOGITS
    oredCriteria
    0.81
     oprot
    0.75
    Попис
    0.71
     estekak
    0.71
    Kanpo
    0.69
     propOrder
    0.65
    ukunft
    0.64
     ProtoMessage
    0.63
     NSCoder
    0.63
      (
    0.62
    Act Density 0.337%

    No Known Activations