INDEX
    Explanations

    mathematical concepts and proofs related to contradictions

    Negative Logits
    å½         -0.16
    365        -0.16
     sposób    -0.14
    ej         -0.14
    ıb         -0.14
     noc       -0.13
    Ñģа        -0.13
    les        -0.13
    ibox       -0.13
     Gra       -0.13
    Positive Logits
    feit       0.17
    azzo       0.15
    elix       0.14
    šet        0.14
    ledi       0.14
    _throw     0.14
    inaire     0.14
    elry       0.14
    ival       0.14
    дÑĥ        0.13
    Act Density 0.046%

    No Known Activations