INDEX
    Explanations

    phrases indicating logical proof or derivation steps

    proof definitions and explanations

    New Auto-Interp
    Negative Logits
     defaultstate
    -0.70
    ſelves
    -0.68
     للمعارف
    -0.67
     ſche
    -0.65
     queſta
    -0.62
    ckså
    -0.61
    Datuak
    -0.61
     pleaſure
    -0.61
    ThroughAttribute
    -0.61
     ujednoznacz
    -0.59
    POSITIVE LOGITS
    First
    0.45
     First
    0.43
    0.39
    The
    0.38
     For
    0.37
     Let
    0.36
     ordinaria
    0.35
     By
    0.34
    首先
    0.34
     planten
    0.34
    Act Density 0.061%

    No Known Activations