    Explanations

    grammar and sentence structure

    Negative Logits
    (token not captured)    -0.08
    " favor"                -0.08
    "favor"                 -0.08
    " ROW"                  -0.07
    "pcb"                   -0.07
    "Sr"                    -0.07
    "ROWS"                  -0.07
    " Favor"                -0.07
    "709"                   -0.07
    "rego"                  -0.07
    Positive Logits
    " sentences"            0.11
    (token not captured)    0.11
    "Sentence"              0.11
    "_sentence"             0.10
    " sentence"             0.10
    " משפט"                 0.10
    " subordinate"          0.10
    " Sentence"             0.10
    " bağlant"              0.09
    " বাক"                  0.09
    Act Density 0.025%

    No Known Activations