INDEX
    Explanations

    frequent conjunctions in the text

    New Auto-Interp
    Negative Logits
    ilda
    -0.18
    562
    -0.16
     block
    -0.16
    hin
    -0.15
    å¯
    -0.15
    olk
    -0.15
    uteur
    -0.14
    ERO
    -0.14
    ÏĦη
    -0.14
     tin
    -0.14
    POSITIVE LOGITS
    exus
    0.16
    lesi
    0.14
     Policies
    0.14
    agged
    0.13
    ovie
    0.13
    anus
    0.13
     Söz
    0.13
     amat
    0.13
    itas
    0.13
    opa
    0.13
    Act Density 0.002%

    No Known Activations