INDEX
    Explanations

    phrases that introduce alternatives or comparisons

    New Auto-Interp
    Negative Logits
    ittel
    -0.18
    using
    -0.16
    etÃŃ
    -0.16
    ercial
    -0.16
    ammen
    -0.15
    ÙĪÛĮÙĨت
    -0.15
     adipiscing
    -0.15
    psc
    -0.14
    ITTE
    -0.14
    ÃŃna
    -0.14
    POSITIVE LOGITS
    agan
    0.18
    isan
    0.18
    s
    0.17
     Fair
    0.15
    ally
    0.15
    acular
    0.15
    ator
    0.15
    acy
    0.15
    ant
    0.15
     fair
    0.15
    Act Density 0.014%

    No Known Activations