    Explanations

    instances of the word "only" to emphasize limitations or exclusivity

    Negative Logits
    Token        Logit
    "bourg"      -0.15
    "antt"       -0.14
    "ypes"       -0.14
    " пут"       -0.14
    "stry"       -0.14
    "ldkf"       -0.14
    "arges"      -0.13
    "lement"     -0.13
    " Petty"     -0.13
    "ant"        -0.13
    Positive Logits
    Token          Logit
    "udo"          0.17
    "앙"           0.14
    "oth"          0.14
    " ем"          0.14
    ".Constraint"  0.14
    "fans"         0.14
    "beg"          0.14
    "armac"        0.14
    "heim"         0.14
    "ament"        0.14
    Activation Density: 0.076%

    No Known Activations