INDEX
    Explanations

    a repeated expression of "these" followed by various numerical indicators

    New Auto-Interp
    Negative Logits
    ament
    -0.16
    agra
    -0.15
    atha
    -0.15
    ooter
    -0.15
    096
    -0.15
    anto
    -0.14
     Hab
    -0.14
    atti
    -0.14
     εÏĢί
    -0.14
    unn
    -0.14
    POSITIVE LOGITS
    gue
    0.16
    vail
    0.14
    erule
    0.14
    isd
    0.14
    ingleton
    0.14
     zas
    0.14
    gow
    0.13
    goog
    0.13
    UID
    0.13
    коз
    0.13
    Act Density 0.026%

    No Known Activations