INDEX
    Explanations

    occurrences of the word "introduction."

    New Auto-Interp
    Negative Logits
    obe
    -0.15
    uela
    -0.15
    Sharper
    -0.15
     Frontier
    -0.15
    oen
    -0.14
    inki
    -0.14
    vae
    -0.14
    во
    -0.14
    antan
    -0.14
    ova
    -0.14
    POSITIVE LOGITS
    ductory
    0.20
    ezi
    0.15
    olley
    0.15
    strup
    0.15
    EMS
    0.15
    ży
    0.15
    emarks
    0.14
    chyb
    0.14
    aliz
    0.14
    optgroup
    0.14
    Act Density 0.019%

    No Known Activations