INDEX
    Explanations

    the word "rather" in various contexts and forms

    New Auto-Interp
    Negative Logits
    s
    -0.17
    rades
    -0.17
    ury
    -0.16
    isphere
    -0.15
    nad
    -0.15
     Anton
    -0.14
    sr
    -0.14
    ilater
    -0.14
     Bulk
    -0.14
    URY
    -0.14
    POSITIVE LOGITS
    ìĦľëĬĶ
    0.17
    indr
    0.17
    oner
    0.16
    »¿
    0.15
    atik
    0.15
    ovÃŃ
    0.15
     rather
    0.15
    rather
    0.15
    abic
    0.14
    .ly
    0.14
    Act Density 0.015%

    No Known Activations