INDEX
    Explanations

    names and variations of the word "cruise"

    New Auto-Interp
    Negative Logits
    ç¿°
    -0.14
    out
    -0.14
    hood
    -0.14
    outed
    -0.14
    393
    -0.14
    ayment
    -0.14
    casts
    -0.13
     pastoral
    -0.13
    arg
    -0.13
    unes
    -0.13
    POSITIVE LOGITS
    ifix
    0.32
    ible
    0.19
    aders
    0.18
    iate
    0.17
     fixes
    0.17
    ifax
    0.17
    IFORM
    0.17
    ISING
    0.17
    ising
    0.16
    fix
    0.16
    Act Density 0.008%

    No Known Activations