INDEX
    Explanations

    phrases containing the word "exactly" followed by a number

    New Auto-Interp
    Negative Logits
    rift
    -0.81
    ker
    -0.74
    ework
    -0.73
    itiz
    -0.68
     respectfully
    -0.65
    kers
    -0.64
    asta
    -0.64
    strong
    -0.64
    tein
    -0.63
     enthusiastically
    -0.62
    POSITIVE LOGITS
     opposite
    0.85
    ãĤ¨
    0.76
    itude
    0.69
    æ©Ł
    0.65
    actly
    0.63
     replicate
    0.62
     same
    0.62
    ãĥ¯
    0.62
     tuned
    0.62
     identical
    0.62
    Act Density 0.358%

    No Known Activations