INDEX
    Explanations

    mentions of the word "use" and its variations in different contexts

    New Auto-Interp
    Negative Logits
    hoff
    -0.72
    si
    -0.68
    owship
    -0.66
    iae
    -0.66
    soon
    -0.64
    cellence
    -0.64
    pton
    -0.64
    watching
    -0.63
    ikhail
    -0.62
    orld
    -0.62
    POSITIVE LOGITS
    fully
    1.09
     condoms
    0.96
     aliases
    0.95
    FUL
    0.89
     contraceptives
    0.86
    ful
    0.86
     computers
    0.85
     them
    0.83
     tools
    0.83
     caution
    0.78
    Act Density 0.101%

    No Known Activations