INDEX
    Explanations

    instances of the word "pretend."

    instances of the word "pretend" and its variations

    New Auto-Interp
    Negative Logits
     srf
    -0.69
    ccording
    -0.67
    âĨij
    -0.64
    cutting
    -0.62
    APH
    -0.61
    vez
    -0.60
    hani
    -0.59
    chains
    -0.59
    lean
    -0.59
    hner
    -0.59
    POSITIVE LOGITS
     innocence
    0.78
    ulence
    0.77
    entious
    0.74
     forgot
    0.70
    orial
    0.65
     Moose
    0.64
    ishly
    0.64
    zel
    0.63
    ensions
    0.63
    plane
    0.62
    Act Density 0.016%

    No Known Activations