INDEX
    Explanations

    multiple instances of the word "this" or "these" in a text

    New Auto-Interp
    Negative Logits
    LookAnd
    -0.72
     feroit
    -0.64
    adl
    -0.60
    FieldNumber
    -0.56
    ◆◇
    -0.55
    ABUL
    -0.52
     ſever
    -0.52
    AllowUser
    -0.51
     TLR
    -0.51
    msgTypes
    -0.51
    POSITIVE LOGITS
     this
    1.05
     THIS
    1.02
    this
    0.98
    This
    0.97
     This
    0.97
    THIS
    0.90
     questa
    0.78
     этой
    0.75
     these
    0.73
     dieser
    0.73
    Act Density 0.372%

    No Known Activations