INDEX
    Explanations

    instances of the word "this" and related demonstrative terms

    New Auto-Interp
    Negative Logits
    Moe
    -0.65
     Lawton
    -0.61
    ResponseDto
    -0.60
     Agamemnon
    -0.59
     Coff
    -0.57
     Moe
    -0.57
     Malone
    -0.56
    ाष
    -0.56
     Auvergne
    -0.56
     Verbs
    -0.56
    POSITIVE LOGITS
     this
    1.66
     THIS
    1.65
    THIS
    1.56
    this
    1.49
    This
    1.39
     This
    1.38
    Этот
    1.20
     بهذا
    1.19
     dieses
    1.17
     denna
    1.17
    Act Density 0.364%

    No Known Activations