INDEX
    Explanations

    demonstrative pronouns, particularly "this" and "these."

    New Auto-Interp
    Negative Logits
     Agamemnon
    -0.75
     Coff
    -0.69
     Lw
    -0.69
    ResponseDto
    -0.68
    Moe
    -0.68
     Moe
    -0.66
     Dubuque
    -0.66
     Verbs
    -0.66
    orszá
    -0.65
    enderror
    -0.65
    POSITIVE LOGITS
     this
    2.16
    this
    1.96
     THIS
    1.93
    THIS
    1.81
    This
    1.71
     This
    1.70
     questa
    1.46
     questo
    1.42
     dieses
    1.41
     esta
    1.37
    Act Density 0.365%

    No Known Activations