INDEX
    Explanations

    instances of the word "this" in various contexts

    New Auto-Interp
    Head Attr Weights
    0:0.03
    1:0.02
    2:0.11
    3:0.06
    4:0.18
    5:0.03
    6:0.11
    7:0.17
    8:0.04
    9:0.05
    10:0.07
    11:0.07
    Negative Logits
     Bridges
    -1.43
    accompanied
    -1.36
    acht
    -1.32
     devoted
    -1.29
    Publisher
    -1.29
    naire
    -1.26
     Achievement
    -1.23
    AMA
    -1.22
     Transparency
    -1.22
    adian
    -1.21
    POSITIVE LOGITS
    opolis
    1.55
    umph
    1.46
     regenerate
    1.41
    onga
    1.40
     fray
    1.40
     pse
    1.37
    iframe
    1.35
    ople
    1.35
    inus
    1.34
     reckoning
    1.33
    Act Density 0.001%

    No Known Activations