INDEX
    Explanations

    instances of the word "this" and related phrases

    New Auto-Interp
    Head Attr Weights
    0:0.03
    1:0.02
    2:0.28
    3:0.05
    4:0.03
    5:0.04
    6:0.02
    7:0.01
    8:0.27
    9:0.12
    10:0.04
    11:0.01
    Negative Logits
    arov
    -1.28
    ONSORED
    -1.26
    ynski
    -1.25
    ウス
    -1.25
     guessed
    -1.22
     ove
    -1.21
     efforts
    -1.19
    eele
    -1.19
    heny
    -1.18
     attempts
    -1.15
    POSITIVE LOGITS
    DragonMagazine
    1.33
     Jungle
    1.25
    1.21
    rete
    1.21
     Billboard
    1.19
    advertisement
    1.18
    itaire
    1.17
     runway
    1.16
    mares
    1.15
     Gauntlet
    1.15
    Act Density 0.030%

    No Known Activations