INDEX
    Explanations

    phrases affirming statements or knowledge

    instances of the word "this"

    New Auto-Interp
    Negative Logits
    Ń·
    -0.80
    »Ĵ
    -0.78
    ickets
    -0.78
    pps
    -0.77
    BuyableInstoreAndOnline
    -0.75
    ARS
    -0.74
    veyard
    -0.74
    å§«
    -0.72
    oking
    -0.71
    aughtered
    -0.71
    POSITIVE LOGITS
     trope
    1.09
     phenomenon
    1.04
     applies
    1.00
     week
    0.97
     discrepancy
    0.94
     happens
    0.91
     tactic
    0.88
     pecul
    0.88
     possibility
    0.88
     behaviour
    0.88
    Act Density 0.085%

    No Known Activations