INDEX
    Explanations

    phrases indicating someone is new to a platform or subject

    New Auto-Interp
    Negative Logits
    èĨ
    -0.16
    MISS
    -0.15
     acc
    -0.14
     Barton
    -0.14
    665
    -0.14
    mos
    -0.14
    eut
    -0.14
    eph
    -0.14
    monic
    -0.14
     net
    -0.13
    POSITIVE LOGITS
    bish
    0.17
    itere
    0.15
    ürger
    0.15
    oplevel
    0.15
    ling
    0.14
     paddingRight
    0.14
    ÃŃÅĻ
    0.14
    ograd
    0.14
    apons
    0.14
    ienen
    0.14
    Act Density 0.034%

    No Known Activations