    Explanations

    phrases or sentences that introduce a new idea or aspect to a conversation

    affirmations or expressions of agreement

    Negative Logits
    Token            Logit
    "İĭ"             -0.70
    " flair"         -0.64
    "âĹ¼"            -0.64
    "illary"         -0.63
    "unal"           -0.62
    "MX"             -0.61
    " Parables"      -0.60
    " wedge"         -0.58
    " lightsaber"    -0.58
    " garage"        -0.58
    Positive Logits
    Token            Logit
    "esley"          0.99
    "come"           0.89
    "ington"         0.78
    "ards"           0.76
    "espie"          0.76
    "Coin"           0.68
    "FTWARE"         0.68
    "tenance"        0.67
    "STON"           0.67
    " suited"        0.67
    Activation Density: 0.024%

    No Known Activations