INDEX
    Explanations

    phrases indicating uncertainty or speculation, often starting with "Who knows"

    expressions of uncertainty or curiosity

    New Auto-Interp
    Negative Logits
    ItemTracker
    -0.77
    ciating
    -0.77
    phrine
    -0.76
    herent
    -0.66
    inance
    -0.66
    etsk
    -0.64
    REM
    -0.62
    ructose
    -0.61
    packages
    -0.61
    charges
    -0.61
    POSITIVE LOGITS
     how
    0.70
     darn
    0.70
     fri
    0.67
    how
    0.67
     whats
    0.65
     scen
    0.64
     srfAttach
    0.63
     geop
    0.61
    lege
    0.61
    ãĤ½
    0.60
    Act Density 0.030%

    No Known Activations