INDEX
    Explanations

    various forms of punctuation, particularly quotation marks and hashtags

    New Auto-Interp
    Negative Logits
     moder
    -0.70
     signaled
    -0.69
     respons
    -0.68
     conflicted
    -0.68
     âĸº
    -0.66
     refreshing
    -0.66
     bias
    -0.66
     elic
    -0.65
     flex
    -0.65
     additionally
    -0.64
    POSITIVE LOGITS
    whatever
    1.45
    etc
    1.38
    soDeliveryDate
    1.22
    sea
    1.09
    beaut
    1.08
    vill
    1.06
    super
    1.06
    short
    1.05
    lon
    1.05
    country
    1.04
    Act Density 0.184%

    No Known Activations