INDEX
    Explanations

    phrases indicating initial thoughts or actions

    statements that express initial thoughts or impressions

    New Auto-Interp
    Negative Logits
    etheless
    -0.83
    contin
    -0.78
    sports
    -0.70
    verend
    -0.68
    arians
    -0.68
     notwithstanding
    -0.66
    interrupted
    -0.65
    ichever
    -0.64
    mble
    -0.64
     therein
    -0.63
    POSITIVE LOGITS
     introdu
    0.85
     glance
    0.70
    aceae
    0.66
     hitch
    0.66
    20439
    0.65
     whiff
    0.63
     sensation
    0.62
    ITNESS
    0.62
     introduction
    0.62
    buquerque
    0.61
    Act Density 0.428%

    No Known Activations