INDEX
    Explanations

    phrases related to discussions, opinions, evaluations, and reactions

    punctuation marks, particularly commas, within a text

    Negative Logits
    ãĤ´ãĥ³    -1.03
    è£ħ    -0.87
    çīĪ    -0.75
    ãģ®å    -0.74
    ãĥ¯    -0.74
    éŃĶ    -0.73
    vre    -0.70
    %:    -0.70
    OVA    -0.68
    ãĤ¨ãĥ«    -0.67
    Positive Logits
     yeah    1.20
     whereas    1.08
     because    1.01
     obviously    0.99
     frankly    0.99
     but    0.98
     blah    0.96
     which    0.95
     [    0.94
     okay    0.94
    Activation Density: 0.286%

    No Known Activations
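
Several of the tokens under Negative Logits (for example ãĤ´ãĥ³ or éŃĶ) look like byte-level BPE token strings rather than readable text. The sketch below shows one way to turn such strings back into text. It assumes the tokens come from a GPT-2-style byte-level tokenizer, which this page does not state. The helper names `byte_level_table` and `decode_token` are hypothetical and used only for illustration.

```python
def byte_level_table():
    """Build the byte -> printable-character table used by GPT-2-style
    byte-level BPE (assumption: this is the tokenizer behind the tokens)."""
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    offset = 0
    for b in range(256):
        if b not in printable:
            # Non-printable bytes are shifted to code points 256 and above.
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in byte_level_table().items()}


def decode_token(token):
    """Map each displayed character back to its byte, then decode as UTF-8.
    Incomplete byte sequences become U+FFFD instead of raising an error."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĤ´ãĥ³", "è£ħ", "çīĪ", "ãĥ¯", "éŃĶ", "ãĤ¨ãĥ«"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, the garbled entries decode to Japanese and CJK characters. Plain ASCII tokens such as vre or OVA pass through unchanged. A token that ends partway through a multi-byte character, as ãģ®å appears to, decodes with a trailing replacement character.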