INDEX
    Explanations

    mentions of specific ideas or concepts stated with emphasis or certainty

    New Auto-Interp
    Negative Logits
    izont
    -0.78
    ãĥ³ãĤ¸
    -0.76
    izens
    -0.75
    greg
    -0.74
    sts
    -0.69
    ãĥ¼ãĥĨ
    -0.69
    ãĥ¯ãĥ³
    -0.69
    ulner
    -0.69
    imaru
    -0.68
    apolis
    -0.67
    POSITIVE LOGITS
     happens
    1.37
     happened
    1.21
     occurs
    1.15
     translates
    1.04
     happen
    1.00
     proves
    0.94
     occurred
    0.94
     applies
    0.91
     coincides
    0.90
     settles
    0.90
    Act Density 0.069%

    No Known Activations