INDEX
    Explanations

    variations of the word "will" indicating certainty or expectation

    New Auto-Interp
    Negative Logits
    edu
    -0.15
    oidal
    -0.15
    chet
    -0.15
    ancellable
    -0.15
    istas
    -0.15
    illions
    -0.14
    yon
    -0.14
    elor
    -0.14
    opr
    -0.14
    well
    -0.14
    POSITIVE LOGITS
    iams
    0.38
    iam
    0.34
     be
    0.34
    s
    0.33
    l
    0.33
     likely
    0.30
    IAM
    0.28
    kommen
    0.28
    fully
    0.28
    power
    0.26
    Act Density 0.340%

    No Known Activations