INDEX
    Explanations

    phrases indicating causation or potential outcomes

    New Auto-Interp
    Negative Logits
    antMatchers
    -0.65
     للمعارف
    -0.64
    ArrowToggle
    -0.63
     beginnetje
    -0.61
    ValueGenerated
    -0.58
     AssemblyCompany
    -0.57
    GeneratedCode
    -0.55
     mergeFrom
    -0.54
    stości
    -0.52
     '\\;'
    -0.51
    POSITIVE LOGITS
     idea
    2.08
     notion
    1.91
     fact
    1.70
    idée
    1.50
    idea
    1.48
     concept
    1.47
     ideia
    1.43
     possibility
    1.40
     idée
    1.40
     assumption
    1.33
    Act Density 0.605%

    No Known Activations