INDEX
    Explanations

    phrases related to uncertainty or lack of information

    words related to likelihood and the presentation of information or arguments

    Negative Logits
    Token        Logit
    " glim"      -0.66
    "loads"      -0.61
    " Been"      -0.60
    " goose"     -0.59
    " Anyway"    -0.59
    "Joined"     -0.56
    "packed"     -0.55
    " crispy"    -0.55
    " laun"      -0.54
    "Got"        -0.53
    Positive Logits
    Token        Logit
    " cannot"    1.59
    " does"      1.48
    " did"       1.46
    " do"        1.42
    "does"       1.27
    "do"         1.22
    "did"        1.21
    " DOES"      1.19
    " lacks"     1.13
    " DO"        1.13
    Activation Density: 0.831%

    No Known Activations