INDEX
    Explanations

    words related to communication and explanations

    phrases that indicate clarification or informative communication

    New Auto-Interp
    Negative Logits
     withdrawn
    -0.77
     stunts
    -0.76
    soever
    -0.69
     coerc
    -0.69
     interfered
    -0.68
     risky
    -0.67
    yss
    -0.67
     gamble
    -0.65
     sham
    -0.65
     disobedience
    -0.65
    POSITIVE LOGITS
     concise
    1.37
     succinct
    1.33
     clarity
    1.24
     clarify
    1.20
     clearer
    1.16
     understanding
    1.16
     overview
    1.16
     outline
    1.15
     understand
    1.11
     explaining
    1.09
    Act Density 0.413%

    No Known Activations