INDEX
    Explanations

    commands or instructions in a conversation context

    dialogue that expresses requests or commands

    New Auto-Interp
    Negative Logits
     Flavoring
    -0.79
     seemingly
    -0.76
    etheless
    -0.72
    umerous
    -0.71
    ashington
    -0.71
    astical
    -0.71
    particularly
    -0.71
    respective
    -0.71
    Simply
    -0.69
    rupulous
    -0.69
    POSITIVE LOGITS
     â̦"
    1.26
     yours
    1.13
     ..."
    1.12
    â̦"
    1.10
    ..."
    1.08
     ya
    1.07
     your
    1.05
    !'"
    1.05
     fuckin
    1.04
    ?'"
    1.04
    Act Density 0.507%

    No Known Activations