INDEX
    Explanations

    expressions of approval or satisfaction

    New Auto-Interp
    Negative Logits
    :",
    -0.79
    )");
    
    -0.78
    ?”.
    -0.78
    :");
    
    -0.77
     whoſe
    -0.77
    ."));
    -0.77
     PEN
    -0.76
    ■■
    -0.76
     Pitman
    -0.75
    )"),
    -0.75
    POSITIVE LOGITS
     aswell
    0.89
     nahilalakip
    0.84
    mybatisplus
    0.75
    väl
    0.71
     also
    0.70
     cũng
    0.64
     Aussi
    0.63
     CreateTagHelper
    0.60
    parsedMessage
    0.59
     fantastique
    0.59
    Act Density 0.057%

    No Known Activations