INDEX
    Explanations

    expressions of laughter or humor

    laughter and amusement sounds

    New Auto-Interp
    Negative Logits
     kasarigan
    -0.73
    ]")]
    -0.73
    __':
    
    -0.68
    __":
    
    -0.68
    "]);
    
    -0.68
    AsUp
    -0.64
    ")));
    
    -0.63
    +#+#
    -0.61
    atchewan
    -0.60
    '));
    
    -0.60
    POSITIVE LOGITS
    Haha
    0.69
     haha
    0.67
    hahaha
    0.65
    haha
    0.63
    Hahaha
    0.60
     Haha
    0.60
     laughing
    0.59
     tertawa
    0.59
     jajaja
    0.58
     laugh
    0.57
    Act Density 0.005%

    No Known Activations