INDEX
    Explanations

    expressions of amusement and laughter

    New Auto-Interp
    Negative Logits
     ModelExpression
    -0.68
     onCancelled
    -0.66
    ('');
    
    -0.66
    "/",
    -0.63
     بيها
    -0.61
    ?";
    -0.61
    */;
    -0.59
    "},
    
    -0.59
    "],
    
    -0.59
    "):
    
    -0.59
    POSITIVE LOGITS
    haha
    0.78
    lol
    0.76
    LOL
    0.74
     LOL
    0.72
    hahaha
    0.71
    Haha
    0.71
    HAHAHAHA
    0.71
    HAHA
    0.70
     lol
    0.67
    Hahaha
    0.66
    Act Density 0.147%

    No Known Activations