INDEX
    Explanations

    emoticons, symbols, and markers indicating order or importance in content

    New Auto-Interp
    Negative Logits
    '>{
    -0.71
    "],
    
    -0.69
    ']))
    
    -0.69
    )))));
    -0.68
    "];
    
    -0.68
    "])
    
    -0.64
    )();
    -0.64
    "},
    
    -0.64
    ()};
    -0.64
    "]);
    
    -0.63
    POSITIVE LOGITS
    2.53
     ✨
    1.86
    ✨:
    1.69
    :✨
    1.57
    ✨✨
    1.50
    ⭐️
    1.06
    🌟
    1.04
    💕
    1.02
    💫
    1.02
    💖
    0.95
    Act Density 0.043%

    No Known Activations