INDEX
    Explanations

    references to personal experiences and emotions

    Negative Logits
    ".↵         -0.88
    /−          -0.86
     Мексичка   -0.84
    SharedDtor  -0.81
    .•          -0.81
    "}>         -0.81
    ."],        -0.80
    %";         -0.80
     NSCoder    -0.80
    .           -0.79
    Positive Logits
     lol        1.91
     haha       1.85
     LOL        1.69
     hehe       1.68
     ;)         1.64
     hahaha     1.60
     :)         1.60
    lol         1.59
     Haha       1.59
     ;-)        1.49
    Activation Density 0.663%

    No Known Activations