    Explanations

    Singlish and Chinese internet slang

    Negative Logits
    Token          Value
    (missing)      0.62
    (missing)      0.58
    "Analysis"     0.57
    "ليل"          0.57
    (missing)      0.57
    (missing)      0.57
    " фаразы"      0.55
    (missing)      0.55
    (missing)      0.55
    "Produto"      0.54
    Positive Logits
    Token          Value
    " "            0.66
    "!"            0.62
    " banget"      0.61
    (missing)      0.59
    (missing)      0.58
    " galore"      0.57
    " 😂"          0.54
    " !"           0.54
    " k"           0.54
    " bang"        0.54
    Act Density 0.014%

    No Known Activations