INDEX
    Explanations
    NEGATIVE LOGITS
    Token            Logit
    "💕"             0.61
    "💖"             0.61
    " 💞"            0.59
    " ค่ะ"           0.57
    " fanciful"      0.57
    "💝"             0.56
    " delightful"    0.55
    " 💕"            0.55
    " darling"       0.54
    "素敵"           0.54
    POSITIVE LOGITS
    Token            Logit
    " thằng"         0.60
    " brotherhood"   0.59
    "🤙"             0.55
    "兄弟"           0.54
    "buddy"          0.53
    " buddy"         0.52
    " Deere"         0.50
    " Brotherhood"   0.50
    " Comanche"      0.50
    " Jeep"          0.49
    Activation Density: 0.001%

    No Known Activations