    Negative Logits
    Token                   Logit
    " delicate"             -0.08
    " Tottenham"            -0.08
    "upa"                   -0.08
    "tụ"                    -0.07
    " pertinent"            -0.07
    (token text missing)    -0.07
    "702"                   -0.07
    " Manchester"           -0.07
    (token text missing)    -0.07
    " Ohr"                  -0.07
    Positive Logits
    Token                   Logit
    " Bloody"               0.08
    " screaming"            0.08
    " ayam"                 0.08
    " prom"                 0.08
    " Emotional"            0.08
    " gord"                 0.08
    " betrayed"             0.07
    " انگی"                 0.07
    "comed"                 0.07
    " வே"                   0.07
    Activation Density: 0.003%

    No Known Activations