    Negative Logits
     Rune          -0.08
     bullion       -0.08
     (blank)       -0.08
     Ilu           -0.08
     curioso       -0.08
     Small         -0.08
     CRUD          -0.08
     tiny          -0.08
     Humanities    -0.08
     бай           -0.08
    Positive Logits
     divorce        0.13
     amic           0.12
     heartbreak     0.12
     devastated     0.11
     Divorce        0.11
     మాజీ           0.11
     dismant        0.11
     быв            0.11
     ಮಾಜಿ           0.10
     goodbye        0.10
    Activation Density: 0.051%

    No Known Activations