    NEGATIVE LOGITS
    Token            Logit
    "wicklung"       -0.07
    " rib"           -0.07
    "ัปดาห"           -0.06
    " Bilim"         -0.06
    "-div"           -0.06
    ".Argument"      -0.06
    " Evolution"     -0.06
    " rum"           -0.06
    " river"         -0.06
    "11"             -0.06
    POSITIVE LOGITS
    Token            Logit
    " contact"       0.18
    " Contact"       0.15
    " contacts"      0.14
    "-contact"       0.13
    "Contact"        0.13
    "contact"        0.12
    " contacting"    0.11
    " CONTACT"       0.10
    "CONTACT"        0.10
    " contacted"     0.10
    Activation Density: 0.018%

    No Known Activations