INDEX
    Explanations

    Teams and place names

    Negative Logits
    Token           Logit
    " soud"         -0.07
    " violation"    -0.06
    ".pix"          -0.06
    ".dc"           -0.06
    " librarian"    -0.06
    "_near"         -0.06
    ".sound"        -0.06
    "\tcuda"        -0.06
    " jeszcze"      -0.06
    " Bedroom"      -0.06
    Positive Logits
    Token           Logit
    "_tweets"       0.07
    "traditional"   0.07
    ".coroutines"   0.06
    " svg"          0.06
    " رئيس"         0.06
    (not rendered)  0.06
    ".m"            0.06
    " framed"       0.06
    " RVA"          0.06
    " martin"       0.06
    Activation Density: 0.131%

    No Known Activations