INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.07
     {}\
    -0.07
     Ale
    -0.07
     race
    -0.06
     ale
    -0.06
    جات
    -0.06
     Cort
    -0.06
    abytes
    -0.06
    Widgets
    -0.06
    alık
    -0.06
    POSITIVE LOGITS
     Support
    0.10
     support
    0.09
     supporting
    0.08
     supporter
    0.07
     supported
    0.07
     supports
    0.07
    .person
    0.07
     SUPPORT
    0.07
     Supported
    0.07
     Supporters
    0.07
    Act Density 0.026%

    No Known Activations