INDEX
    Explanations

    political campaigns

    Negative Logits
    ospital         -0.07
    temperature     -0.06
    [blank]         -0.06
    空间            -0.06
    hospital        -0.06
    360             -0.06
    الح             -0.06
    льт             -0.06
    ajaran          -0.06
     submarine      -0.06
    Positive Logits
     yapılan        0.07
     chois          0.07
     fds            0.07
     Liebe          0.06
     downwards      0.06
     případ         0.06
    [blank]         0.06
    ěstí            0.06
     ketogenic      0.06
     anda           0.06
    Activation Density: 0.018%

    No Known Activations