INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     автомобиля
    -0.08
     Lions
    -0.08
     ноут
    -0.08
     coronary
    -0.08
     constraints
    -0.07
     scenic
    -0.07
     गर्दै
    -0.07
     overall
    -0.07
     seleccionado
    -0.07
     commas
    -0.07
    POSITIVE LOGITS
     sect
    0.13
     factions
    0.12
     religi
    0.12
     ধর্ম
    0.11
     religiosa
    0.11
     धार्मिक
    0.11
     beliefs
    0.11
     religious
    0.11
     religions
    0.11
     religieux
    0.11
    Act Density 0.077%

    No Known Activations