INDEX
    Explanations

    Islam/Muslim

    New Auto-Interp
    Negative Logits
    -0.07
     explor
    -0.07
     food
    -0.07
    ρίας
    -0.06
    -com
    -0.06
     konce
    -0.06
    _co
    -0.06
     щоб
    -0.06
    ीट
    -0.06
    ۸
    -0.06
    POSITIVE LOGITS
     perl
    0.07
    objectManager
    0.07
     Xml
    0.06
     separators
    0.06
    perl
    0.06
    usercontent
    0.06
     ultra
    0.06
    ******
    ↵
    0.06
    sendMessage
    0.06
     POST
    0.06
    Act Density 0.008%

    No Known Activations