INDEX
    Explanations

    mentions of the group Boko Haram and references to Alibaba

    New Auto-Interp
    Negative Logits
    mble
    -0.89
     Yosemite
    -0.71
    Reviewer
    -0.71
    retion
    -0.68
    bnb
    -0.68
    enegger
    -0.67
    FTWARE
    -0.67
     Apollo
    -0.64
    tainment
    -0.63
    eric
    -0.63
    POSITIVE LOGITS
     Haram
    1.56
    zeb
    0.88
    aku
    0.85
    ¯
    0.83
    unin
    0.83
    ju
    0.76
    ame
    0.76
    amous
    0.74
    Ń
    0.74
    ¹
    0.73
    Act Density 0.005%

    No Known Activations