INDEX
    Explanations

    mentions of Nigeria and its derivatives

    New Auto-Interp
    Negative Logits
    ought
    -0.15
    avax
    -0.15
    _sg
    -0.15
    etsk
    -0.14
    ipse
    -0.14
    uggage
    -0.14
     ÙģØ§Ø±
    -0.14
    ERG
    -0.14
    enticate
    -0.13
    Ñħови
    -0.13
    POSITIVE LOGITS
     Delta
    0.18
    lum
    0.16
     delta
    0.16
    ati
    0.15
    wins
    0.15
    ischer
    0.15
    /problem
    0.15
    okoj
    0.14
     Twin
    0.14
    /problems
    0.14
    Act Density 0.007%

    No Known Activations