INDEX
    Explanations

    references to conspiracy and criminal activities

    New Auto-Interp
    Negative Logits
    Gratis
    -0.15
    Undefined
    -0.14
    xea
    -0.14
    λÏī
    -0.14
    .DOM
    -0.14
     olsun
    -0.13
    reative
    -0.13
    ennon
    -0.13
    licer
    -0.13
    ModelIndex
    -0.13
    POSITIVE LOGITS
     comp
    0.23
     involvement
    0.22
     involved
    0.20
    ú
    0.19
     conspiracy
    0.18
     planning
    0.18
     plot
    0.18
     coll
    0.17
     master
    0.17
     Planning
    0.17
    Act Density 0.159%

    No Known Activations