    Explanations

    phrases indicating manipulation or exploitation of circumstances or individuals

    Negative Logits
    emmel       -0.19
    agner       -0.15
    .FontStyle  -0.14
    ogn         -0.14
    ãĥ          -0.14
     padd       -0.14
    láda        -0.14
    skyt        -0.14
    mega        -0.14
    types       -0.14
    Positive Logits
     available  0.15
     Candid     0.15
     expertise  0.15
    vala        0.14
    ault        0.14
     Coast      0.14
    alth        0.14
     Studio     0.14
     Vis        0.14
     existing   0.14
    Activation Density: 0.131%

    No Known Activations