INDEX
    Explanations

    words or phrases suggesting doubt or concern

    terms related to questionable or dubious practices and situations

    New Auto-Interp
    Negative Logits
    eding
    -0.86
    ften
    -0.85
    olon
    -0.84
    OVA
    -0.82
    eded
    -0.78
    hower
    -0.77
    elf
    -0.75
    ynthesis
    -0.74
    hner
    -0.73
    iser
    -0.73
    POSITIVE LOGITS
     questionable
    1.09
     dubious
    1.04
     legality
    0.91
     suspic
    0.77
     foul
    0.70
     spurious
    0.68
     commodities
    0.68
     Mub
    0.67
     Thomson
    0.66
     antiquity
    0.66
    Act Density 0.009%

    No Known Activations