    Explanations

    phrases related to disguising or camouflaging

    instances of words related to masquerade or disguise

    Negative Logits
     RELEASE       -0.65
     CSV           -0.63
     NEC           -0.62
     fertility     -0.60
     conversions   -0.59
     RSS           -0.59
     Pace          -0.58
     Stern         -0.58
     Schumer       -0.58
     Bei           -0.58
    Positive Logits
    quer       1.10
    querade    1.03
    ading      1.00
    ía         0.90
    aders      0.86
    ior        0.84
    ader       0.79
    idon       0.77
    ware       0.76
    mop        0.76
    Act Density 0.024%

    No Known Activations