    Explanations

    words related to transformation or change

    Negative Logits
    Token                Logit
    "parsedMessage"      -0.60
    "RTEE"               -0.55
    "AutoField"          -0.52
    "featureID"          -0.52
    "UrlResolution"      -0.48
    "InjectAttribute"    -0.46
    " argint"            -0.45
    "tagHelperRunner"    -0.45
    " وتسجيلات"          -0.44
    " esternos"          -0.44
    Positive Logits
    Token                Logit
    " trans"             0.86
    " Trans"             0.73
    "Trans"              0.62
    "trans"              0.59
    " TRANS"             0.50
    " тран"              0.47
    "TRANS"              0.45
    " atlantic"          0.42
    "vese"               0.41
    " transgender"       0.40
    Activation Density: 0.198%

    No Known Activations