INDEX
    Explanations

    references to opinions, feelings, and subjective assessments

    that followed by specific words

    New Auto-Interp
    Negative Logits
    RenderAtEndOf
    -0.61
    complexContent
    -0.61
    ftagPool
    -0.53
    -0.52
     FetchType
    -0.51
    EndContext
    -0.50
     CURIAM
    -0.50
    Rptr
    -0.50
     ویکی‌پدی
    -0.49
    Географиясе
    -0.49
    POSITIVE LOGITS
    berdayakan
    0.44
    ambilan
    0.41
     orejas
    0.40
     conmigo
    0.40
    jarkan
    0.39
    rektur
    0.38
     vengan
    0.38
    ticias
    0.38
    وردار
    0.38
     desnuda
    0.38
    Act Density 0.063%

    No Known Activations