INDEX
    Explanations

    sentences structured around the subject "it" that contains assessments or claims

    New Auto-Interp
    Negative Logits
    aise
    -0.18
    odont
    -0.17
     themselves
    -0.16
    499
    -0.15
     whom
    -0.15
     à¤īनà¤ķ
    -0.15
     himself
    -0.14
     their
    -0.14
    898
    -0.14
    oints
    -0.14
    POSITIVE LOGITS
     itself
    0.31
     its
    0.30
     Its
    0.27
    Its
    0.26
    its
    0.21
    å®ĥ们
    0.19
    iner
    0.17
    ï¼Įå®ĥ
    0.17
     коÑĤоÑĢое
    0.17
     Ñıке
    0.17
    Act Density 0.199%

    No Known Activations