INDEX
    Explanations

    phrases indicating positions or roles within organizations and their responsibilities

    New Auto-Interp
    Negative Logits
    ãĤ¤ãĤº
    -0.18
     geschichten
    -0.17
     weiber
    -0.17
    /***/
    -0.16
    agnar
    -0.15
    ÐIJÑĢÑħÑĸв
    -0.15
    Ä±ÅŁÄ±k
    -0.15
     Erotische
    -0.15
     salopes
    -0.15
    æĺŃ
    -0.15
    POSITIVE LOGITS
     
    0.19
     young
    0.16
    607
    0.16
    773
    0.16
    377
    0.16
    l
    0.16
    228
    0.16
    892
    0.16
    i
    0.16
    570
    0.16
    Act Density 0.237%

    No Known Activations