INDEX
    Explanations

    technical academic texts

    New Auto-Interp
    Negative Logits
     gay
    -0.08
    -0.08
     prachtige
    -0.08
    Vintage
    -0.08
     glorious
    -0.08
     breast
    -0.08
    疯狂
    -0.08
     vrouwen
    -0.07
     vintage
    -0.07
     hals
    -0.07
    POSITIVE LOGITS
     navigation
    0.09
     Boll
    0.09
     вып
    0.09
     Técnico
    0.08
     تجا
    0.08
    containers
    0.08
     اند
    0.08
     فرو
    0.07
     paved
    0.07
    navigation
    0.07
    Act Density 0.000%

    No Known Activations