INDEX
    Explanations

    specific names and titles related to academic or professional roles

    Negative Logits
    askell          -0.18
    aks             -0.15
     syn            -0.14
     célib          -0.14
     Fusion         -0.14
    /gcc            -0.13
    inae            -0.13
    ascar           -0.13
     Erect          -0.13
    ollar           -0.13
    Positive Logits
     preservation   0.32
     Preservation   0.30
     ingest         0.26
     PRES           0.23
     Wayback        0.23
    rchive          0.23
    Pres            0.23
    pres            0.22
     born           0.22
     Preserve       0.21
    Activation Density: 0.008%

    No Known Activations