INDEX
    Explanations

    phrases indicating experience or qualifications

    New Auto-Interp
    Negative Logits
    eters
    -0.16
    haven
    -0.15
    oral
    -0.14
     Gallagher
    -0.14
    acy
    -0.14
    ultiply
    -0.13
     Bernard
    -0.13
    ajÄħc
    -0.13
    åĬ¡
    -0.13
    verting
    -0.13
    POSITIVE LOGITS
    ANGO
    0.16
    raki
    0.15
    ivos
    0.15
    itesse
    0.15
    çε
    0.14
    idores
    0.14
    isman
    0.14
    landa
    0.14
    lava
    0.14
    ,module
    0.14
    Act Density 0.055%

    No Known Activations