INDEX
    Explanations

    statements or claims made by researchers or experts

    New Auto-Interp
    Negative Logits
    abay
    -0.15
    ellas
    -0.15
    аÑĢам
    -0.14
    etty
    -0.14
    ersen
    -0.14
    ÑĨеÑĢ
    -0.14
    erala
    -0.14
    etten
    -0.13
     elektron
    -0.13
    Ñĥ
    -0.13
    POSITIVE LOGITS
     Prof
    0.27
     Professor
    0.25
     Dr
    0.24
     professor
    0.21
     prof
    0.21
    Professor
    0.20
    Dr
    0.18
    Prof
    0.18
     PROF
    0.17
    prof
    0.17
    Act Density 0.205%

    No Known Activations