INDEX
    Explanations

    references to authors and their works

    Negative Logits
    Token       Logit
    "гов"       -0.15
    "izona"     -0.14
    "cai"       -0.14
    "흥"        -0.14
    "ToLocal"   -0.14
    ".flink"    -0.13
    "enu"       -0.13
    "ター"      -0.13
    "창"        -0.13
    "Česk"      -0.13
    Positive Logits
    Token        Logit
    " Did"       0.33
    " Martial"   0.30
    " Virgin"    0.29
    " Christ"    0.28
    " Bapt"      0.28
    " Domin"     0.28
    "Did"        0.27
    " Math"      0.26
    " Cloth"     0.26
    "Virgin"     0.25
    Act Density 0.051%

    No Known Activations