INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    akh
    -0.16
    velle
    -0.16
    tam
    -0.15
     Ãľst
    -0.15
    bben
    -0.14
     dame
    -0.14
    ëĤľ
    -0.14
    EB
    -0.14
    eb
    -0.14
     Calling
    -0.14
    POSITIVE LOGITS
    .com
    0.40
    .COM
    0.28
    .edu
    0.27
     com
    0.23
    .org
    0.21
    =com
    0.20
    .invalid
    0.18
    .gmail
    0.17
    .gov
    0.17
    com
    0.16
    Act Density 0.007%

    No Known Activations