    NEGATIVE LOGITS
    Token                Logit
    "väg"                -0.41
    " ASA"               -0.40
    "そうです"            -0.39
    "といけない"          -0.38
    " Könige"            -0.37
    "INTERNAL"           -0.37
    " opérés"            -0.37
    "chymal"             -0.37
    (blank in source)    -0.36
    " even"              -0.36
    POSITIVE LOGITS
    Token                Logit
    "libft"              0.64
    "RTLD"               0.63
    "wpi"                0.62
    " Vikipedi"          0.62
    "typeorm"            0.60
    "دانشنامهٔ"           0.60
    " Normdatei"         0.59
    "potent"             0.59
    "JNIEnv"             0.59
    " oprot"             0.58
    Activation Density: 0.093%

    No Known Activations