INDEX
    Explanations

    the presence of the word 'a' and its variations in different contexts

    Negative Logits
     Vereinigte     -0.40
     nakalista      -0.38
     δή             -0.36
    JNIEnv          -0.35
    plaat           -0.35
    ("}");          -0.34
    دانشنامهٔ       -0.34
    JRadioButton    -0.34
     Osna           -0.34
    =$?             -0.33
    Positive Logits
    Após            0.87
     After          0.87
    After           0.82
     fter           0.81
     Após           0.81
     Etter          0.77
    after           0.75
     after          0.74
    andafter        0.74
    Etter           0.74
    Activation Density: 0.008%

    No Known Activations