INDEX
    Explanations

    phrases that express affirmation or correctness

    Negative Logits
     “          -0.50
     [          -0.45
     ”          -0.44
     جهت        -0.43
     post       -0.43
    ække        -0.41
     /          -0.41
     (          -0.40
     milieux    -0.40
     demo       -0.40
    Positive Logits
    AccessorTable      0.89
     Signalez          0.87
    Personensuche      0.86
     Paglinawan        0.86
    تقاوى              0.86
     المعيارى          0.85
    GEBURTSDATUM       0.84
    principalColumn    0.80
     kasarigan         0.79
    脚注の使い方        0.78
    Activation Density: 0.196%

    No Known Activations