INDEX
    Explanations

    articles and determiners in various forms

    New Auto-Interp
    Negative Logits
    principalColumn
    -0.52
     ſtate
    -0.50
     pleaſure
    -0.49
     Majefty
    -0.47
    abstractmethod
    -0.46
    شنبه
    -0.45
     auffi
    -0.45
     ftate
    -0.45
     ſte
    -0.44
     TextInputType
    -0.44
    POSITIVE LOGITS
     einer
    2.13
     một
    2.06
     einen
    2.03
     isang
    2.02
     einem
    2.00
     sebuah
    1.99
     een
    1.96
     eine
    1.96
    一个
    1.87
     یک
    1.84
    Act Density 0.298%

    No Known Activations