INDEX
    Explanations

    references to specific academic or research citations, particularly in the context of studies or papers

    New Auto-Interp
    Negative Logits
    ربعة
    -0.57
    ...
    -0.54
     możliwe
    -0.54
     Trost
    -0.53
     måte
    -0.52
    bný
    -0.51
    visející
    -0.50
    vábbi
    -0.50
     sledo
    -0.50
    And
    -0.49
    POSITIVE LOGITS
     JAS
    1.42
     Jamb
    1.37
     Jy
    1.33
     Ja
    1.32
     jc
    1.32
     JF
    1.30
     JJ
    1.29
     Jes
    1.29
     JM
    1.29
     JAR
    1.29
    Act Density 0.794%

    No Known Activations