INDEX
    Explanations

    terms related to the application of principles or concepts in various contexts

    New Auto-Interp
    Negative Logits
    /umd
    -0.17
    ColumnInfo
    -0.16
    ANCH
    -0.15
    ür
    -0.15
    aln
    -0.15
    iska
    -0.14
    nech
    -0.14
    anch
    -0.14
    aksi
    -0.14
    ovel
    -0.14
    POSITIVE LOGITS
    å±ĭ
    0.16
    929
    0.14
    .measure
    0.14
    consistent
    0.14
    ụ
    0.13
    è°±
    0.13
     critique
    0.13
    292
    0.13
    εÏĨ
    0.13
    858
    0.13
    Act Density 0.068%

    No Known Activations