INDEX
    Explanations

    introductory phrases and contextual setups in academic or informational texts

    New Auto-Interp
    Negative Logits
     Administrativna
    -0.96
     незавершена
    -0.89
    WriteBarrier
    -0.88
    AndEndTag
    -0.87
     protoimpl
    -0.86
     Italijani
    -0.86
    تقاوى
    -0.85
     Савезне
    -0.85
     للمعارف
    -0.83
    styleType
    -0.83
    POSITIVE LOGITS
    1
    0.28
    [
    0.27
     oscu
    0.26
    alignSelf
    0.24
    ​​
    0.24
    2
    0.24
    <em>
    0.24
    0.24
    0.23
    VStack
    0.23
    Act Density 0.039%

    No Known Activations