INDEX
    Explanations

    prepositions commonly used in various contexts

    Negative Logits
    Token                 Logit
    (token not shown)     -1.35
    يتيمه                 -1.22
    <unused52>            -1.17
    <unused79>            -1.16
    <unused68>            -1.16
    <unused14>            -1.16
    <unused8>             -1.16
    <unused16>            -1.16
    <unused3>             -1.16
    [@BOS@]               -1.16
    Positive Logits
    Token                 Logit
    .                     0.94
    (token not shown)     0.93
    (token not shown)     0.81
    1                     0.77
    (                     0.77
    0                     0.73
    2                     0.72
    '                     0.72
    ↵↵                    0.71
    "                     0.68
    Act Density 1.614%

    No Known Activations