INDEX
    Explanations

    instances of the word "the" and its variations in context, indicating a focus on definite articles

    New Auto-Interp
    Negative Logits
    بوابة
    -0.83
    InputBorder
    -0.73
     useCallback
    -0.70
    rawDesc
    -0.69
     חיצוניים
    -0.68
    OptionsMenu
    -0.67
     onOptions
    -0.67
     pitié
    -0.66
     veu
    -0.64
    Hentet
    -0.63
    POSITIVE LOGITS
     with
    1.22
    with
    1.11
     With
    1.07
     WITH
    1.05
    WITH
    0.98
     avec
    0.96
    With
    0.96
    dengan
    0.83
     dengan
    0.76
     Avec
    0.74
    Act Density 0.246%

    No Known Activations