INDEX
    Explanations

    numbers, particularly in a mathematical or statistical context

    New Auto-Interp
    Negative Logits
    دانشنامهٔ
    -1.07
    ]='\
    -0.98
    ']))
    
    -0.97
    ,:);
    -0.96
    ']],
    -0.95
    ]');
    -0.93
    ]]:
    -0.93
    }")
    
    -0.92
    <bos>
    -0.91
    ')],
    -0.90
    POSITIVE LOGITS
    2
    2.03
    3
    1.37
    4
    1.30
    1
    1.16
    0
    1.11
    5
    1.07
    6
    1.06
    8
    0.98
    7
    0.95
    nd
    0.90
    Act Density 1.678%

    No Known Activations