INDEX
    Explanations

    sentences with financial or quantitative information

    Negative Logits
    Token    Logit
    ,        -0.23
    :        -0.18
    1        -0.17
    130      -0.16
    ,↵       -0.16
    ۲        -0.15
    ۱        -0.14
    leigh    -0.14
    180      -0.14
    cient    -0.14
    Positive Logits
    Token    Logit
    00       0.61
    95       0.48
    50       0.45
    oo       0.43
    99       0.40
    90       0.39
    80       0.38
    75       0.37
    85       0.35
    60       0.34
    Act Density 0.032%

    No Known Activations