INDEX
    Explanations

    numerical values, particularly those related to percentages and ratings

    Negative Logits
    Token     Logit
    ophon     -0.79
    ĸļ        -0.77
    paio      -0.76
    sein      -0.71
    phis      -0.68
    yle       -0.68
    chio      -0.65
    icago     -0.64
    ichick    -0.64
    ei        -0.63
    Positive Logits
    Token     Logit
    th        0.97
    isher     0.94
    ishers    0.92
    ishing    0.92
    %-        0.91
    00        0.91
    60        0.89
    %         0.89
    50        0.88
    %:        0.88
    Activation Density: 0.044%

    No Known Activations