INDEX
    Explanations

    text formatting and specific characters

    numerical values and measurements

    New Auto-Interp
    Negative Logits
    ury
    -0.68
     curator
    -0.67
    hement
    -0.67
     Dick
    -0.63
     Newsp
    -0.62
    wright
    -0.62
     veter
    -0.62
    terday
    -0.61
     intent
    -0.61
     Stew
    -0.61
    POSITIVE LOGITS
    ccording
    0.89
    Reviewed
    0.83
    BuyableInstoreAndOnline
    0.83
    é»Ĵ
    0.82
    Minecraft
    0.79
    20439
    0.78
    onnaissance
    0.78
    ãĥĺãĥ©
    0.77
    ãĥ©ãĥ³
    0.75
    displayText
    0.74
    Act Density 0.112%

    No Known Activations