    Explanations

    font-related formatting features in text

    Negative Logits
    Token              Logit
    " fonts"           -0.53
    "Font"             -0.53
    " Warenkorb"       -0.52
    "ộn"               -0.52
    " moeite"          -0.51
    "ellschaft"        -0.51
    "NUMX"             -0.48
    " port"            -0.47
    "DoubleQuotes"     -0.46
    "ämpfer"           -0.46
    Positive Logits
    Token              Logit
    " GenerationType"  0.68
    "انيف"             0.63
    " Awesome"         0.63
    " awesome"         0.62
    "weight"           0.62
    "family"           0.61
    " مشين"            0.60
    "awesome"          0.60
    " للاسماء"         0.58
    "AWESOME"          0.57
    Activation Density: 0.118%
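
Assuming the density figure denotes the fraction of token positions on which the feature fires (an interpretation, not stated in the source), 0.118% would correspond to roughly 1.18 active positions per 1,000 tokens. A minimal sketch of that calculation follows; the helper name `activation_density`, the `threshold` parameter, and the sample values are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: a feature counts as "active" at a position when its
    activation is strictly greater than the threshold (0.0 by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Made-up activations for 10 token positions (illustrative only).
sample = [0.0, 0.0, 1.7, 0.0, 0.0, 0.0, 0.0, 0.3, 0.0, 0.0]
density = activation_density(sample)
print(f"{density:.3%}")  # 2 of 10 positions are active
```

On real data the list would hold one activation per token across a large corpus, so a value near 0.00118 would match the percentage reported here.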

    No Known Activations