INDEX
    Explanations

    words related to formal or regulatory contexts

    lexical items related to formal or academic contexts

    New Auto-Interp
    Negative Logits
    FFER
    -0.75
    olicy
    -0.74
    yi
    -0.73
    ortium
    -0.73
    hov
    -0.72
    lio
    -0.72
     tremend
    -0.71
    ession
    -0.70
     Chaser
    -0.68
    mble
    -0.67
    POSITIVE LOGITS
    ization
    1.03
    ibur
    1.02
    axy
    0.99
    ized
    0.99
    izing
    0.98
    istic
    0.93
    gebra
    0.91
    pha
    0.89
    gorith
    0.88
    uminum
    0.88
    Act Density 0.028%

    No Known Activations