INDEX
    Explanations

    elements related to specific categories or classifications, particularly in arts and academic contexts

    Negative Logits
    featureID            -1.10
     Normdatei           -0.93
    EDEFAULT             -0.84
    存于互联网档案馆     -0.83
     kaarangay           -0.82
    GenerationType       -0.82
    styleType            -0.82
    AnchorStyles         -0.81
     AssemblyCulture     -0.80
    ]")]                 -0.80
    Positive Logits
    hogy                 0.45
    input                0.42
    ồm                   0.42
     mówią               0.42
    joi                  0.40
     stay                0.40
     Jove                0.40
    AYE                  0.39
     Ovid                0.39
     forged              0.39
    Activation Density: 0.565%

    No Known Activations