INDEX
    Explanations

    the presence of phrases indicating connections or comparisons between subjects

    New Auto-Interp
    Negative Logits
     للمعارف
    -0.52
    Geplaatst
    -0.45
     Knoblauch
    -0.43
    ñores
    -0.41
    Construct
    -0.40
     tillegg
    -0.40
     Construct
    -0.40
     męska
    -0.40
     Vors
    -0.39
     paździer
    -0.39
    POSITIVE LOGITS
    featureID
    0.54
    0.53
     CURIAM
    0.52
    reportWebVitals
    0.44
    WindowConstants
    0.43
    TAWA
    0.40
    styleType
    0.39
    __*/
    0.39
    gwt
    0.39
     nct
    0.39
    Act Density 0.931%

    No Known Activations