INDEX
    Explanations

    instances of speech or citations from individuals

    Negative Logits
     للاسماء    -0.87
     ब्रेकडाउन    -0.84
     disambiguazione    -0.72
    parsedMessage    -0.71
    ImageContext    -0.70
    errHandler    -0.69
     kaarangay    -0.69
    ConstraintMaker    -0.67
    RegressionTest    -0.67
    ValueStyle    -0.67
    Positive Logits
     mencionar    0.35
     mencion    0.31
     répé    0.31
     sürd    0.31
     sekali    0.30
     souverain    0.29
     gafas    0.28
     Datenschutzer    0.28
     romántico    0.28
     mencionado    0.28
    Act Density 0.010%

    No Known Activations