INDEX
    Explanations

    references to prominent individuals and their actions or statements

    Negative Logits
     localidad
    -0.37
     réc
    -0.36
     térm
    -0.35
     économ
    -0.34
    зонта
    -0.33
     Waite
    -0.32
    Foreground
    -0.32
    Modern
    -0.32
     ladrillo
    -0.32
     Jerusalén
    -0.32
    Positive Logits
    webElement
    0.75
    PMailer
    0.61
     ligiloj
    0.55
     الرياضيه
    0.52
    RectangleBorder
    0.51
     BorderSide
    0.50
    期刊论文
    0.50
     masked
    0.49
    0.48
     serpentine
    0.47
    Activation Density 0.052%

    No Known Activations