INDEX
    Explanations

    references to comparisons and contrasts between two or more entities

    New Auto-Interp
    Negative Logits
     AssemblyTitle
    -0.84
     démocr
    -0.66
    +#+#
    -0.65
     Roskov
    -0.62
    ThroughAttribute
    -0.60
     Holman
    -0.58
    riction
    -0.58
     Kinney
    -0.57
    :+:
    -0.56
    ringes
    -0.56
    POSITIVE LOGITS
     davon
    0.58
    これも
    0.58
     chúng
    0.55
    jenige
    0.55
    الإنجليزية
    0.54
     could
    0.54
     antaranya
    0.52
    ModelAndView
    0.51
     would
    0.51
     had
    0.51
    Act Density 0.357%

    No Known Activations