INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     estekak
    -0.62
    NameInMap
    -0.60
    tableFuture
    -0.56
     ddelwed
    -0.54
     onCreateView
    -0.54
    Rptr
    -0.54
    ReusableCell
    -0.54
    newtheorem
    -0.53
     laſſen
    -0.53
    onacci
    -0.52
    POSITIVE LOGITS
     black
    2.06
    black
    2.00
     BLACK
    1.57
    Black
    1.57
    BLACK
    1.53
     Black
    1.50
     blacks
    1.34
     schwarze
    1.22
    1.20
     blackness
    1.20
    Act Density 0.006%

    No Known Activations