INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     plant
    -1.20
     Plant
    -0.98
    Plant
    -0.83
     PLANT
    -0.83
    plant
    -0.78
    erweise
    -0.64
    Зноскі
    -0.63
    PLANT
    -0.60
    Kaynakça
    -0.60
    ITOS
    -0.60
    POSITIVE LOGITS
     axioms
    0.60
     Samaria
    0.60
    bbene
    0.59
    PerformLayout
    0.58
     psychoanalysis
    0.57
     capitals
    0.57
     autoradio
    0.55
     Shakspeare
    0.55
    ConstraintMaker
    0.54
     Englishmen
    0.53
    Act Density 0.198%

    No Known Activations