INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Hwy
    -0.07
    ти
    -0.07
     business
    -0.07
    works
    -0.07
    Y
    -0.06
    ------+------+
    -0.06
     jail
    -0.06
     conductor
    -0.06
     School
    -0.06
     Σχ
    -0.06
    POSITIVE LOGITS
    (pg
    0.07
    dragon
    0.06
     crawling
    0.06
    _rc
    0.06
    rası
    0.06
     RECE
    0.06
    _domains
    0.06
     scraping
    0.06
     Phrase
    0.06
     hippoc
    0.06
    Act Density 0.011%

    No Known Activations