    NEGATIVE LOGITS
    Token          Logit
                   -0.07
    " Voj"         -0.07
    "QUIT"         -0.06
    " ethers"      -0.06
    " rit"         -0.06
    "concat"       -0.06
    " Rit"         -0.06
    " Yo"          -0.06
    " Obt"         -0.06
    "arı"          -0.06
    POSITIVE LOGITS
    Token          Logit
    " School"      0.14
    " school"      0.14
    "School"       0.13
    " schools"     0.13
    " Schools"     0.11
    " SCHOOL"      0.10
    "school"       0.09
    "-school"      0.09
    "_school"      0.08
    " schooling"   0.08
    Activation Density: 0.030%

    No Known Activations