    Negative Logits
    Token           Logit
    " Ans"          -0.07
    "165"           -0.07
    "Moreover"      -0.07
    "163"           -0.07
    " cherished"    -0.06
    " ponder"       -0.06
    " Verse"        -0.06
    " يس"           -0.06
    "166"           -0.06
    "','#"          -0.06
    Positive Logits
    Token           Logit
    " equipment"    0.13
    " Equipment"    0.11
    " equipos"      0.10
    "Equipment"     0.09
    "equipment"     0.08
    " euth"         0.08
    " εγκα"         0.08
    " evacuate"     0.08
    "team"          0.08
    " EQUI"         0.08
    Act Density 0.016%

    No Known Activations