    Negative Logits
    Token                 Logit
    " AssemblyCompany"    -0.62
    " gezet"              -0.50
    " betaal"             -0.49
    "atoday"              -0.48
    " doPost"             -0.48
    "LookAnd"             -0.48
    "doi"                 -0.47
    " fileSize"           -0.47
    " Schicksal"          -0.47
    " gehad"              -0.46
    Positive Logits
    Token                 Logit
    "Green"               1.20
    " Green"              1.16
    " green"              1.16
    " GREEN"              1.09
    "green"               1.09
    "GREEN"               1.08
    " 💚"                 0.86
    " greens"             0.85
    "Greene"              0.83
    "💚"                  0.81
    Activation Density: 0.083%

    No Known Activations