INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     ACLU
    0.51
     Gandh
    0.48
    0.46
     bialgebras
    0.46
    0.46
     Tjiwarl
    0.45
     paralysie
    0.44
     Insight
    0.43
     Inputs
    0.43
     idealism
    0.43
    POSITIVE LOGITS
     meat
    1.32
    🥩
    1.26
    meat
    1.21
     meats
    1.19
    🍖
    1.19
     মাংস
    1.15
    Meat
    1.14
     butcher
    1.10
    1.10
     मांस
    1.07
    Act Density 0.192%

    No Known Activations