INDEX
    Explanations

    different shades and types of colors

    New Auto-Interp
    Negative Logits
     blue
    -0.19
     yellow
    -0.17
     Yellow
    -0.16
     zwarte
    -0.15
     purple
    -0.15
     pink
    -0.15
     redd
    -0.15
    esch
    -0.14
    insics
    -0.14
     YELLOW
    -0.14
    POSITIVE LOGITS
    ãĥįãĥ«
    0.15
     Platt
    0.15
     ÑĥÑģлов
    0.15
    çĽĬ
    0.14
    inish
    0.14
    ¶Į
    0.14
    ç»į
    0.14
    .workflow
    0.14
    ityEngine
    0.13
    ritz
    0.13
    Act Density 0.065%

    No Known Activations