    Explanations

    instances of individuals recognized for their foundational or leading roles in various contexts

    Negative Logits
    Token          Logit
    "½æķ°"         -0.15
    "ä¸Ŀ"          -0.14
    "ritt"         -0.14
    "owler"        -0.14
    "èĩ£"          -0.14
    "enden"        -0.13
    "ataire"       -0.13
    "341"          -0.13
    " subt"        -0.13
    "нка"          -0.13

    Positive Logits
    Token          Logit
    " brains"       0.39
    " driving"      0.39
    "brains"        0.32
    " brain"        0.31
    " force"        0.31
    " inst"         0.30
    " behind"       0.29
    " Driving"      0.28
    " architect"    0.28
    "brain"         0.26

    Activation Density: 0.121%

    No Known Activations