INDEX
Explanations
instances of individuals recognized for their foundational or leading roles in various contexts
Negative Logits
½æķ°     -0.15
ä¸Ŀ      -0.14
ritt     -0.14
owler    -0.14
èĩ£      -0.14
enden    -0.13
ataire   -0.13
341      -0.13
subt     -0.13
нка      -0.13
Positive Logits
brains      0.39
driving     0.39
brains      0.32
brain       0.31
force       0.31
inst        0.30
behind      0.29
Driving     0.28
architect   0.28
brain       0.26
Activations Density 0.121%