INDEX
Explanations
references to individuals' professional accomplishments and roles
New Auto-Interp
Negative Logits
Printf
-0.15
raits
-0.15
embark
-0.14
inee
-0.14
Declared
-0.13
köln
-0.13
urus
-0.13
درÛĮ
-0.13
Typed
-0.13
_userdata
-0.13
POSITIVE LOGITS
lead
0.28
co
0.28
spear
0.27
serve
0.24
served
0.23
help
0.22
serves
0.22
worked
0.22
helped
0.22
leads
0.22
Activations Density 0.534%