INDEX
Explanations
names of specific individuals
proper nouns, specifically names of individuals and entities
New Auto-Interp
Negative Logits
spring
-0.85
respons
-0.76
ename
-0.75
ceed
-0.72
juven
-0.71
raising
-0.71
uate
-0.70
Crusader
-0.70
oral
-0.70
eatures
-0.70
POSITIVE LOGITS
Wheeler
0.89
\\\\\\\\
0.86
Wheel
0.78
HEAD
0.71
inson
0.70
Merit
0.70
andowski
0.68
shred
0.68
pad
0.68
leigh
0.67
Activations Density 0.025%