INDEX
Explanations
proper nouns starting or containing "Wil" with varying activation for different patterns
variations of the name "Wilbur" and associated names
New Auto-Interp
Negative Logits
zona
-0.80
20439
-0.73
drm
-0.70
uctive
-0.70
UID
-0.69
ccording
-0.69
htaking
-0.67
hiba
-0.67
è¦ļéĨĴ
-0.66
MAT
-0.66
POSITIVE LOGITS
stad
0.95
Webb
0.90
fleet
0.75
beck
0.72
Howell
0.69
Anderson
0.68
burg
0.68
Clark
0.68
Weld
0.67
Wales
0.66
Activations Density 0.106%