INDEX
Explanations
mentions of the word "Clark" with varying levels of activation
the name "Clark" and its various instances in the text
New Auto-Interp
Negative Logits
choes
-0.94
76561
-0.93
urity
-0.89
ntil
-0.87
awaru
-0.81
orescence
-0.77
URI
-0.75
olitan
-0.73
exempt
-0.73
ItemTracker
-0.71
POSITIVE LOGITS
stown
1.02
Clark
1.01
ston
0.94
Kent
0.87
obyl
0.84
Clark
0.83
Yards
0.82
County
0.82
dale
0.79
anan
0.79
Activations Density 0.005%