Explanations
mentions of the name "Ken" with varying levels of activation
instances of the name "Ken"
Negative Logits
pestic      -0.73
exha        -0.72
pmwiki      -0.71
NRS         -0.71
ENCY        -0.69
accompan    -0.67
seeker      -0.66
ADS         -0.66
Positive    -0.66
Bahá        -0.65
Positive Logits
ken         1.29
burgh       1.01
azi         0.93
ners        0.91
wich        0.91
feld        0.90
kens        0.87
igans       0.87
berry       0.86
nect        0.86
Activations Density: 0.005%