INDEX
Explanations
the letters "ch" followed by a high activation value
instances of the substring "ch" within words
New Auto-Interp
Negative Logits
INGTON
-0.67
Circus
-0.62
federation
-0.57
Ajax
-0.56
Independence
-0.56
visionary
-0.56
Progress
-0.55
beyond
-0.55
NEY
-0.55
Dear
-0.55
POSITIVE LOGITS
ocobo
1.30
inese
1.30
akra
1.29
icago
1.27
isel
1.27
ieft
1.26
ipping
1.25
ivalry
1.23
attering
1.23
ipped
1.22
Activations Density 0.014%