INDEX
Explanations
the word "ch" with varying levels of activation
instances of the substring "ch"
New Auto-Interp
Negative Logits
ERC
-0.74
GMT
-0.68
corrid
-0.67
©¶æ
-0.66
PDATE
-0.63
berman
-0.62
rall
-0.61
Soy
-0.60
monarch
-0.60
¥µ
-0.59
POSITIVE LOGITS
icago
1.22
ampions
1.19
ambers
1.18
ampion
1.12
inese
1.11
aos
1.08
amps
1.06
ocolate
1.06
amber
1.03
icken
1.02
Activations Density 0.039%