INDEX
Explanations
references to communication channels in various contexts
New Auto-Interp
Negative Logits
es
-0.69
swiper
-0.69
Mortimer
-0.66
י
-0.66
ydi
-0.65
slow
-0.62
y
-0.62
sourire
-0.62
McK
-0.61
ES
-0.61
POSITIVE LOGITS
channels
1.67
Channels
1.54
channels
1.53
Channels
1.52
channel
1.46
Channel
1.43
CHANNEL
1.39
CHANNEL
1.31
Channel
1.28
channe
1.28
Activations Density 0.070%