Explanations
references to communication channels in various contexts
Negative Logits
])));    -0.56
]){      -0.54
*        -0.51
"])      -0.51
'])      -0.51
"]);     -0.50
}))      -0.49
')))     -0.49
}</      -0.48
})`      -0.48
Positive Logits
channel     1.63
Channel     1.56
channel     1.54
Channel     1.48
channels    1.47
Channels    1.42
CHANNEL     1.40
channels    1.39
Channels    1.38
CHANNEL     1.31
Activations Density: 0.048%