INDEX
Explanations
means, channels, ways of being
Negative Logits
जनक  0.82
jen  0.77
ஜா  0.75
अनुरूप  0.74
nél  0.70
verdade  0.68
nar  0.67
jena  0.67
alert  0.67
</li>  0.66
Positive Logits
lens  1.51
lenses  1.32
channels  1.07
clenched  1.00
prisma  1.00
レンズ  0.99
Lens  0.98
means  0.92
រយៈ  0.91
osmosis  0.91
Activations Density: 0.179%