INDEX
Explanations
terms related to spatial directions, specifically 'left' and 'right'
references to directional terms and their relationships
New Auto-Interp
Negative Logits
ĸļ
-0.82
aton
-0.72
odium
-0.71
onite
-0.70
acus
-0.70
gat
-0.69
ynt
-0.69
orious
-0.68
wana
-0.67
exempt
-0.67
POSITIVE LOGITS
sided
0.96
hemisphere
0.92
wing
0.92
flank
0.91
aligned
0.83
lateral
0.82
aisle
0.82
side
0.81
handed
0.80
triangles
0.79
Activations Density 0.069%