INDEX
Explanations
words related to spatial positioning and movement
New Auto-Interp
Negative Logits
ancial
-0.79
Bal
-0.73
antine
-0.71
unin
-0.69
cients
-0.68
accompan
-0.68
allo
-0.67
Fourth
-0.67
anos
-0.66
emporary
-0.65
POSITIVE LOGITS
xual
0.71
EMENT
0.71
println
0.69
dden
0.68
ned
0.67
stretched
0.63
loud
0.60
puff
0.59
oeuv
0.58
=\"
0.58
Activations Density 0.043%