INDEX
Explanations
concepts related to attraction and connection
New Auto-Interp
Negative Logits
peria
-0.17
dej
-0.16
ombine
-0.15
uien
-0.15
woff
-0.15
agini
-0.15
riba
-0.14
á»Ļt
-0.14
uler
-0.14
lossen
-0.14
POSITIVE LOGITS
attracted
0.64
drawn
0.64
attraction
0.53
attracts
0.45
attract
0.44
Draw
0.42
grav
0.41
attr
0.40
draw
0.40
attractions
0.40
Activations Density 0.114%