INDEX
Explanations
the word "robot" or related terms
keywords related to robots and robotics
Negative Logits
clair   -0.79
WAYS    -0.78
creen   -0.75
dn      -0.72
emi     -0.71
Beg     -0.70
tg      -0.70
arah    -0.69
lv      -0.68
tin     -0.68
Positive Logits
ically     0.85
anical     0.82
swarm      0.76
crawling   0.75
robot      0.72
locom      0.72
arm        0.71
overl      0.70
ocalypse   0.70
bots       0.69
Activations Density: 0.035%