Explanations
words related to ambition and pushing oneself to achieve goals
Negative Logits
SEA       -0.78
cise      -0.74
nces      -0.72
ublic     -0.71
FORE      -0.70
spring    -0.70
Faces     -0.66
/-        -0.66
×Ļ×       -0.66
aced      -0.66
Positive Logits
boundaries    1.22
envelope      1.17
wedge         1.02
buttons       1.02
limits        0.94
harder        0.92
onward        0.89
agendas       0.87
button        0.85
pedal         0.84
Activations Density 0.181%