INDEX
Explanations
phrases related to experiences and accomplishments
expressions related to enjoyment and significant life experiences
New Auto-Interp
Negative Logits
ults
-0.72
olitics
-0.72
vae
-0.69
state
-0.67
geist
-0.66
izes
-0.65
ENCE
-0.65
ucer
-0.65
cedes
-0.64
Gi
-0.64
POSITIVE LOGITS
navigating
1.65
figuring
1.54
photograp
1.50
designing
1.50
organising
1.47
extracting
1.46
constructing
1.43
owning
1.40
discovering
1.39
assembling
1.39
Activations Density 0.794%