INDEX
Explanations
historical figures and thinkers
New Auto-Interp
Negative Logits
buyer
-1.25
ASSISTANT
-1.00
Employees
-0.98
buyers
-0.93
operator
-0.91
lettes
-0.90
techniques
-0.89
floors
-0.88
patients
-0.88
Buyers
-0.87
POSITIVE LOGITS
thinkers
2.22
writers
2.05
philosopher
1.81
philosophers
1.80
thinker
1.77
poet
1.76
sage
1.75
prophet
1.71
poets
1.64
wise
1.63
Activations Density 0.102%