INDEX
Explanations
phrases about common human experiences or traits with a focus on uniqueness
phrases indicating possession or existence related to individuals
New Auto-Interp
Negative Logits
inth
-0.82
Newsletter
-0.70
ound
-0.69
etting
-0.69
live
-0.68
edia
-0.68
QUI
-0.68
ãĤº
-0.66
sole
-0.66
cise
-0.65
POSITIVE LOGITS
flaws
0.97
biases
0.88
quirks
0.87
weaknesses
0.84
faults
0.83
differing
0.83
varying
0.83
precon
0.82
strengths
0.82
undergone
0.78
Activations Density 0.183%