Explanations
names of famous individuals
Negative Logits
Token          Logit
deval          -0.70
overwhelming   -0.68
Compare        -0.66
incon          -0.66
conventions    -0.64
modifiers      -0.63
devices        -0.62
scaven         -0.62
icing          -0.62
conver         -0.62
Positive Logits
Token          Logit
Jr             1.28
PhD            1.13
Sr             0.97
QC             0.87
Jr             0.84
pseudonym      0.83
aka            0.82
Assistant      0.81
alias          0.81
alias          0.81
Activations Density: 1.752%