INDEX
Explanations
names of people and their titles or roles
commas in lists of names and titles
New Auto-Interp
Negative Logits
receptors
-0.71
stereotypes
-0.70
idepress
-0.63
heights
-0.63
dreams
-0.63
othal
-0.62
expectations
-0.61
calendar
-0.61
process
-0.61
subconscious
-0.60
POSITIVE LOGITS
meanwhile
0.86
QC
0.85
Jr
0.82
Sr
0.81
LLP
0.76
MD
0.76
spokeswoman
0.73
bom
0.73
aka
0.72
pictured
0.72
Activations Density 0.225%