INDEX
Explanations
names of individuals
proper nouns, specifically names of individuals
New Auto-Interp
Negative Logits
captcha
-0.57
LEASE
-0.52
nonviolent
-0.52
cients
-0.51
idle
-0.51
buzzing
-0.50
vanishing
-0.50
').
-0.50
thrilling
-0.49
accomplishment
-0.48
POSITIVE LOGITS
*,
0.94
QC
0.92
,
0.85
!,
0.85
?,
0.84
Jr
0.79
,
0.77
Sr
0.77
,,
0.77
,[
0.76
Activations Density 0.199%