INDEX
Explanations
names of individuals
proper nouns, particularly names
Negative Logits
complete      -0.55
PDATE         -0.55
equality      -0.53
conference    -0.53
category      -0.53
piring        -0.53
REE           -0.53
Newtown       -0.52
Breaker       -0.52
RTX           -0.52
Positive Logits
's            1.44
himself       1.24
herself       1.01
Productions   0.99
ís            0.87
remembers     0.86
Sr            0.85
Jr            0.85
Enterprises   0.82
Himself       0.81
Activations Density 0.232%