INDEX
Explanations
references to individuals or people
New Auto-Interp
Negative Logits
Idy
-0.84
<<=
-0.82
**/
-0.82
purging
-0.81
()]
-0.81
>\<^
-0.80
hysema
-0.80
$.
-0.80
"]);
-0.78
ConstraintMaker
-0.78
POSITIVE LOGITS
Person
1.52
person
1.51
PERSON
1.43
person
1.43
Person
1.39
Persons
1.38
Persons
1.30
PERSON
1.27
persons
1.24
persons
1.20
Activations Density 0.013%