INDEX
Explanations
names and biographical information about individuals
instances of individuals associated with specific actions or characteristics
New Auto-Interp
Negative Logits
unbeliev
-0.71
reating
-0.69
ivating
-0.69
guiName
-0.67
pattern
-0.65
urers
-0.62
commit
-0.62
acceptable
-0.61
enticing
-0.61
.–
-0.60
POSITIVE LOGITS
herself
0.83
stint
0.73
unsuccessfully
0.72
çīĪ
0.72
retiring
0.72
)]
0.72
oversee
0.71
consulted
0.68
chaired
0.68
pseudonym
0.68
Activations Density 0.317%