INDEX
Explanations
instances of the number "two" in close proximity
instances of duality or pairs in various contexts
Negative Logits
heimer          -0.60
fears           -0.58
Schultz         -0.57
icit            -0.56
Hastings        -0.54
cynicism        -0.54
partisan        -0.52
humane          -0.52
demoral         -0.52
retribution     -0.52
Positive Logits
apiece           1.07
respective       0.79
optionally       0.73
respectively     0.73
equivalents      0.69
consecut         0.69
interchangeable  0.69
varying          0.68
randomly         0.67
grouped          0.67
Activations Density 1.492%