INDEX
Explanations
references to the number "two" in various contexts
references to the number two in various contexts
New Auto-Interp
Negative Logits
IAS
-0.81
brance
-0.80
urden
-0.76
atown
-0.74
¶æ
-0.72
ISTORY
-0.70
imester
-0.70
displayText
-0.69
LEASE
-0.69
erver
-0.69
POSITIVE LOGITS
distinct
1.20
different
1.19
separate
1.19
disparate
1.10
identical
1.05
strangers
0.98
contrasting
0.97
dudes
0.96
halves
0.96
pairs
0.95
Activations Density 0.192%