INDEX
Explanations
the number "Two" or any variants of it in a text
instances of the word "two" and its variations
New Auto-Interp
Negative Logits
thouse
-0.88
enegger
-0.76
ICLE
-0.74
gerald
-0.73
leneck
-0.70
rade
-0.70
auder
-0.69
statement
-0.69
ablishment
-0.69
ontent
-0.68
POSITIVE LOGITS
thirds
2.03
halves
1.49
dozen
1.21
weeks
1.14
nd
1.12
sides
1.11
tablespoons
1.06
sets
1.02
extremes
1.02
thirds
1.02
Activations Density 0.131%