INDEX
Explanations
phrases related to the number "two"
the repetition of the word "two."
Negative Logits
ugu      -0.82
asta     -0.76
amaru    -0.72
Caption  -0.69
lus      -0.68
ashtra   -0.68
rir      -0.67
ICLE     -0.65
ULE      -0.63
needs    -0.63
Positive Logits
thirds   1.60
dozen    1.13
hundred  1.06
weeks    1.04
halves   1.02
fold     0.99
teenth   0.95
decades  0.88
teen     0.88
een      0.88
Activations Density 0.129%