INDEX
Explanations
occurrences of the word "twice" followed by numbers or words
references to the word "twice" or concepts involving repetition
New Auto-Interp
Negative Logits
andr
-0.73
rals
-0.68
league
-0.64
endi
-0.63
uctions
-0.63
lements
-0.63
itives
-0.62
uers
-0.62
orts
-0.61
ulner
-0.60
POSITIVE LOGITS
theless
0.84
consecut
0.82
apiece
0.72
entimes
0.64
aneously
0.64
fold
0.62
consecutive
0.62
yearly
0.62
dozen
0.61
ieth
0.60
Activations Density 0.019%