INDEX
Explanations
phrases indicating a quantity or group of items
quantities and variations, particularly focusing on the words "few" and "numerous."
Negative Logits
istan         -0.85
ESE           -0.83
anwhile       -0.77
SPONSORED     -0.75
said          -0.73
ared          -0.72
Constructed   -0.71
ale           -0.70
ARS           -0.70
ARY           -0.70
Positive Logits
interesting     1.27
surprises       1.18
ways            1.05
advantages      1.04
modifications   1.04
exciting        1.03
useful          1.03
variations      1.00
notable         0.99
noteworthy      0.99
Activations Density: 0.197%