INDEX
Explanations
phrases containing the words "old-fashioned" followed by a variety of different words and undertakings
references to outdated ideologies or styles
New Auto-Interp
Negative Logits
Moreno
-0.75
sinks
-0.74
istg
-0.73
lashes
-0.72
proceeds
-0.71
moder
-0.70
DRAG
-0.70
anca
-0.69
latitude
-0.69
ï¸ı
-0.67
POSITIVE LOGITS
fashioned
1.25
style
1.12
sounding
1.09
angled
1.08
looking
1.04
generation
1.04
day
0.98
derived
0.95
Soviet
0.95
old
0.94
Activations Density 0.078%