INDEX
Explanations
phrases that mention the word "both" followed by some common theme or concept
references to the concept of "both" in various contexts
Negative Logits
uably      -0.81
nowhere    -0.76
naire      -0.73
ļ          -0.71
lied       -0.70
odore      -0.68
uable      -0.67
din        -0.67
WATCHED    -0.67
ertodd     -0.67
Positive Logits
sexes       1.74
sides       1.51
genders     1.49
halves      1.34
parties     1.04
Houses      0.98
extremes    0.97
coasts      0.96
ends        0.93
directions  0.85
Activations Density: 0.059%