INDEX
Explanations
pairs or groups of entities or topics discussed together
references to duality or items expressed in pairs
Negative Logits
Token      Logit
renheit    -0.84
ushima     -0.75
uable      -0.75
ugu        -0.75
frac       -0.72
Charg      -0.70
naire      -0.70
duc        -0.70
nect       -0.69
cast       -0.69
Positive Logits
Token      Logit
sides      1.50
sexes      1.43
parties    1.28
genders    1.23
halves     1.09
Houses     0.91
factions   0.91
Clintons   0.89
spouses    0.88
Parties    0.86
Activations Density: 0.063%