INDEX
Explanations
instances where the document discusses comparisons or contrasts between two entities
references to comparisons or relationships involving two entities
New Auto-Interp
Negative Logits
vich
-0.78
xus
-0.69
renheit
-0.69
daq
-0.69
ÃĽ
-0.68
INESS
-0.68
Absent
-0.68
orney
-0.68
itness
-0.66
APTER
-0.65
POSITIVE LOGITS
halves
1.23
extremes
1.20
sexes
1.19
sides
1.03
poles
0.97
genders
0.90
parties
0.86
thirds
0.82
combatants
0.81
continents
0.81
Activations Density 0.063%