Explanations
phrases indicating tension or conflict between different entities
phrases that discuss conflicts or interactions between different groups or entities
Negative Logits
Token         Logit
bye           -0.96
nit           -0.81
OGR           -0.79
factor        -0.75
nell          -0.72
ãĤ§           -0.70
ificantly     -0.70
jong          -0.70
\\\\\\\\      -0.70
oultry        -0.69
Positive Logits
Token         Logit
sexes         1.24
genders       1.12
factions      0.91
two           0.88
combatants    0.86
halves        0.85
ourselves     0.85
spouses       0.84
them          0.83
sides         0.83
Activations Density: 0.074%