INDEX
Explanations
discussions or mentions of differing beliefs and values among individuals
New Auto-Interp
Negative Logits
<?
-0.48
labelledby
-0.47
vaz
-0.45
trono
-0.44
migrationBuilder
-0.43
continuant
-0.42
cmu
-0.42
rxjs
-0.42
番
-0.41
faits
-0.41
POSITIVE LOGITS
disagreement
1.05
divergence
1.03
incompatible
1.00
clash
1.00
differing
0.98
differences
0.97
disagreements
0.97
divergent
0.96
incompatibility
0.95
conflicting
0.93
Activations Density 0.447%