INDEX
Explanations
phrases related to conflicts of interest
New Auto-Interp
Negative Logits
realistas
-0.79
dedans
-0.77
Edm
-0.76
réaliste
-0.76
chrétienne
-0.76
citoy
-0.75
réduite
-0.74
pousser
-0.73
varandra
-0.73
élevées
-0.73
POSITIVE LOGITS
conflict
2.33
conflicts
2.21
Conflict
2.10
conflict
1.94
Conflicts
1.94
Conflict
1.83
CONFLICT
1.73
Conflicts
1.71
conflits
1.66
conflit
1.65
Activations Density 0.093%