INDEX
Explanations
phrases related to conflicts of interest
New Auto-Interp
Negative Logits
élevées
-0.81
dedans
-0.79
enterOuterAlt
-0.76
réduite
-0.75
sauvages
-0.74
pousser
-0.72
StructEnd
-0.69
chrétienne
-0.69
efficaces
-0.68
régulière
-0.68
POSITIVE LOGITS
conflict
1.68
conflicts
1.68
Conflicts
1.57
Conflict
1.53
Conflicts
1.40
CONFLICT
1.37
conflict
1.35
conflic
1.30
Conflict
1.27
conflits
1.24
Activations Density 0.088%