INDEX
Explanations
terms related to contrast and comparisons
Negative Logits
SupportActionBar    -0.77
estekak             -0.56
LookAnd             -0.55
martyrs             -0.51
PSS                 -0.50
lagem               -0.50
oluyor              -0.50
kasarigan           -0.50
verantwoorde        -0.49
approve             -0.48
Positive Logits
conflicting         1.18
contradictory       1.15
inconsistent        1.07
contradictions      1.05
contradiction       1.03
inconsistency       1.02
inconsistencies     0.94
contradic           0.94
contradicts         0.89
contradict          0.89
Activations Density: 0.141%