INDEX
Explanations
contradictory or contrasting statements
Negative Logits
Token                   Logit
Fairchild               -0.79
ftagPool                -0.79
CanadaChoose            -0.78
(token text missing)    -0.78
disambiguazione         -0.76
">$                     -0.75
?>>                     -0.75
brainly                 -0.74
Portale                 -0.73
היתה                    -0.72
Positive Logits
Token                   Logit
However                 2.08
However                 1.93
But                     1.61
But                     1.45
Nevertheless            1.30
Therefore               1.27
Therefore               1.26
Nevertheless            1.25
Furthermore             1.21
Furthermore             1.18
Activations Density: 0.067%