INDEX
Explanations
instances of agreement or disagreement in the text
New Auto-Interp
Negative Logits
Rox
-0.69
Koz
-0.69
Bul
-0.68
Dal
-0.67
="../
-0.65
Mal
-0.65
dal
-0.62
presence
-0.62
IN
-0.61
</h2>
-0.61
POSITIVE LOGITS
Agree
1.64
agrees
1.54
Disagree
1.50
Agre
1.48
agree
1.41
Agreed
1.40
agree
1.39
Agreements
1.37
Agree
1.37
AGRE
1.37
Activations Density 0.101%