INDEX
Explanations
statements and discussions related to arguments or claims made in a debate
Negative Logits
OSI              -0.54
Dina             -0.50
Whitby           -0.46
MAO              -0.46
methylene        -0.45
)•               -0.45
INA              -0.45
detectChanges    -0.45
‹                -0.45
atibility        -0.44
Positive Logits
argument         1.46
arguments        1.38
argue            1.36
argument         1.35
argued           1.29
arguments        1.27
Argument         1.27
Argument         1.27
argues           1.22
arguing          1.20
Activations Density 0.232%