INDEX
Explanations
statements summarizing or analyzing arguments
phrases and concepts that relate to arguments and their structure
New Auto-Interp
Negative Logits
lbs
-0.67
roy
-0.66
Temperature
-0.61
hello
-0.61
dies
-0.60
Roses
-0.59
Crew
-0.59
Kobe
-0.58
LOS
-0.58
ggies
-0.57
POSITIVE LOGITS
methodological
0.89
empir
0.84
empirical
0.82
scholarly
0.80
descriptive
0.79
informative
0.78
factual
0.77
quotations
0.77
promulg
0.77
persuasive
0.77
Activations Density 2.116%