INDEX
Explanations
phrases related to supporting or refuting research findings
phrases that indicate support or validation of certain viewpoints or theories
New Auto-Interp
Negative Logits
sembly
-0.85
actionGroup
-0.85
toget
-0.83
abouts
-0.79
Cod
-0.75
phabet
-0.74
yip
-0.70
Events
-0.69
atra
-0.64
meet
-0.64
POSITIVE LOGITS
hypothesis
1.57
claim
1.56
assertion
1.54
notion
1.54
contention
1.51
premise
1.44
theory
1.38
thesis
1.38
assumption
1.34
proposition
1.34
Activations Density 0.309%