Explanations
phrases indicating a statement or belief
assertions and claims made in a discussion or analysis
Negative Logits
MpServer      -0.77
ãĤ¼ãĤ¦ãĤ¹     -0.72
SourceFile    -0.67
Scroll        -0.66
ipers         -0.65
apsed         -0.64
sie           -0.63
Tank          -0.63
lled          -0.62
mouth         -0.62
Positive Logits
policymakers  0.90
proponents    0.85
perceptions   0.74
progressives  0.74
historians    0.73
economists    0.73
although      0.71
interpreting  0.70
comparisons   0.69
theorists     0.69
Activations Density: 0.473%