INDEX
Explanations
phrases related to advocating or arguing for a particular position or viewpoint
phrases related to making arguments or presenting cases
New Auto-Interp
Negative Logits
Seym
-0.69
outage
-0.67
ummer
-0.66
Territories
-0.64
ãĥĺ
-0.64
Dresden
-0.64
onder
-0.62
kered
-0.62
typo
-0.59
ILCS
-0.58
POSITIVE LOGITS
why
0.88
Against
0.85
against
0.81
convinc
0.79
against
0.79
aneers
0.75
persuasive
0.73
compelling
0.71
agine
0.70
arguments
0.70
Activations Density 0.069%