INDEX
Explanations
words related to comparison, evaluation, or emphasis
words that indicate emphasis or intensity in statements
New Auto-Interp
Negative Logits
士
-0.68
available
-0.61
anium
-0.60
cream
-0.59
taboola
-0.58
Micro
-0.56
enium
-0.55
unta
-0.55
etus
-0.55
worker
-0.55
POSITIVE LOGITS
unsurprisingly
0.55
Woodward
0.53
Schiff
0.53
McCull
0.52
Miliband
0.51
Harriet
0.50
Starr
0.50
Sharif
0.49
Rosenberg
0.49
unsuccessfully
0.49
Activations Density 0.942%