INDEX
Explanations
phrases indicating a comparison or contrast
phrases that emphasize the word "so" as a modifier to convey varying degrees of emphasis or comparison
New Auto-Interp
Negative Logits
Aren
-0.64
neum
-0.63
MAP
-0.62
Ethics
-0.60
(>
-0.59
osterone
-0.58
Encyclopedia
-0.57
witz
-0.57
IED
-0.56
ertodd
-0.55
POSITIVE LOGITS
much
1.11
lucky
0.97
forgiving
0.94
subtly
0.91
easy
0.90
fortunate
0.88
simple
0.88
bered
0.87
easily
0.86
bad
0.85
Activations Density 0.035%