INDEX
Explanations
contractions of specific phrases like "don't" and variations where "t" is used instead of an apostrophe
negative contractions indicating rejection or refusal
New Auto-Interp
Negative Logits
accompan
-0.69
ocative
-0.66
Darling
-0.61
ricular
-0.58
forms
-0.57
rising
-0.57
anni
-0.56
leck
-0.55
itaire
-0.55
ò
-0.54
POSITIVE LOGITS
ourselves
1.38
know
0.91
ird
0.91
condone
0.91
underestimate
0.87
need
0.86
expect
0.84
bsite
0.81
asel
0.80
hear
0.79
Activations Density 0.104%