INDEX
Explanations
phrases indicating emphasis or importance
the preposition "in" and its contextual significance
New Auto-Interp
Negative Logits
indal
-0.88
skirts
-0.88
FTWARE
-0.78
mber
-0.77
duction
-0.74
igers
-0.72
ptives
-0.71
APTER
-0.69
lees
-0.68
iris
-0.68
POSITIVE LOGITS
paraph
0.84
essence
0.74
guessed
0.72
paradox
0.72
quote
0.71
aggregate
0.68
plaus
0.68
surprisingly
0.67
incidentally
0.67
probability
0.66
Activations Density 0.125%