Explanations
phrases indicating comparison or addition, with an emphasis on the word "as"
the word "as" used in various contexts throughout the document
Negative Logits
Token       Logit
ager        -0.80
eks         -0.80
erb         -0.76
reply       -0.74
iddle       -0.74
oder        -0.74
god         -0.74
rones       -0.73
anos        -0.72
amon        -0.72
Positive Logits
Token       Logit
ensuring    0.85
assorted    0.84
those       0.84
allowing    0.79
providing   0.78
being       0.78
occasional  0.77
other       0.74
some        0.74
optionally  0.74
Activations Density: 0.072%