INDEX
Explanations
descriptions or summaries of various items or situations
instances of the word "described."
Negative Logits
ffic                 -0.84
assi                 -0.75
otion                -0.72
eland                -0.72
acus                 -0.72
cess                 -0.70
purse                -0.67
ggies                -0.67
externalActionCode   -0.65
vernment             -0.65
Positive Logits
descriptions   0.91
describ        0.87
described      0.85
REDACTED       0.82
embodiments    0.80
paragraphs     0.80
describes      0.80
aloud          0.77
ĸļ             0.75
outlines       0.72
Activations Density 0.018%