INDEX
Explanations
phrases introducing explanations or additional information
instances of the phrase "is" indicating ongoing states or conditions
New Auto-Interp
Negative Logits
itches
-0.73
actory
-0.68
umbs
-0.66
ievers
-0.64
lex
-0.63
conn
-0.63
icts
-0.62
oice
-0.61
otte
-0.60
icators
-0.60
POSITIVE LOGITS
admittedly
0.98
supposed
0.98
meant
0.95
basically
0.93
presumably
0.87
essentially
0.87
unlikely
0.87
comprised
0.86
currently
0.86
probably
0.86
Activations Density 0.130%