INDEX
Explanations
conjunctions followed by comparisons or explanations
instances of the phrase "more than that."
New Auto-Interp
Negative Logits
ourses
-0.84
izont
-0.75
obal
-0.74
ãĥ³ãĤ¸
-0.70
orously
-0.68
emis
-0.67
istries
-0.67
icons
-0.67
english
-0.66
zos
-0.65
POSITIVE LOGITS
pesky
1.05
happens
0.87
fateful
0.82
happened
0.81
cher
0.78
ched
0.74
.$
0.73
.
0.73
sucker
0.72
.</
0.71
Activations Density 0.079%