INDEX
Explanations
quotations in text
phrases that include the word "as" used to draw comparisons or quotations
New Auto-Interp
Negative Logits
yright
-0.74
width
-0.71
itiveness
-0.66
eeee
-0.65
oller
-0.64
Duration
-0.62
rower
-0.62
osc
-0.62
leness
-0.62
iton
-0.61
POSITIVE LOGITS
follows
1.04
phy
0.83
pires
0.83
criptions
0.80
pell
0.79
soon
0.78
pire
0.78
ĪĴ
0.77
pired
0.77
well
0.76
Activations Density 0.123%