INDEX
Explanations
repetitive patterns starting with "Another"
instances of the word "Another," indicating a focus on reiterating or introducing additional points
New Auto-Interp
Negative Logits
hips
-0.90
ouls
-0.85
onies
-0.82
olas
-0.82
bows
-0.74
alties
-0.73
opus
-0.71
riages
-0.69
ships
-0.68
eries
-0.68
POSITIVE LOGITS
example
0.95
worldly
0.94
aspect
0.94
pecul
0.91
drawback
0.90
thing
0.88
reason
0.88
avenue
0.87
notable
0.87
notch
0.86
Activations Density 0.046%