INDEX
Explanations
the word "again" as a recurring pattern across different contexts
the repeated phrase "again."
New Auto-Interp
Negative Logits
rament
-0.76
anooga
-0.68
ottage
-0.68
kees
-0.67
izons
-0.65
Jugg
-0.64
eties
-0.64
avez
-0.61
company
-0.61
vice
-0.60
POSITIVE LOGITS
nces
0.77
forth
0.76
ESV
0.71
Burr
0.67
theless
0.66
Thomson
0.66
ctr
0.65
nir
0.65
Transcript
0.65
repeats
0.64
Activations Density 0.022%