INDEX
Explanations
repeated patterns or sequences, particularly in letters or character strings
Negative Logits
Token    Logit
Theſe    -0.95
,:);    -0.94
houſe    -0.85
₂)    -0.84
dieß    -0.82
—    -0.80
myſelf    -0.80
་་    -0.78
―――――    -0.78
)»    -0.77
Positive Logits
Token    Logit
enegro    0.81
ckner    0.63
t    0.58
t    0.57
ST    0.54
piatta    0.54
GenerationType    0.54
r    0.54
RRRR    0.53
x    0.53
Activations Density: 0.004%