INDEX
Explanations
references to various types of cycles or cyclical patterns
New Auto-Interp
Negative Logits
myſelf
-1.31
pleaſure
-1.21
cauſe
-1.13
raiſ
-1.13
purpoſe
-1.13
ſeveral
-1.11
Reſ
-1.11
ſche
-1.11
uſed
-1.09
consultato
-1.08
POSITIVE LOGITS
cycle
0.81
Dal
0.73
cycle
0.68
dal
0.66
into
0.64
Cycle
0.63
Cycle
0.60
Dal
0.60
INTO
0.57
into
0.56
Activations Density 0.197%