INDEX
Explanations
mentions of cycles or repetitive patterns
references to cycles
New Auto-Interp
Negative Logits
inez
-0.93
essors
-0.87
orage
-0.83
avez
-0.82
amina
-0.81
imus
-0.76
oaded
-0.76
azeera
-0.75
iciency
-0.74
initions
-0.74
POSITIVE LOGITS
cycle
1.17
Cycle
0.99
cycle
0.94
cycles
0.93
cycles
0.91
erg
0.85
cles
0.84
repeats
0.77
Delay
0.73
alternating
0.73
Activations Density 0.020%