INDEX
Explanations
variations of the word "diverge" or its derivatives within the text
New Auto-Interp
Negative Logits
tings
-0.17
ings
-0.16
odel
-0.16
makers
-0.16
INGS
-0.15
quir
-0.15
ACS
-0.15
itioner
-0.15
IGNED
-0.15
ÙĪÙĨد
-0.14
POSITIVE LOGITS
gent
0.32
/div
0.30
diver
0.22
gence
0.21
-div
0.21
gent
0.17
divergence
0.17
Div
0.16
erse
0.16
ided
0.16
Activations Density 0.012%