INDEX
Explanations
instances of the word "wonder" and its derivatives, indicating a focus on curiosity and contemplation
New Auto-Interp
Negative Logits
DataManager
-0.15
ssue
-0.14
edb
-0.14
γγ
-0.14
redo
-0.14
绾
-0.13
ces
-0.13
she
-0.13
wy
-0.13
ulated
-0.13
POSITIVE LOGITS
atoria
0.21
ÑĢаÑģÑĤ
0.16
ous
0.15
ocks
0.14
ocker
0.14
ala
0.14
haf
0.14
oad
0.14
anka
0.14
ziej
0.14
Activations Density 0.011%