INDEX
Explanations
the word "Du" followed by numbers
mentions of the name "Du" in various contexts
New Auto-Interp
Negative Logits
ãģĨ
-0.75
Wanted
-0.73
mberg
-0.73
giving
-0.72
ISC
-0.71
nikov
-0.71
Locked
-0.68
Across
-0.68
ħĭ
-0.67
ãĤ¤ãĥĪ
-0.67
POSITIVE LOGITS
pee
0.95
pling
0.93
Du
0.90
ente
0.87
isine
0.86
pees
0.85
cci
0.85
Pont
0.84
ples
0.83
alog
0.82
Activations Density 0.005%