INDEX
Explanations
mentions of the name "Derek."
the name "Derek" and its variants in various contexts
Negative Logits
izations      -1.00
fully         -0.88
ropolitan     -0.82
ocobo         -0.81
hemat         -0.78
hips          -0.77
actionGroup   -0.76
isec          -0.75
akens         -0.75
fulness       -0.75
Positive Logits
lers          0.78
irection      0.72
Stadium       0.72
pler          0.69
Carr          0.69
ynamic        0.68
Skywalker     0.68
Bok           0.67
ble           0.66
Clicker       0.65
Activations Density: 0.058%