INDEX
Explanations
phrases indicating depth or additional layers of meaning beyond the surface level
references to the concept of depth or deeper understanding
New Auto-Interp
Negative Logits
advertising
-0.81
ULE
-0.78
EED
-0.76
WATCHED
-0.76
Counter
-0.74
Athlet
-0.72
SR
-0.71
EVA
-0.71
DOM
-0.68
Guard
-0.68
POSITIVE LOGITS
deeper
0.96
depth
0.94
penetration
0.93
depths
0.92
insight
0.88
vein
0.85
maturity
0.83
penet
0.81
ingred
0.80
layers
0.80
Activations Density 0.005%