Explanations
the phrase "all the time"
the repeated phrase "all the time."
Negative Logits
Token       Logit
ync         -0.73
FUL         -0.66
PLA         -0.63
ful         -0.62
ndum        -0.60
arium       -0.60
Listener    -0.59
eming       -0.58
mma         -0.58
SPONSORED   -0.58
Positive Logits
Token       Logit
way         1.31
time        1.06
ways        0.93
WAY         0.91
WAY         0.89
goddamn     0.87
way         0.86
same        0.85
time        0.80
together    0.79
Activations Density 0.031%