INDEX
Explanations
phrases marked with the abbreviation 'WH'
occurrences of repeated or emphasized "WH" phrases and the term "Monitor."
Negative Logits
opausal       -0.64
orbiting      -0.64
default       -0.64
shred         -0.64
racer         -0.63
bru           -0.63
fucking       -0.62
Rolls         -0.62
regression    -0.61
shuff         -0.61
Positive Logits
WH            2.33
Monitor       1.65
azar          1.27
itta          1.11
ritz          0.99
Screen        0.96
CHR           0.94
ACP           0.94
YA            0.84
Chambers      0.81
Activations Density 0.023%