INDEX
Explanations
instances where the word "Whether" appears in the text
instances of the word "Whether"
New Auto-Interp
Negative Logits
GM
-0.73
rome
-0.72
ursed
-0.69
idelines
-0.69
Downs
-0.69
vention
-0.67
WB
-0.64
idates
-0.64
idden
-0.64
acter
-0.64
POSITIVE LOGITS
soever
1.20
theless
0.90
nodd
0.74
consciously
0.73
intentional
0.66
terday
0.63
assetsadobe
0.63
Cly
0.62
whether
0.62
cli
0.61
Activations Density 0.025%