INDEX
Explanations
phrases indicating a question or point of interest
the repetitive use of the word "what."
Negative Logits
Token      Logit
robe       -0.72
ster       -0.67
fish       -0.67
Gy         -0.64
ped        -0.63
uttering   -0.63
ohyd       -0.62
trop       -0.62
oyer       -0.62
cean       -0.61
Positive Logits
Token        Logit
soever       1.26
happens      1.08
happened     1.05
sorts        1.02
happ         0.99
kinds        0.94
transpired   0.92
exactly      0.86
else         0.76
constitutes  0.75
Activations Density 0.129%