INDEX
Explanations
the word "what"
the pronoun "what" followed by a verb or a noun
occurrences of the word "what."
New Auto-Interp
Negative Logits
ster
-0.67
trop
-0.67
fish
-0.66
robe
-0.66
uttering
-0.63
ped
-0.62
Gy
-0.61
por
-0.61
eer
-0.59
ohyd
-0.59
POSITIVE LOGITS
soever
1.24
happens
1.09
happened
1.06
sorts
1.00
happ
0.98
kinds
0.97
transpired
0.91
exactly
0.84
else
0.79
constitutes
0.78
Activations Density 0.141%