INDEX
Explanations
instances of "what" followed by a comparison or contrast in the text
instances of the word "what" in various contexts
New Auto-Interp
Negative Logits
enburg
-0.69
ster
-0.65
eer
-0.62
jee
-0.62
robe
-0.61
ature
-0.60
adden
-0.59
Sao
-0.59
Gamb
-0.58
isher
-0.58
POSITIVE LOGITS
happened
1.15
happens
1.15
soever
1.12
transpired
1.02
happ
0.96
constitutes
0.94
kinds
0.90
constituted
0.87
sorts
0.85
else
0.83
Activations Density 0.105%