INDEX
Explanations
words related to initial actions or stages of events
the recurring use of the word "initially."
Negative Logits
rf                -0.77
masters           -0.77
Vish              -0.70
Chef              -0.68
cig               -0.67
rom               -0.67
=-=-=-=-=-=-=-=-  -0.67
laws              -0.65
gerald            -0.64
Cth               -0.64
Positive Logits
responders      0.89
blush           0.87
conceived       0.80
hesitated       0.80
unsuccessfully  0.78
stages          0.78
appeared        0.77
solic           0.77
plotted         0.76
initialized     0.74
Activations Density 0.013%