INDEX
Explanations
phrases suggesting speculation or uncertainty
New Auto-Interp
Negative Logits
wait
-0.69
anding
-0.69
osi
-0.67
catentry
-0.65
aris
-0.62
etting
-0.59
plaintiff
-0.59
center
-0.59
ilant
-0.58
leaf
-0.58
POSITIVE LOGITS
been
1.39
gotten
1.33
been
1.20
gotten
1.12
undergone
1.10
gone
1.10
Been
1.07
originated
1.06
begun
1.04
contributed
1.03
Activations Density 0.091%