INDEX
Explanations
phrases related to various scenarios, including experiences with certain conditions and discussing topics in a nuanced or precise way
instances of the word "have."
Negative Logits
Apart         -0.75
catentry      -0.64
icking        -0.58
inance        -0.56
oshi          -0.56
fiasco        -0.55
arter         -0.55
debacle       -0.54
osa           -0.54
settlement    -0.54
Positive Logits
been          1.36
been          1.15
Been          1.06
gotten        1.02
seen          0.93
undergone     0.92
gone          0.88
gotten        0.88
begun         0.87
kell          0.87
Activations Density 0.316%