INDEX
Explanations
phrases indicating a sequence of events or actions
phrases that start with "Having," which implies a focus on past experiences or actions
Negative Logits
etter        -0.64
Echo         -0.62
Slovakia     -0.61
Interested   -0.60
Avalanche    -0.58
tel          -0.57
kicker       -0.57
etting       -0.57
ère          -0.56
VR           -0.56
Positive Logits
undergone    1.18
been         1.14
eaten        0.97
gotten       0.93
begun        0.90
seen         0.90
gone         0.89
arisen       0.87
learnt       0.86
tasted       0.86
Activations Density 0.031%
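
The density figure above is commonly read as the fraction of token positions on which the feature fires at all. The sketch below shows that arithmetic on made-up data. The function name `activation_density`, the zero threshold, and the sample list are illustrative assumptions, not the dashboard's actual computation.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions with a non-zero
    (above-threshold) activation; the source page does not define it.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 3 of which are active.
sample = [0.0] * 9997 + [0.8, 1.2, 0.4]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.030%
```

Under this reading, a density of 0.031% would correspond to roughly 3 active positions per 10,000 tokens.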