INDEX
Explanations
phrases indicating searching or attempting different things
phrases that involve the action of seeking or looking for something
New Auto-Interp
Negative Logits
cious
-0.75
aving
-0.73
antry
-0.72
istor
-0.67
emi
-0.65
parable
-0.64
princip
-0.63
fabrication
-0.63
VA
-0.63
eries
-0.63
POSITIVE LOGITS
fitted
0.74
grown
0.70
casts
0.68
è£ıè
0.67
skirts
0.67
stretched
0.66
Eck
0.65
ITNESS
0.65
)=(
0.64
sites
0.61
Activations Density 0.024%