Explanations
phrases indicating placement or positioning of objects or people
instances of the word "placed."
Negative Logits
maxwell       -0.73
rick          -0.65
bart          -0.65
docs          -0.63
stay          -0.63
filibuster    -0.61
lin           -0.61
ibus          -0.61
din           -0.59
Wikipedia     -0.59
Positive Logits
placed        3.42
placed        1.97
positioned    1.87
placing       1.72
inserted      1.63
situated      1.47
planted       1.44
placement     1.44
erected       1.41
implanted     1.35
Activations Density 0.024%
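
For a sense of scale, a density of 0.024% corresponds to roughly 24 active token positions per 100,000. The sketch below assumes that density means the fraction of sampled token positions on which the feature's activation is above zero; the page itself does not state this definition. The function name `activation_density`, the threshold, and the sample data are hypothetical and purely illustrative.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 24 of which are active.
sample = [0.0] * 99_976 + [1.5] * 24
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.024%
```

Under this reading, the feature is very sparse: it is inactive on the overwhelming majority of tokens, which fits a feature that responds mainly to "placed" and close synonyms.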