INDEX
Explanations
phrases involving the word "pants"
New Auto-Interp
Negative Logits
Scotia
-0.81
Flavoring
-0.75
KNOWN
-0.75
Galile
-0.71
inosaur
-0.70
Kern
-0.67
illery
-0.67
rian
-0.65
rious
-0.63
Democr
-0.63
POSITIVE LOGITS
trousers
1.10
jeans
1.07
hirt
1.06
pants
1.05
sleeves
1.01
straps
0.99
uits
0.97
worn
0.97
pins
0.97
bag
0.95
Activations Density 0.028%