INDEX
Explanations
instances of the word "pants" in the text
occurrences of the word "pants"
New Auto-Interp
Negative Logits
Reviewed
-0.74
Flavoring
-0.67
=-=-=-=-
-0.67
Galile
-0.66
AUT
-0.66
Fund
-0.64
Democr
-0.64
SPONSORED
-0.64
Canaver
-0.64
ILY
-0.63
POSITIVE LOGITS
pants
1.19
trousers
1.16
pants
1.14
uits
1.03
bag
1.00
leeve
1.00
belt
0.98
uit
0.95
jeans
0.94
shirt
0.94
Activations Density 0.009%