INDEX
Explanations
statements starting with "You can't"
instances of the phrase "you can't" or variations thereof
New Auto-Interp
Negative Logits
çīĪ
-0.75
RAD
-0.74
Palest
-0.71
enegger
-0.71
BuyableInstoreAndOnline
-0.70
Sandwich
-0.66
Gleaming
-0.65
Circus
-0.63
theaters
-0.62
colonial
-0.61
POSITIVE LOGITS
£
1.02
Ĵ
0.96
lege
0.95
Ķ
0.95
«
0.95
ı
0.93
º
0.91
ĵ
0.91
ij
0.91
¨
0.90
Activations Density 0.115%