INDEX
Explanations
phrases related to product performance and consumer behavior
New Auto-Interp
Negative Logits
ſever
-0.59
–
-0.57
tranſ
-0.55
ſur
-0.55
juſt
-0.52
faſt
-0.52
viſ
-0.51
eighty
-0.50
ſtand
-0.50
Escherichia
-0.48
POSITIVE LOGITS
ppl
1.18
iirc
1.06
IIRC
1.05
prolly
1.04
shitty
0.97
idk
0.91
IMO
0.91
tbh
0.90
crappy
0.90
Idk
0.89
Activations Density 2.352%