INDEX
Explanations
the word "premium" and words that often appear nearby like "up"
Negative Logits
Token      Logit
__))       -0.77
ⓧ          -0.71
:+:        -0.63
);?>       -0.60
setup      -0.56
,          -0.56
__);       -0.55
(          -0.55
を受けて    -0.54
LikeLike   -0.54
Positive Logits
Token      Logit
auffi      1.06
Efq        1.00
faſt       0.96
pleaſure   0.95
Monfieur   0.94
ainfi      0.91
ſeveral    0.90
Theſe      0.89
Houſe      0.89
myſelf     0.89
Activations Density: 0.374%