INDEX
Explanations
references to lively events or festive occasions, particularly those related to celebrations or parties
Negative Logits
,      -0.53
caffè  -0.47
in     -0.45
.      -0.44
f      -0.43
Tse    -0.41
Y      -0.41
(      -0.41
↵      -0.41
and    -0.41
Positive Logits
pleaſure         0.98
myſelf           0.89
EconPapers       0.88
LookAnd          0.87
Diſ              0.87
Theſe            0.87
Reſ              0.86
Jefus            0.86
ñata             0.84
IntoConstraints  0.83
Activations Density 0.131%