INDEX
Explanations
phrases related to something being new, recent, or just starting
variations of the word "fresh" in various contexts
New Auto-Interp
Negative Logits
rael
-0.75
Ĥİ
-0.65
adian
-0.62
oried
-0.62
aylor
-0.61
Donation
-0.60
orem
-0.59
owered
-0.59
incorrectly
-0.59
Loren
-0.59
POSITIVE LOGITS
ness
1.20
lish
1.04
lings
0.81
water
0.81
foundland
0.80
scratch
0.77
fresh
0.76
lins
0.75
cit
0.75
lic
0.75
Activations Density 0.033%