INDEX
Explanations
proper nouns related to drinks and characters
names or references to specific entities, particularly those related to "Dra."
New Auto-Interp
Negative Logits
Monarch
-0.74
Clover
-0.71
Butterfly
-0.71
Tune
-0.69
hex
-0.66
Hedge
-0.64
Akron
-0.63
Crane
-0.63
Sussex
-0.63
prom
-0.63
POSITIVE LOGITS
rys
0.94
enei
0.91
sten
0.90
izens
0.90
ildo
0.88
cffff
0.86
ven
0.85
izen
0.85
vana
0.84
thur
0.83
Activations Density 0.023%