INDEX
Explanations
phrases expressing hope or aspiration
New Auto-Interp
Negative Logits
iÄĻ
-0.07
chron
-0.07
icky
-0.07
amps
-0.06
aidu
-0.06
onian
-0.06
éļª
-0.06
erk
-0.06
weis
-0.06
ÙĪØ±ÙĬØ©
-0.06
POSITIVE LOGITS
idla
0.06
idis
0.06
id
0.06
à¸ģรรม
0.06
ÙĦØŃ
0.06
continued
0.06
luder
0.06
äd
0.06
outcome
0.06
ipi
0.06
Activations Density 0.009%