INDEX
Explanations
phrases indicating dashed hopes or aspirations
words related to hopes and aspirations
New Auto-Interp
Negative Logits
proof
-0.77
mint
-0.74
ded
-0.72
ICA
-0.71
mans
-0.71
onel
-0.69
ctors
-0.69
nce
-0.68
pmwiki
-0.67
phys
-0.63
POSITIVE LOGITS
pring
1.21
omething
1.09
peed
1.07
cape
1.01
ystem
1.00
hops
0.98
hooting
0.93
chool
0.93
mith
0.92
creen
0.90
Activations Density 0.111%