INDEX
Explanations
phrases related to returning or going home
references to the concept of "home."
New Auto-Interp
Negative Logits
"]=>
-0.82
umbers
-0.68
gau
-0.65
gauge
-0.64
ickr
-0.64
immers
-0.64
yz
-0.62
amen
-0.61
enance
-0.61
ggles
-0.60
POSITIVE LOGITS
safely
0.90
ported
0.77
stairs
0.77
right
0.75
opath
0.74
rox
0.74
nikov
0.73
joy
0.72
rehend
0.71
stead
0.70
Activations Density 0.022%