Explanations
references to the concept of "home" and the search for belonging
Negative Logits
Token       Logit
osten       -0.15
usch        -0.14
clin        -0.14
rax         -0.13
phan        -0.13
Hiring      -0.13
.localized  -0.13
Basement    -0.13
entes       -0.13
OLON        -0.13
Positive Logits
Token       Logit
home        0.62
home        0.50
homes       0.46
HOME        0.45
-home       0.43
.home       0.41
Home        0.41
Home        0.40
_home       0.39
HOME        0.39
Activations Density: 0.149%