INDEX
Explanations
references to the concept of "home" and related actions or states
"home" followed by prepositions or punctuation
returning home
New Auto-Interp
Negative Logits
loom
-0.38
leap
-0.37
formations
-0.37
omo
-0.36
thrift
-0.35
dun
-0.35
simil
-0.35
قد
-0.34
curbs
-0.34
havan
-0.34
POSITIVE LOGITS
pulang
0.97
homeward
0.95
thuis
0.80
回家
0.79
Returning
0.79
repatri
0.78
returning
0.77
帰宅
0.74
home
0.74
домой
0.72
Activations Density 0.224%