INDEX
Explanations
expressions of personal experiences and relationships with places
New Auto-Interp
Negative Logits
Golf
-0.22
golf
-0.21
hotels
-0.17
hotel
-0.16
vill
-0.16
gol
-0.15
å¾ħ
-0.15
Hotels
-0.15
Hilton
-0.15
ugar
-0.15
POSITIVE LOGITS
Backpack
0.33
backpack
0.32
dorm
0.28
hostel
0.28
Dorm
0.25
shared
0.24
shared
0.24
Host
0.23
Shared
0.23
Shared
0.22
Activations Density 0.024%