INDEX
Explanations
references to travel and tourism activities
New Auto-Interp
Negative Logits
Drain
-0.15
drained
-0.15
parachute
-0.15
drain
-0.15
arsi
-0.14
soup
-0.14
_fault
-0.14
soup
-0.14
fly
-0.14
ledge
-0.13
POSITIVE LOGITS
fer
0.39
ferry
0.29
Fer
0.28
crossings
0.26
Ferry
0.25
crossing
0.24
fer
0.24
ferr
0.24
Crossing
0.23
cross
0.21
Activations Density 0.017%