INDEX
Explanations
occurrences of the word "trip" and its variations, indicating travel-related content
New Auto-Interp
Negative Logits
orra
-0.18
hed
-0.17
yne
-0.15
yps
-0.15
ween
-0.14
eper
-0.14
wend
-0.14
iron
-0.14
acen
-0.14
emie
-0.14
POSITIVE LOGITS
licate
0.21
advisor
0.20
Advisor
0.18
|required
0.16
ernel
0.15
åΰçļĦ
0.15
nack
0.14
addslashes
0.14
dale
0.14
ugh
0.14
Activations Density 0.018%