INDEX
Explanations
references to swimming pools and related amenities
New Auto-Interp
Negative Logits
èn
-0.15
Kind
-0.15
confess
-0.14
Smoke
-0.14
çĭ
-0.13
ìĹĨëĬĶ
-0.13
lon
-0.13
543
-0.13
Zug
-0.13
cons
-0.13
POSITIVE LOGITS
bed
0.15
side
0.15
кав
0.15
chop
0.15
ofs
0.15
erman
0.14
otp
0.14
right
0.14
tel
0.14
atsby
0.14
Activations Density 0.017%